Novel Proteins and Nucleic Acids Encoding Same



FIELD OF THE INVENTION 

The present invention relates to novel polypeptides that are targets of small
molecule drugs and that have properties related to stimulation of biochemical or
physiological responses in a cell, a tissue, an organ or an organism. More particularly, the
novel polypeptides are gene products of novel genes, or are specified biologically active
fragments or derivatives thereof. Methods of use encompass diagnostic and prognostic
assay procedures as well as methods of treating diverse pathological conditions.

BACKGROUND

Eukaryotic cells are characterized by biochemical and physiological processes
which under normal conditions are exquisitely balanced to achieve the preservation and
propagation of the cells. When such cells are components of multicellular organisms such
as vertebrates, or more particularly organisms such as mammals, the regulation of the
biochemical and physiological processes involves intricate signaling pathways. Frequently,
such signaling pathways are constituted of extracellular signaling proteins, cellular
receptors that bind the signaling proteins and signal transducing components located
within the cells.

Signaling proteins may be classified as endocrine effectors, paracrine effectors or
autocrine effectors. Endocrine effectors are signaling molecules secreted by a given organ
into the circulatory system, which are then transported to a distant target organ or tissue.
The target cells include the receptors for the endocrine effector, and when the endocrine
effector binds, a signaling cascade is induced. Paracrine effectors involve secreting cells
and receptor cells in close proximity to each other, for example two different classes of
cells in the same tissue or organ. One class of cells secretes the paracrine effector, which
then reaches the second class of cells, for example by diffusion through the extracellular
fluid. The second class of cells contains the receptors for the paracrine effector; binding
of the effector results in induction of the signaling cascade that elicits the corresponding
biochemical or physiological effect. Autocrine effectors are highly analogous to paracrine
effectors, except that the same cell type that secretes the autocrine effector also contains
the receptor. Thus the autocrine effector binds to receptors on the same cell, or on
identical neighboring cells. The binding process then elicits the characteristic biochemical
or physiological effect.

Signaling processes may elicit a variety of effects on cells and tissues including, by
way of nonlimiting example, induction of cell or tissue proliferation, suppression of growth
or proliferation, induction of differentiation or maturation of a cell or tissue, and
suppression of differentiation or maturation of a cell or tissue.

Many pathological conditions involve dysregulation of expression of important
effector proteins. In certain classes of pathologies the dysregulation is manifested as a
diminished or suppressed level of synthesis and secretion of protein effectors. In a clinical
setting a subject may be suspected of suffering from a condition brought on by diminished
or suppressed levels of a protein effector of interest. Therefore there is a need to be able to
assay for the level of the protein effector of interest in a biological sample from such a
subject, and to compare the level with that characteristic of a nonpathological condition.
There further is a need to provide the protein effector as a product of manufacture.
Administration of the effector to a subject in need thereof is useful in treatment of the
pathological condition, or the protein effector deficiency or suppression may be favorably
acted upon by the administration of another small molecule drug product. Accordingly,
there is a need for a method of treatment of a pathological condition brought on by
diminished or suppressed levels of the protein effector of interest.

Small molecule targets have been implicated in various disease states or
pathologies. These targets may be proteins, and particularly enzymatic proteins, which are
acted upon by small molecule drugs for the purpose of altering target function and
achieving a desired result. Cellular, animal and clinical studies can be performed to
elucidate the genetic contribution to the etiology and pathogenesis of conditions in which
small molecule targets are implicated in a variety of physiologic, pharmacologic or native
states. These studies utilize the core technologies at CuraGen Corporation to look at
differential gene expression, protein-protein interactions, large-scale sequencing of
expressed genes and the association of genetic variations such as, but not limited to, single
nucleotide polymorphisms (SNPs) or splice variants in and between biological samples
from experimental and control groups. The goal of such studies is to identify potential
avenues for therapeutic intervention in order to prevent the conditions, treat their
consequences, or cure them.

In order to treat diseases, pathologies and other abnormal states or conditions in
which a mammalian organism has been diagnosed as being, or as being at risk for
becoming, other than in a normal state or condition, it is important to identify new
therapeutic agents. Such a procedure includes at least the steps of identifying a target
component within an affected tissue or organ, and identifying a candidate therapeutic
agent that modulates the functional attributes of the target. The target component may be
any biological macromolecule implicated in the disease or pathology. Commonly the
target is a polypeptide or protein with specific functional attributes. Other classes of
target macromolecule include a nucleic acid, a polysaccharide, and a lipid such as a complex
lipid or a glycolipid; in addition a target may be a sub-cellular structure or extra-cellular
structure that is composed of more than one of these classes of macromolecule. Once such a
target has been identified, it may be employed in a screening assay in order to identify
favorable candidate therapeutic agents from among a large population of substances or
compounds. In many cases the objective of such screening assays is to identify small molecule
candidates; this is commonly approached by the use of combinatorial methodologies to
develop the population of substances to be tested. The implementation of high throughput
screening methodologies is advantageous when working with large, combinatorial
libraries of compounds.

SUMMARY OF THE INVENTION

The invention is based in part upon the discovery of nucleic acid sequences
encoding novel polypeptides. These nucleic acids and polypeptides, as well as derivatives,
homologs, analogs and fragments thereof, will hereinafter be collectively designated as
"NOVX" nucleic acids, which represent the nucleotide sequences selected from the group
consisting of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, or "NOVX"
polypeptide sequences, which represent the group consisting of SEQ ID NO: 2n, wherein
n is an integer between 1 and 101.
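
The numbering convention stated above (nucleic acid sequences at SEQ ID NO: 2n-1 and polypeptide sequences at SEQ ID NO: 2n, for n from 1 to 101) can be illustrated with a minimal sketch. The function name `seq_id_numbers` is hypothetical and is introduced here only for illustration; it is not part of the specification.

```python
from __future__ import annotations


def seq_id_numbers(n: int) -> tuple[int, int]:
    """Return (nucleic acid SEQ ID NO, polypeptide SEQ ID NO) for index n.

    Hypothetical helper: it only encodes the 2n-1 / 2n convention
    stated in the text, with n restricted to the range 1..101.
    """
    if not 1 <= n <= 101:
        raise ValueError("n must be an integer between 1 and 101")
    return 2 * n - 1, 2 * n


# The first and last pairs covered by the stated range.
first_pair = seq_id_numbers(1)
last_pair = seq_id_numbers(101)
```

Under this convention every odd SEQ ID NO designates a nucleic acid and the next even number designates the polypeptide it encodes.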

In one aspect, the invention provides an isolated polypeptide comprising a mature
form of a NOVX amino acid sequence. One example is a variant of a mature form of a NOVX
amino acid sequence, wherein any amino acid in the mature form is changed to a different
amino acid, provided that no more than 15% of the amino acid residues in the sequence of
the mature form are so changed. The amino acid sequence can be, for example, a NOVX amino
acid sequence or a variant of a NOVX amino acid sequence, wherein any amino acid
specified in the chosen sequence is changed to a different amino acid, provided that no
more than 15% of the amino acid residues in the sequence are so changed. The invention
also includes fragments of any of these. In another aspect, the invention also includes an
isolated nucleic acid that encodes a NOVX polypeptide, or a fragment, homolog, analog or
derivative thereof.
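
The 15% limit on changed residues can be checked mechanically. The following is a minimal sketch under stated assumptions: the variant differs from the reference by substitutions only, so both sequences have the same length and are compared position by position without alignment. The function names and the example sequences are hypothetical.

```python
def count_changed_residues(reference: str, variant: str) -> int:
    """Count positions at which the variant differs from the reference.

    Assumes a substitution-only variant: equal lengths, no insertions
    or deletions, and therefore no alignment step.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    if not reference:
        raise ValueError("sequences must not be empty")
    return sum(1 for ref, var in zip(reference, variant) if ref != var)


def within_change_limit(reference: str, variant: str, max_percent: int = 15) -> bool:
    """True if no more than max_percent of the residues are changed."""
    changed = count_changed_residues(reference, variant)
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return changed * 100 <= max_percent * len(reference)


# Hypothetical 20-residue example: 3 changes is exactly 15%, 4 changes is 20%.
reference = "MKTAYIAKQRQISFVKSHFS"
three_changes = "AAAAYIAKQRQISFVKSHFS"
four_changes = "AAAAAIAKQRQISFVKSHFS"
```

With these inputs `within_change_limit(reference, three_changes)` holds and `within_change_limit(reference, four_changes)` does not. Variants that include insertions or deletions would need a sequence alignment first, which this sketch does not attempt.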

Also included in the invention is a NOVX polypeptide that is a naturally occurring
allelic variant of a NOVX sequence. In one embodiment, the allelic variant includes an
amino acid sequence that is the translation of a nucleic acid sequence differing by a single
nucleotide from a NOVX nucleic acid sequence. In another embodiment, the NOVX
polypeptide is a variant polypeptide described therein, wherein any amino acid specified in
the chosen sequence is changed to provide a conservative substitution. In one
embodiment, the invention discloses a method for determining the presence or amount of
the NOVX polypeptide in a sample. The method involves the steps of: providing a
sample; introducing the sample to an antibody that binds immunospecifically to the
polypeptide; and determining the presence or amount of antibody bound to the NOVX
polypeptide, thereby determining the presence or amount of the NOVX polypeptide in the
sample. In another embodiment, the invention provides a method for determining the
presence of or predisposition to a disease associated with altered levels of a NOVX
polypeptide in a first mammalian subject. This method involves the steps of: measuring the
level of expression of the polypeptide in a sample from the first mammalian subject; and
comparing the amount of the polypeptide in the sample of the first step to the amount of
the polypeptide present in a control sample from a second mammalian subject known not
to have, or not to be predisposed to, the disease, wherein an alteration in the expression
level of the polypeptide in the first subject as compared to the control sample indicates the
presence of or predisposition to the disease.
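
The comparison step of the diagnostic method can be sketched as follows. The text does not say how large a difference counts as an "alteration", so the two-fold threshold used here is purely an assumption for illustration, as are the function and parameter names.

```python
def expression_altered(subject_level: float,
                       control_level: float,
                       min_fold_change: float = 2.0) -> bool:
    """Flag an altered expression level relative to a control sample.

    Assumption: an "alteration" means the subject/control ratio is at
    least min_fold_change up or down. The threshold is hypothetical;
    the specification does not define one.
    """
    if subject_level < 0 or control_level <= 0:
        raise ValueError("levels must be non-negative and the control positive")
    if min_fold_change < 1:
        raise ValueError("min_fold_change must be at least 1")
    ratio = subject_level / control_level
    return ratio >= min_fold_change or ratio <= 1.0 / min_fold_change
```

In practice the threshold, the measurement units and any statistical treatment of replicate samples would depend on the assay used.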

In a further embodiment, the invention includes a method of identifying an agent
that binds to a NOVX polypeptide. This method involves the steps of: introducing the
polypeptide to the agent; and determining whether the agent binds to the polypeptide. In
various embodiments, the agent is a cellular receptor or a downstream effector.

In another aspect, the invention provides a method for identifying a potential
therapeutic agent for use in treatment of a pathology, wherein the pathology is related to
aberrant expression or aberrant physiological interactions of a NOVX polypeptide. The
method involves the steps of: providing a cell expressing the NOVX polypeptide and
having a property or function ascribable to the polypeptide; contacting the cell with a
composition comprising a candidate substance; and determining whether the substance
alters the property or function ascribable to the polypeptide; whereby, if an alteration
observed in the presence of the substance is not observed when the cell is contacted with a
composition devoid of the substance, the substance is identified as a potential therapeutic
agent. In another aspect, the invention describes a method for screening for a modulator
of activity or of latency or predisposition to a pathology associated with the NOVX
polypeptide. This method involves the following steps: administering a test compound to
a test animal at increased risk for a pathology associated with the NOVX polypeptide,
wherein the test animal recombinantly expresses the NOVX polypeptide; measuring the
activity of the NOVX polypeptide in the test animal after administering the compound;
and comparing the activity of the protein in the
test animal with the activity of the NOVX polypeptide in a control animal not
administered the polypeptide, wherein a change in the activity of the NOVX polypeptide
in the test animal relative to the control animal indicates the test compound is a modulator
of latency of, or predisposition to, a pathology associated with the NOVX polypeptide. In
one embodiment, the test animal is a recombinant test animal that expresses a test protein
transgene or expresses the transgene under the control of a promoter at an increased level
relative to a wild-type test animal, and wherein the promoter is not the native gene
promoter of the transgene. In another aspect, the invention includes a method for
modulating the activity of the NOVX polypeptide, the method comprising contacting a
cell sample expressing the NOVX polypeptide with a compound that binds to the
polypeptide in an amount sufficient to modulate the activity of the polypeptide.

The invention also includes an isolated nucleic acid that encodes a NOVX
polypeptide, or a fragment, homolog, analog or derivative thereof. In a preferred
embodiment, the nucleic acid molecule comprises the nucleotide sequence of a naturally
occurring allelic nucleic acid variant. In another embodiment, the nucleic acid encodes a
variant polypeptide, wherein the variant polypeptide has the polypeptide sequence of a
naturally occurring polypeptide variant. In another embodiment, the nucleic acid molecule
differs by a single nucleotide from a NOVX nucleic acid sequence. In one embodiment,
the NOVX nucleic acid molecule hybridizes under stringent conditions to the nucleotide
sequence selected from the group consisting of SEQ ID NO: 2n-1, wherein n is an integer
between 1 and 101, or a complement of the nucleotide sequence. In another aspect, the
invention provides a vector or a cell expressing a NOVX nucleotide sequence.

In one embodiment, the invention discloses a method for modulating the activity of
a NOVX polypeptide. The method includes the step of contacting a cell sample
expressing the NOVX polypeptide with a compound that binds to the polypeptide in an
amount sufficient to modulate the activity of the polypeptide. In another embodiment, the
invention includes an isolated NOVX nucleic acid molecule comprising a nucleic acid
sequence encoding a polypeptide comprising a NOVX amino acid sequence or a variant of
a mature form of the NOVX amino acid sequence, wherein any amino acid in the mature
form of the chosen sequence is changed to a different amino acid, provided that no more
than 15% of the amino acid residues in the sequence of the mature form are so changed.
In another embodiment, the invention includes an amino acid sequence that is a variant of
the NOVX amino acid sequence, in which any amino acid specified in the chosen
sequence is changed to a different amino acid, provided that no more than 15% of the
amino acid residues in the sequence are so changed.

In one embodiment, the invention discloses a NOVX nucleic acid fragment
encoding at least a portion of a NOVX polypeptide or any variant of the polypeptide,
wherein any amino acid of the chosen sequence is changed to a different amino acid,
provided that no more than 10% of the amino acid residues in the sequence are so
changed. In another embodiment, the invention includes the complement of any of the
NOVX nucleic acid molecules or a naturally occurring allelic nucleic acid variant. In
another embodiment, the invention discloses a NOVX nucleic acid molecule that encodes
a variant polypeptide, wherein the variant polypeptide has the polypeptide sequence of a
naturally occurring polypeptide variant. In another embodiment, the invention discloses a
NOVX nucleic acid, wherein the nucleic acid molecule differs by a single nucleotide from
a NOVX nucleic acid sequence.
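
Two of the relationships named above, the complement of a nucleic acid and a molecule that differs by a single nucleotide, can be sketched in a few lines. This is an illustration under assumptions: sequences are plain DNA strings over A, C, G and T, the complement is written in the conventional 5' to 3' orientation (the reverse complement), and the function names are hypothetical.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return sequence.upper().translate(_COMPLEMENT)[::-1]


def count_differences(first: str, second: str) -> int:
    """Count mismatched positions between two equal-length sequences."""
    if len(first) != len(second):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(first.upper(), second.upper()))


def differs_by_single_nucleotide(first: str, second: str) -> bool:
    """True if exactly one position differs (a single substitution)."""
    return count_differences(first, second) == 1
```

For example, `reverse_complement("ATGC")` gives `"GCAT"`, and `"ATGCCA"` and `"ATGCTA"` differ by a single nucleotide. Single-nucleotide insertions or deletions are outside the scope of this sketch.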

In another aspect, the invention includes a NOVX nucleic acid, wherein one or
more nucleotides in the NOVX nucleotide sequence are changed to a different nucleotide,
provided that no more than 15% of the nucleotides are so changed. In one embodiment,
the invention discloses a nucleic acid fragment of the NOVX nucleotide sequence and a
nucleic acid fragment wherein one or more nucleotides in the NOVX nucleotide sequence
are changed from those of the chosen sequence to a
different nucleotide, provided that no more than 15% of the nucleotides are so changed. In
another embodiment, the invention includes a nucleic acid molecule wherein the nucleic
acid molecule hybridizes under stringent conditions to a NOVX nucleotide sequence or a
complement of the NOVX nucleotide sequence. In one embodiment, the invention
includes a nucleic acid molecule, wherein the sequence is changed such that no more than
15% of the nucleotides in the coding sequence differ from the NOVX nucleotide sequence
or a fragment thereof.

In a further aspect, the invention includes a method for determining the presence
or amount of the NOVX nucleic acid in a sample. The method involves the steps of:
providing the sample; introducing the sample to a probe that binds to the nucleic acid
molecule; and determining the presence or amount of the probe bound to the NOVX
nucleic acid molecule, thereby determining the presence or amount of the NOVX nucleic
acid molecule in the sample. In one embodiment, the presence or amount of the nucleic
acid molecule is used as a marker for cell or tissue type.

In another aspect, the invention discloses a method for determining the presence of
or predisposition to a disease associated with altered levels of the NOVX nucleic acid
molecule in a first mammalian subject. The method involves the steps of: measuring
the amount of NOVX nucleic acid in a sample from the first mammalian subject; and
comparing the amount of the nucleic acid in the sample of the first step to the amount of
NOVX nucleic acid present in a control sample from a second mammalian subject known
not to have, or not to be predisposed to, the disease; wherein an alteration in the level of the
nucleic acid in the first subject as compared to the control sample indicates the presence of
or predisposition to the disease.

Unless otherwise defined, all technical and scientific terms used herein have the
same meaning as commonly understood by one of ordinary skill in the art to which this
invention belongs. Although methods and materials similar or equivalent to those
described herein can be used in the practice or testing of the present invention, suitable
methods and materials are described below. All publications, patent applications, patents,
and other references mentioned herein are incorporated by reference in their entirety. In
the case of conflict, the present specification, including definitions, will control. In
addition, the materials, methods, and examples are illustrative only and not intended to be
limiting.

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides novel nucleotides and polypeptides encoded
thereby. Included in the invention are the novel nucleic acid sequences, their encoded
polypeptides, antibodies, and other related compounds. The sequences are collectively
referred to herein as "NOVX nucleic acids" or "NOVX polynucleotides" and the
corresponding encoded polypeptides are referred to as "NOVX polypeptides" or "NOVX
proteins." Unless indicated otherwise, "NOVX" is meant to refer to any of the novel
sequences disclosed herein. Table 1 provides a summary of the NOVX nucleic acids and
their encoded polypeptides.



TABLE 1. Sequences and Corresponding SEQ ID Numbers

| NOVX Assignment | Internal Identification | SEQ ID NO (nucleic acid) | SEQ ID NO (amino acid) | Homology |
|---|---|---|---|---|
| 1a | CG100126-01 | 1 | 2 | KERATIN ASSOCIATED PROTEIN 4.9 |
|  | CG100146-01 |  |  | UDP-Glucuronosyltransferase 2B15 Precursor like homo sapiens |
|  | CG100179-01 |  |  | cyclophilin A like homo sapiens |
| 4a | CG100212-01 | 7 | 8 | Zinc-containing alcohol dehydrogenase like homo sapiens |
| 4b | CG100212-02 | 9 | 10 | Zinc-containing alcohol dehydrogenase like homo sapiens |
| 5a | CG100222-01 | 11 | 12 | NADP-Dependent Leukotriene B4 12-Hydroxydehydrogenase like homo sapiens |
| 6a | CG100266-01 | 13 | 14 | cyclophilin A like homo sapiens |
| 6b | CG100266-02 | 15 | 16 | cyclophilin A like homo sapiens |
|  | CG100427-01 |  |  | cyclophilin A like homo sapiens |
| 8a | CG100456-01 | 19 | 20 | CoA Transferase like homo sapiens |
| 9a | CG100466-01 | 21 | 22 | Novel Adenine Nucleotide Translocator 2 like homo sapiens |
| 10a | CG100609-01 | 23 | 24 | GLUTATHIONE S-TRANSFERASE THETA like homo sapiens |
| 11a | CG100631-01 | 25 | 26 | Clathrin Light Chain A like homo sapiens |
| 11b | CG100631-02 | 27 | 28 | Clathrin Light Chain A like homo sapiens |
| 11c | CG100631-04 | 29 | 30 | Clathrin Light Chain A like homo sapiens |
| 12a | CG100710-01 | 31 | 32 | AAA (ATPase Associated with various Activities) like homo sapiens |
| 13a | CG100730-01 | 33 | 34 | exoribonuclease like homo sapiens |
| 14a | CG100819-01 | 35 | 36 | POLYNUCLEOTIDE PHOSPHORYLASE like homo sapiens |
| 15a | CG100872-01 | 37 | 38 | PROTEIN-ARGININE DEIMINASE TYPE V like homo sapiens |
| 16a | CG100980-01 | 39 | 40 | PROTEIN-ARGININE DEIMINASE TYPE III like homo sapiens |
| 17a | CG172805-01 | 41 | 42 | Peroxisomal Enoyl-CoA Hydratase like homo sapiens |
| 18a | CG56763-01 | 43 | 44 | GPCR like homo sapiens |
| 19a | CG56777-01 | 45 | 46 | Prostaglandin-F synthase 1 like homo sapiens |
| 19b | CG56777-02 | 47 | 48 | Prostaglandin-F synthase 1 like homo sapiens |
| 20a | CG56941-01 | 49 | 50 | Ribonuclease H type II like homo sapiens |
| 21a | CG57109-01 | 51 | 52 | Doublecortin/CAM kinase like homo sapiens |
| 21b | CG57109-02 | 53 | 54 | Doublecortin/CAM kinase like homo sapiens (also been filed as Cura691A) |
| 21c | [illegible] | 55 | 56 | Doublecortin/CAM kinase like homo sapiens |
|  | CG57109-04 |  |  | Doublecortin/CAM kinase like homo sapiens |
| 21e | CG57109-05 | 59 | 60 | Doublecortin/CAM kinase like homo sapiens |
| 21f | CG57109-06 | 61 | 62 | Double Cortin |
| 22a | CG57366-01 | 63 | 64 | KIAA1223 like homo sapiens |
| 23a | CG57368-01 | 65 | 66 | Adenylate cyclase type IV like homo sapiens |
| 23b | CG57368-02 | 67 | 68 |  |
| 24a | CG59955-01 | 69 | 70 | GPCR like homo sapiens |
| 25a | CG89211-01 | 71 | 72 | GPCR like homo sapiens |
| 26a | CG90530-02 | 73 | 74 | UBIQUITIN-CONJUGATING ENZYME ISOLOG like homo sapiens |
| 27a | CG93076-01 | 75 | 76 | GPCR like homo sapiens |
|  | CG94235-01 |  |  | Thymidylate Kinase like homo sapiens |
| 28b | CG94235-02 | 79 | 80 | Thymidylate Kinase like homo sapiens |
| 29a | CG94692-01 | 81 | 82 | CARNITINE/ACYLCARNITINE TRANSLOCASE like homo sapiens |
| 29b | CG94692-02 | 83 | 84 | CARNITINE/ACYLCARNITINE TRANSLOCASE like homo sapiens |
| 30a | CG94724-01 | 85 | 86 | CARNITINE/ACYLCARNITINE TRANSLOCASE like homo sapiens |
| 31a | CG94871-01 | 87 | 88 | Josephin MJD1 like homo sapiens |
| 31b | CG94871-02 | 89 | 90 | Josephin MJD1 like homo sapiens |
| 31c | CG94871-04 | 91 | 92 | Josephin MJD1 like homo sapiens |
| 31d | CG94871-05 | 93 | 94 | Josephin MJD1 like homo sapiens |
| 32a | CG94946-01 | 95 | 96 | Agrin Precursor |
| 32b | CG94946-02 | 97 | 98 | Agrin Precursor |
| 32c | CG94946-03 | 99 | 100 | Agrin Precursor |
| 32d | CG94946-04 | 101 | 102 | Agrin Precursor |
| 32e | CG94946-05 | 103 | 104 | Agrin Precursor |
| 32f | CG94946-06 | 105 | 106 | Agrin Precursor |
| 32g | CG94946-07 | 107 | 108 | Agrin Precursor |
| 33a | CG95165-01 | 109 | 110 | ADENYLATE CYCLASE, TYPE II like homo sapiens |
| 34a | CG95175-01 | 111 | 112 | Ephrin type A Receptor like homo sapiens |
| 35a | CG95693-01 | 113 | 114 | DPY-19 PROTEIN like homo sapiens |
| 36a | CG95814-01 | 115 | 116 | RHO-GTPASE-ACTIVATING PROTEIN 4 like homo sapiens |
| 37a | CG95824-01 | 117 | 118 | Rho GAP like homo sapiens |
|  |  | 119 | 120 | Gonadotropin-releasing hormone receptor |
|  | [illegible] | 121 | 122 | Gonadotropin-releasing hormone receptor |
|  | [illegible] | 123 | 124 | OTU-like cysteine protease like homo sapiens |
|  | [illegible] | 125 | 126 | OTU-like cysteine protease like homo sapiens |
|  | CG96[illegible]60-01 | 127 | 128 | phospholipase A2 like homo sapiens |
| 41a | CG96364-01 | 129 | 130 | ADP/ATP TRANSLOCASE 2 like homo sapiens |
| 41b | CG96364-03 | 131 | 132 | ADP/ATP TRANSLOCASE 2 like homo sapiens |
| [illegible] | [illegible] | 133 | 134 | ADP/ATP translocase 3 like homo sapiens |
| 43a | CG96442-01 | 135 | 136 | Acyl CoA-domain protein like homo sapiens |
| 44a | CG96501-01 | 137 | 138 | BRAIN MITOCHONDRIAL CARRIER PROTEIN-1 like homo sapiens |
| 45a | CG96557-01 | 139 | 140 | L-ALLO-THREONINE ALDOLASE |
| 46a | CG96581-01 | 141 | 142 | RP42 |
| 46b | CG96581-02 | 143 | 144 | RP42 |
| 47a | CG96624-01 | 145 | 146 | Putative seven pass transmembrane protein |
| 47b | CG96624-02 | 147 | 148 | Putative seven pass transmembrane protein |
| 48a | CG96747-01 | 149 | 150 | VOLTAGE-DEPENDENT CALCIUM CHANNEL GAMMA-3 SUBUNIT |
| 49a | CG96789-01 | 151 | 152 | GLYCINE CLEAVAGE SYSTEM PROTEIN H |
| 50a | CG97253-01 | 153 | 154 | Galectin like homo sapiens |
| 51a | CG97400-01 | 155 | 156 | Myotubularin-related protein 6 like homo sapiens |
| 52a | CG97462-01 | 157 | 158 | Prohibitin like homo sapiens |
| 53a | CG97472-01 | 159 | 160 | Glucose Transporter like homo sapiens |
| 54a | CG97528-01 | 161 | 162 | Guanylate Binding Protein like homo sapiens |
| 55a | CG97629-01 | 163 | 164 | CELL DIVISION PROTEIN KINASE 7 like homo sapiens |
| 56a | CG97648-01 | 165 | 166 | G PROTEIN-COUPLED RECEPTOR KINASE GRK7 like homo sapiens |
| 57a | CG97658-01 | 167 | 168 | PROTEIN-TYROSINE PHOSPHATASE, NON-RECEPTOR TYPE 5 like homo sapiens |
| 58a | CG97842-01 | 169 | 170 | protein kinase-formin like homo sapiens |
| 59a | CG98021-01 | 171 | 172 | Synaptotagmin III like homo sapiens |
| 60a | CG98030-01 | 173 | 174 | Protein-tyrosine phosphatase like homo sapiens |
| 60b | CG98030-02 | 175 | 176 |  |
| 61a | CG98061-01 | 177 | 178 | Histidine acid phosphatase domain like homo sapiens |
| 61b | CG98061-02 | 179 | 180 | Novel Protein containing Histidine acid phosphatase domain-like Proteins and Nucleic Acids Encoding Same |
| 62a | CG98071-01 | 181 | 182 | Histidine acid phosphatase domain like homo sapiens |
| 63a | CG98131-01 | 183 | 184 | MDJ6 like homo sapiens |
| 64a | CG98164-01 | 185 | 186 | LRR and Kinase domain like homo sapiens |
| 64b | CG98164-02 | 187 | 188 | LRR and Kinase domain like homo sapiens |
| 65a | CG99588-01 | 189 | 190 | transmembrane protein like homo sapiens |
| 66a | CG99618-01 | 191 | 192 | PROTEIN-TYROSINE PHOSPHATASE 2C |
| 67a | CG99832-01 | 193 | 194 | gene containing NUDIX hydrolase domain like homo sapiens |
| 68a | CG99842-01 | 195 | 196 | Tensin |
| 69a | CG99944-01 | 197 | 198 | SUGAR ABC TRANSPORTER |
| 70a | CG99963-01 | 199 | 200 | cyclophilin 18 like homo sapiens |
| 70b | CG99963-02 | 201 | 202 | cyclophilin 18 like homo sapiens |



Table 1 indicates homology of NOVX nucleic acids to known protein families. 
Thus, the nucleic acids and polypeptides, antibodies and related compounds according to 
the invention corresponding to a NOVX as identified in column 1 of Table 1 will be useful 
in therapeutic and diagnostic applications implicated in, for example, pathologies and 
disorders associated with the known protein families identified in column 5 of Table 1. 

NOVX nucleic acids and their encoded polypeptides are useful in a variety of
applications and contexts. The various NOVX nucleic acids and polypeptides according
to the invention are useful as novel members of the protein families according to the
presence of domains and sequence relatedness to previously described proteins.
Additionally, NOVX nucleic acids and polypeptides can also be used to identify proteins
that are members of the family to which the NOVX polypeptides belong.

Consistent with other known members of the family of proteins, identified in
column 5 of Table 1, the NOVX polypeptides of the present invention show homology to,
and contain domains that are characteristic of, other members of such protein
families. Details of the sequence relatedness and domain analysis for each NOVX are
presented in Example A.

The NOVX nucleic acids and polypeptides can also be used to screen for
molecules that inhibit or enhance NOVX activity or function. Specifically, the nucleic
acids and polypeptides according to the invention may be used as targets for the
identification of small molecules that modulate or inhibit diseases associated with the
protein families listed in Table 1.

The NOVX nucleic acids and polypeptides are also useful for detecting specific
cell types. Details of the expression analysis for each NOVX are presented in Example C.
Accordingly, the NOVX nucleic acids, polypeptides, antibodies and related compounds
according to the invention will have diagnostic and therapeutic applications in the
detection of a variety of diseases with differential expression in normal vs. diseased
tissues, e.g. a variety of cancers.

Additional utilities for NOVX nucleic acids and polypeptides according to the
invention are disclosed herein.

The present invention is based on the identification of biological macromolecules
differentially modulated in a pathologic state, disease, or an abnormal condition or state.
The pathologies or diseases of present interest include metabolic diseases including
those related to endocrinologic disorders, cancers, various tumors and neoplasias,
inflammatory disorders, central nervous system disorders, and similar abnormal conditions
or states. In very significant embodiments of the present invention, the biological
macromolecules implicated in the pathologies and conditions are proteins and
polypeptides, and in such cases the present invention is related as well to the nucleic acids
that encode them. Methods that may be employed to identify relevant biological
macromolecules include any procedures that detect differential expression of nucleic acids
encoding proteins and polypeptides associated with the disorder, as well as procedures that
detect the respective proteins and polypeptides themselves. Significant methods that have
been employed by the present inventors include GeneCalling® technology and
SeqCalling™ technology, disclosed, respectively, in U.S. Patent No. 5,871,697, and in
U.S. Ser. No. 09/417,386, filed Oct. 13, 1999, each of which is incorporated herein by
reference in its entirety. GeneCalling® is also described in Shimkets, et al., "Gene
expression analysis by transcript profiling coupled to a gene database query" Nature
Biotechnology 17:798-803 (1999).

The invention provides polypeptides and nucleotides encoded thereby that have 
been identified as having novel associations with a disease or pathology, or an abnormal 
state or condition, in a mammal. The present invention further identifies a set of proteins
and polypeptides, including naturally occurring polypeptides, precursor forms or
proproteins, or mature forms of the polypeptides or proteins, which are implicated as
targets for therapeutic agents in the treatment of various diseases, pathologies, abnormal 
states and conditions. A target may be employed in any of a variety of screening 
methodologies in order to identify candidate therapeutic agents which interact with the
target and in so doing exert a desired or favorable effect. In an important embodiment of
the invention, the candidate therapeutic agent is identified by screening a large collection
of substances or compounds. Such a collection may comprise a combinatorial library of
substances or compounds in which, in at least one subset of substances or compounds, the 
individual members are related to each other by simple structural variations based on a
particular canonical or basic chemical structure. The variations may include, by way of
nonlimiting example, changes in length or identity of a basic framework of bonded atoms; 
changes in number, composition and disposition of ringed structures, bridge structures, 
alicyclic rings, and aromatic rings; and changes in pendent or substituent atoms or groups
that are bonded at particular positions to the basic framework of bonded atoms or to the
ringed structures, the bridge structures, the alicyclic structures, or the aromatic structures.

A polypeptide or protein described herein, and that serves as a target in the
screening procedure, includes the product of a naturally occurring polypeptide or 
precursor form or proprotein. The naturally occurring polypeptide, precursor or proprotein 
includes, e.g., the full-length gene product, encoded by the corresponding gene. The
naturally occurring polypeptide also includes the polypeptide, precursor or proprotein
encoded by an open reading frame described herein. A "mature" form of a polypeptide or
protein arises as a result of one or more naturally occurring processing steps as they may
occur within the cell, including a host cell. The processing steps occur as the gene product
arises, e.g., via cleavage of the amino-terminal methionine residue encoded by the
initiation codon of an open reading frame, or the proteolytic cleavage of a signal peptide
or leader sequence. Thus, a mature form arising from a precursor polypeptide or protein
that has residues 1 to N, where residue 1 is the N-terminal methionine, would have
residues 2 through N remaining. Alternatively, a mature form arising from a precursor
polypeptide or protein having residues 1 to N, in which an amino-terminal signal sequence
from residue 1 to residue M is cleaved, includes the residues from residue M+1 to residue
N remaining. A "mature" form of a polypeptide or protein may also arise from
non-proteolytic post-translational modification. Such non-proteolytic processes include, e.g.,
glycosylation, myristoylation or phosphorylation. In general, a mature polypeptide or
protein may result from the operation of only one of these processes, or the combination of
any of them.
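
By way of nonlimiting illustration, the residue arithmetic described above (residues 2 through N after removal of the initiator methionine, or residues M+1 through N after cleavage of a signal sequence ending at residue M) may be sketched as follows. The function name `mature_form` and its parameter are hypothetical and are introduced only for this sketch; the one-letter sequences are invented examples, not sequences of the invention.

```python
def mature_form(precursor: str, signal_peptide_length: int = 0) -> str:
    """Return the residues remaining after N-terminal processing.

    precursor             -- one-letter amino acid string, residues 1..N
    signal_peptide_length -- M, the last residue of a cleaved signal
                             sequence (0 means no signal sequence)
    """
    n = len(precursor)
    if signal_peptide_length < 0 or signal_peptide_length >= n:
        raise ValueError("signal peptide length must satisfy 0 <= M < N")
    if signal_peptide_length > 0:
        # Signal sequence 1..M is cleaved: residues M+1..N remain.
        return precursor[signal_peptide_length:]
    if precursor.startswith("M"):
        # Initiator methionine is removed: residues 2..N remain.
        return precursor[1:]
    # No N-terminal processing applies in this simplified model.
    return precursor


if __name__ == "__main__":
    print(mature_form("MKTAYIAK"))        # methionine removal
    print(mature_form("MKLLVVAAGS", 4))   # signal sequence of M = 4 residues
```

The sketch models only proteolytic processing; non-proteolytic modifications such as glycosylation do not change the residue string and are therefore not represented.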

As used herein, "identical" residues correspond to those residues in a comparison
between two sequences where the equivalent nucleotide base or amino acid residue in an
alignment of two sequences is the same residue. Residues are alternatively described as 
"similar" or "positive" when the comparisons between two sequences in an alignment 
show that residues in an equivalent position in a comparison are either the same amino 
acid or a conserved amino acid as defined below. 
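
The distinction between "identical" and "positive" residues may be illustrated with a short counting routine over two pre-aligned sequences. This is a minimal sketch only: the grouping of conserved amino acids in `CONSERVED_GROUPS` is an assumption made for the example and is not the definition of conserved amino acids given elsewhere in this specification; `score_alignment` is likewise a hypothetical name.

```python
# Assumed, simplified groups of conserved amino acids (illustration only).
CONSERVED_GROUPS = [
    set("AVLIM"),  # aliphatic / hydrophobic
    set("FYW"),    # aromatic
    set("KRH"),    # basic
    set("DE"),     # acidic
    set("ST"),     # hydroxyl
    set("NQ"),     # amide
]


def score_alignment(seq_a: str, seq_b: str) -> tuple[int, int, int]:
    """Count identical and positive positions in two aligned sequences.

    Returns (identical, positive, compared). Positions where either
    sequence has a gap character '-' are skipped. Every identical
    position is also counted as positive.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    identical = positive = compared = 0
    for a, b in zip(seq_a, seq_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
            positive += 1
        elif any(a in group and b in group for group in CONSERVED_GROUPS):
            positive += 1
    return identical, positive, compared


if __name__ == "__main__":
    print(score_alignment("MKVLA", "MRIL-"))
```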

As used herein, a "chemical composition" relates to a composition including at
least one compound that is either synthesized or extracted from a natural source. A
chemical compound may be the product of a defined synthetic procedure. Such a 
synthesized compound is understood herein to have defined properties in terms of 
molecular formula, molecular structure relating the association of bonded atoms to each
other, physical properties such as chromatographic or spectroscopic characterizations, and
the like. A compound extracted from a natural source is advantageously analyzed by 
chemical and physical methods in order to provide a representation of its defined 
properties, including its molecular formula, molecular structure relating the association of 
bonded atoms to each other, physical properties such as chromatographic or spectroscopic
characterizations, and the like.

As used herein, a "candidate therapeutic agent" is a chemical compound that 
includes at least one substance shown to bind to a target biopolymer. In important
embodiments of the invention, the target biopolymer is a protein or polypeptide, a nucleic
acid, a polysaccharide or proteoglycan, or a lipid such as a complex lipid. The method of
identifying compounds that bind to the target effectively eliminates compounds with little
or no binding affinity, thereby increasing the potential that the identified chemical
compound may have beneficial therapeutic applications. In cases where the "candidate
therapeutic agent" is a mixture of more than one chemical compound, subsequent 
screening procedures may be carried out to identify the particular substance in the mixture 
that is the binding compound, and that is to be identified as a candidate therapeutic agent. 

As used herein, a "pharmaceutical agent" is provided by screening a candidate
therapeutic agent using models for a disease state or pathology in order to identify a
candidate exerting a desired or beneficial therapeutic effect with relation to the disease or
pathology. Such a candidate that successfully provides such an effect is termed a
pharmaceutical agent herein. Nonlimiting examples of model systems that may be used in
such screens include particular cell lines, cultured cells, tissue preparations, whole tissues,
organ preparations, intact organs, and nonhuman mammals. Screens employing at least
one system, and preferably more than one system, may be employed in order to identify a 
pharmaceutical agent. Any pharmaceutical agent so identified may be pursued in further 
investigation using human subjects. 

NOVX Nucleic Acids and Polypeptides

NOVX clones

NOVX nucleic acids and their encoded polypeptides are useful in a variety of 
applications and contexts. The various NOVX nucleic acids and polypeptides according 
to the invention are useful as novel members of the protein families according to the 
presence of domains and sequence relatedness to previously described proteins. 
Additionally, NOVX nucleic acids and polypeptides can also be used to identify proteins
that are members of the family to which the NOVX polypeptides belong. 

The NOVX genes and their corresponding encoded proteins are useful for 
preventing, treating or ameliorating medical conditions, e.g., by protein or gene therapy. 
Pathological conditions can be diagnosed by determining the amount of the new protein in 
a sample or by determining the presence of mutations in the new genes. Specific uses are
described for each of the NOVX genes, based on the tissues in which they are most highly
expressed. Uses include developing products for the diagnosis or treatment of a variety of
diseases and disorders. 

The NOVX nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications and as a research tool. These include serving as a 
specific or selective nucleic acid or protein diagnostic and/or prognostic marker, wherein
the presence or amount of the nucleic acid or the protein is to be assessed, as well as
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a
small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug
targeting/cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene
delivery/gene ablation), (v) a composition promoting tissue regeneration in vitro and
in vivo, and (vi) a biological defense weapon.

In one specific embodiment, the invention includes an isolated polypeptide 
comprising an amino acid sequence selected from the group consisting of: (a) a mature 
form of the amino acid sequence selected from the group consisting of SEQ ID NO: 2n,
wherein n is an integer between 1 and 101; (b) a variant of a mature form of the amino
acid sequence selected from the group consisting of SEQ ID NO: 2n, wherein n is an
integer between 1 and 101, wherein any amino acid in the mature form is changed to a
different amino acid, provided that no more than 15% of the amino acid residues in the
sequence of the mature form are so changed; (c) an amino acid sequence selected from the
group consisting of SEQ ID NO: 2n, wherein n is an integer between 1 and 101; (d) a
variant of the amino acid sequence selected from the group consisting of SEQ ID NO: 2n,
wherein n is an integer between 1 and 101, wherein any amino acid specified in the chosen
sequence is changed to a different amino acid, provided that no more than 15% of the
amino acid residues in the sequence are so changed; and (e) a fragment of any of (a)
through (d).
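
The proviso that no more than 15% of the residues are changed can be checked, for substitution-only variants, by a position-by-position comparison. The following is a minimal sketch under the assumption that the variant has the same length as the reference sequence (that is, no insertions or deletions); the names `fraction_changed` and `is_within_limit` and the example sequences are hypothetical.

```python
def fraction_changed(reference: str, variant: str) -> float:
    """Fraction of positions at which the variant differs from the reference."""
    if len(reference) != len(variant) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    changed = sum(1 for r, v in zip(reference, variant) if r != v)
    return changed / len(reference)


def is_within_limit(reference: str, variant: str, max_fraction: float = 0.15) -> bool:
    """True if no more than max_fraction of the residues are changed."""
    return fraction_changed(reference, variant) <= max_fraction


if __name__ == "__main__":
    reference = "MKTAYIAKQRQISFVKSHFS"       # 20 invented residues
    two_changes = "MKTAYIAKQRQISFVKSHAA"     # 2 of 20 changed (10%)
    four_changes = "AATAYIAKQRQISFVKSHAA"    # 4 of 20 changed (20%)
    print(is_within_limit(reference, two_changes))
    print(is_within_limit(reference, four_changes))
```

The same comparison applies to nucleotide sequences by passing nucleotide strings instead of amino acid strings.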

In another specific embodiment, the invention includes an isolated nucleic acid 
molecule comprising a nucleic acid sequence encoding a polypeptide comprising an amino 
acid sequence selected from the group consisting of: (a) a mature form of the amino acid 
sequence given in SEQ ID NO: 2n, wherein n is an integer between 1 and 101; (b) a variant
of a mature form of the amino acid sequence selected from the group consisting of SEQ
ID NO: 2n, wherein n is an integer between 1 and 101, wherein any amino acid in the
mature form of the chosen sequence is changed to a different amino acid, provided that no
more than 15% of the amino acid residues in the sequence of the mature form are so
changed; (c) the amino acid sequence selected from the group consisting of SEQ ID NO:
2n, wherein n is an integer between 1 and 101; (d) a variant of the amino acid sequence
selected from the group consisting of SEQ ID NO: 2n, wherein n is an integer between 1
and 101, in which any amino acid specified in the chosen sequence is changed to a
different amino acid, provided that no more than 15% of the amino acid residues in the
sequence are so changed; (e) a nucleic acid fragment encoding at least a portion of a
polypeptide comprising the amino acid sequence selected from the group consisting of
SEQ ID NO: 2n, wherein n is an integer between 1 and 101, or any variant of said
polypeptide wherein any amino acid of the chosen sequence is changed to a different
amino acid, provided that no more than 10% of the amino acid residues in the sequence 
are so changed; and (f) the complement of any of said nucleic acid molecules. 

In yet another specific embodiment, the invention includes an isolated nucleic acid 
molecule, wherein said nucleic acid molecule comprises a nucleotide sequence selected
from the group consisting of: (a) the nucleotide sequence selected from the group
consisting of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101; (b) a
nucleotide sequence wherein one or more nucleotides in the nucleotide sequence selected
from the group consisting of SEQ ID NO: 2n-1, wherein n is an integer between 1 and
101, is changed from that selected from the group consisting of the chosen sequence to a
different nucleotide provided that no more than 15% of the nucleotides are so changed; (c)
a nucleic acid fragment of the sequence selected from the group consisting of SEQ ID
NO: 2n-1, wherein n is an integer between 1 and 101; and (d) a nucleic acid fragment
wherein one or more nucleotides in the nucleotide sequence selected from the group
consisting of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, is changed
from that selected from the group consisting of the chosen sequence to a different
nucleotide provided that no more than 15% of the nucleotides are so changed. 
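
The sequence numbering used in the foregoing embodiments (nucleotide sequences at SEQ ID NO: 2n-1 and amino acid sequences at SEQ ID NO: 2n) can be expressed compactly. The sketch below assumes that the range "between 1 and 101" is inclusive and that the nucleotide sequence numbered 2n-1 is paired with the amino acid sequence numbered 2n; both points are assumptions made for illustration, and `seq_id_pair` is a hypothetical name.

```python
def seq_id_pair(n: int) -> tuple[int, int]:
    """Return (nucleotide SEQ ID NO, polypeptide SEQ ID NO) for index n.

    Assumes n runs from 1 to 101 inclusive, with odd SEQ ID NOs (2n-1)
    denoting nucleotide sequences and even SEQ ID NOs (2n) denoting
    amino acid sequences.
    """
    if not 1 <= n <= 101:
        raise ValueError("n must be an integer between 1 and 101")
    return 2 * n - 1, 2 * n


if __name__ == "__main__":
    for n in (1, 2, 101):
        print(n, seq_id_pair(n))
```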

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
NOVX polypeptides or biologically active portions thereof. Also included in the 
invention are nucleic acid fragments sufficient for use as hybridization probes to identify
NOVX-encoding nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR
primers for the amplification and/or mutation of NOVX nucleic acid molecules. As used
herein, the term "nucleic acid molecule" is intended to include DNA molecules (e.g.,
cDNA or genomic DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA
generated using nucleotide analogs, and derivatives, fragments and homologs thereof. The
nucleic acid molecule may be single-stranded or double-stranded, but preferably
comprises double-stranded DNA.

An NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein,
a "mature" form of a polypeptide or protein disclosed in the present invention is the
product of a naturally occurring polypeptide or precursor form or proprotein. The 
naturally occurring polypeptide, precursor or proprotein includes, by way of nonlimiting 
example, the full-length gene product, encoded by the corresponding gene. Alternatively,
it may be defined as the polypeptide, precursor or proprotein encoded by an ORF
described herein. The product "mature" form arises, again by way of nonlimiting 
example, as a result of one or more naturally occurring processing steps as they may take 
place within the cell, or host cell, in which the gene product arises. Examples of such 
processing steps leading to a "mature" form of a polypeptide or protein include the
cleavage of the N-terminal methionine residue encoded by the initiation codon of an ORF,
or the proteolytic cleavage of a signal peptide or leader sequence. Thus a mature form 
arising from a precursor polypeptide or protein that has residues 1 to N, where residue 1 is 
the N-terminal methionine, would have residues 2 through N remaining after removal of 
the N-terminal methionine. Alternatively, a mature form arising from a precursor
polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence
from residue 1 to residue M is cleaved, would have the residues from residue M+1 to
residue N remaining. Further as used herein, a "mature" form of a polypeptide or protein 
may arise from a step of post-translational modification other than a proteolytic cleavage 
event. Such additional processes include, by way of non-limiting example, glycosylation,
myristoylation or phosphorylation. In general, a mature polypeptide or protein may result
from the operation of only one of these processes, or a combination of any of them. 

The term "probes", as utilized herein, refers to nucleic acid sequences of variable 
length, preferably between at least about 10 nucleotides (nt), 100 nt, or as many as 
approximately, e.g., 6,000 nt, depending upon the specific use. Probes are used in the
detection of identical, similar, or complementary nucleic acid sequences. Longer length
probes are generally obtained from a natural or recombinant source, are highly specific,
and are much slower to hybridize than shorter-length oligomer probes. Probes may be
single- or double-stranded and designed to have specificity in PCR, membrane-based
hybridization technologies, or ELISA-like technologies. 

An "isolated" nucleic acid molecule, as the term is utilized herein, is one that is
separated from other nucleic acid molecules which are present in the natural source of the
nucleic acid. Preferably, an "isolated" nucleic acid is free of sequences which naturally
flank the nucleic acid (i.e., sequences located at the 5'- and 3'-termini of the nucleic acid)
in the genomic DNA of the organism from which the nucleic acid is derived. For
example, in various embodiments, the isolated NOVX nucleic acid molecules can contain
less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which
naturally flank the nucleic acid molecule in genomic DNA of the cell/tissue from which
the nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). Moreover, an "isolated" 
nucleic acid molecule, such as a cDNA molecule, can be substantially free of other 
cellular material or culture medium when produced by recombinant techniques, or of 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the
nucleotide sequence SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, or a
complement of this aforementioned nucleotide sequence, can be isolated using standard
molecular biology techniques and the sequence information provided herein. Using all or
a portion of the nucleic acid sequence of SEQ ID NO: 2n-1, wherein n is an integer
between 1 and 101, as a hybridization probe, NOVX molecules can be isolated using
standard hybridization and cloning techniques (e.g., as described in Sambrook, et al.,
(eds.), Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, 1989; and Ausubel, et al., (eds.), Current
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993).

A nucleic acid of the invention can be amplified using cDNA, mRNA or,
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers
according to standard PCR amplification techniques. The nucleic acid so amplified can be
cloned into an appropriate vector and characterized by DNA sequence analysis.
Furthermore, oligonucleotides corresponding to NOVX nucleotide sequences can be
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer.

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a
PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a
genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an
identical, similar or complementary DNA or RNA in a particular cell or tissue.
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt,
or 100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of the
invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length
would further comprise at least 6 contiguous nucleotides of SEQ ID NO: 2n-1, wherein n is
an integer between 1 and 101, or a complement thereof. Oligonucleotides may be 
chemically synthesized and may also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention
comprises a nucleic acid molecule that is a complement of the nucleotide sequence from the
group consisting of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, or a portion
of this nucleotide sequence (e.g., a fragment that can be used as a probe or primer or a
fragment encoding a biologically-active portion of an NOVX polypeptide). A nucleic acid
molecule that is complementary to the nucleotide sequence from the group consisting of
SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, is one that is sufficiently
complementary to that nucleotide sequence that it can hydrogen bond with few or no
mismatches to it, thereby forming a stable duplex.
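
Complementarity with few or no mismatches may be illustrated, for Watson-Crick pairing of DNA only, by computing the reverse complement of one strand and counting the positions at which a candidate strand departs from it. This is a sketch under simplifying assumptions (equal-length strands, unambiguous A/C/G/T bases, no Hoogsteen pairing); the function names and example sequences are hypothetical.

```python
# Watson-Crick complement of each DNA base (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(strand: str) -> str:
    """Return the reverse complement of a DNA strand written 5' to 3'."""
    return strand.translate(COMPLEMENT)[::-1]


def count_mismatches(strand: str, candidate: str) -> int:
    """Count mismatched positions when candidate pairs antiparallel with strand.

    Both strands are written 5' to 3' and must have equal length.
    """
    if len(strand) != len(candidate):
        raise ValueError("strands must have equal length")
    perfect_partner = reverse_complement(strand)
    return sum(1 for a, b in zip(perfect_partner, candidate) if a != b)


if __name__ == "__main__":
    print(reverse_complement("ATGCCG"))
    print(count_mismatches("ATGCCG", "CGGCAT"))  # perfect complement
    print(count_mismatches("ATGCCG", "CGGCAA"))  # one mismatch
```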

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen 
base pairing between nucleotide units of a nucleic acid molecule, and the term "binding"
means the physical or chemical interaction between two polypeptides or compounds or
associated polypeptides or compounds or combinations thereof. Binding includes ionic,
non-ionic, van der Waals, hydrophobic interactions, and the like. A physical interaction
can be either direct or indirect. Indirect interactions may be through or due to the effects 
of another polypeptide or compound. Direct binding refers to interactions that do not take 
place through, or due to, the effect of another polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous)
nucleotides or at least 4 (contiguous) amino acids, a length sufficient to allow for
specific hybridization in the case of nucleic acids or for specific recognition of an epitope
in the case of amino acids, respectively, and are at most some portion less than a full
length sequence. Fragments may be derived from any contiguous portion of a nucleic acid
or amino acid sequence of choice. Derivatives are nucleic acid sequences or amino acid
sequences formed from the native compounds either directly or by modification or partial
substitution. Analogs are nucleic acid sequences or amino acid sequences that have a
structure similar, but not identical, to the native compound and that differ from it in respect
to certain components or side chains. Analogs may be synthetic or from a different
evolutionary origin and may have a similar or opposite metabolic activity compared to
wild type. Homologs are nucleic acid sequences or amino acid sequences of a particular
gene that are derived from different species.

A full-length NOVX clone is identified as containing an ATG translation start 
codon and an in-frame stop codon. Any disclosed NOVX nucleotide sequence lacking an 
ATG start codon therefore encodes a truncated C-terminal fragment of the respective 
NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 5'
direction of the disclosed sequence. Any disclosed NOVX nucleotide sequence lacking an
in-frame stop codon similarly encodes a truncated N-terminal fragment of the respective 
NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 3' 
direction of the disclosed sequence. 

Derivatives and analogs may be full length or other than full length, if the
derivative or analog contains a modified nucleic acid or amino acid, as described below.
Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not 
limited to, molecules comprising regions that are substantially homologous to the nucleic 
acids or proteins of the invention, in various embodiments, by at least about 70%, 80%, or 
95% identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid
sequence of identical size or when compared to an aligned sequence in which the
alignment is done by a computer homology program known in the art, or whose encoding
nucleic acid is capable of hybridizing to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent conditions. 
See, e.g., Ausubel, et al., Current Protocols in Molecular Biology, John Wiley &
Sons, New York, NY, 1993, and below.

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level
or amino acid level as discussed above. Homologous nucleotide sequences include those
sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in
different tissues of the same organism as a result of, for example, alternative splicing of
RNA. Alternatively, isoforms can be encoded by different genes. In the invention,
homologous nucleotide sequences include nucleotide sequences encoding an NOVX
polypeptide of species other than humans, including, but not limited to, vertebrates, and
thus can include, e.g., frog, mouse, rat, rabbit, dog, cat, cow, horse, and other organisms.
Homologous nucleotide sequences also include, but are not limited to, naturally occurring
allelic variations and mutations of the nucleotide sequences set forth herein. A
homologous nucleotide sequence does not, however, include the exact nucleotide
sequence encoding human NOVX protein. Homologous nucleic acid sequences include
those nucleic acid sequences that encode conservative amino acid substitutions (see
below) in SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, as well as a
polypeptide possessing NOVX biological activity. Various biological activities of the
NOVX proteins are described below.

An NOVX polypeptide is encoded by the open reading frame ("ORF") of an
NOVX nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially
be translated into a polypeptide. A stretch of nucleic acids comprising an ORF is
uninterrupted by a stop codon. An ORF that represents the coding sequence for a full
protein begins with an ATG "start" codon and terminates with one of the three "stop"
codons, namely, TAA, TAG, or TGA. For the purposes of this invention, an ORF may be
any part of a coding sequence, with or without a start codon, a stop codon, or both. For an
ORF to be considered as a good candidate for coding for a bona fide cellular protein, a
minimum size requirement is often set, e.g., a stretch of DNA that would encode a protein
of 50 amino acids or more.
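
The description of a full-protein ORF (an ATG start codon, an uninterrupted run of codons, one of the stop codons TAA, TAG or TGA, and a minimum encoded length) can be sketched as a simple scan of the three forward reading frames. This is an illustrative routine only: it ignores the reverse strand and partial ORFs lacking a start or stop codon, and the name `find_orfs`, the parameter `min_codons` and the example sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna: str, min_codons: int = 50) -> list[tuple[int, int]]:
    """Find ATG-to-stop ORFs in the three forward reading frames.

    Returns (start, end) pairs as 0-based, end-exclusive indices into dna;
    the span includes the stop codon. min_codons counts the codons from
    ATG up to, but not including, the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # first ATG opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None                   # look for the next ORF
    return orfs


if __name__ == "__main__":
    example = "CC" + "ATG" + "GCT" * 4 + "TAA" + "G"
    print(find_orfs(example, min_codons=5))
    print(find_orfs(example))  # default threshold of 50 codons
```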

The nucleotide sequences determined from the cloning of the human NOVX genes
allow for the generation of probes and primers designed for use in identifying and/or
cloning NOVX homologues in other cell types, e.g., from other tissues, as well as NOVX
homologues from other vertebrates. The probe/primer typically comprises a substantially
purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide
sequence that hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150,
200, 250, 300, 350 or 400 consecutive nucleotides of a sense strand nucleotide sequence of
SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, or of an anti-sense strand
nucleotide sequence of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101.

Probes based on the human NOVX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 
embodiments, the probe further comprises a label group attached thereto, e.g., the label
group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor.
Such probes can be used as a part of a diagnostic test kit for identifying cells or tissues
which mis-express an NOVX protein, such as by measuring a level of an NOVX-encoding
nucleic acid in a sample of cells from a subject, e.g., detecting NOVX mRNA levels or
determining whether a genomic NOVX gene has been mutated or deleted.

"A polypeptide having a biologically-active portion of an NOVX polypeptide" 
refers to polypeptides exhibiting activity similar, but not necessarily identical to, an 
activity of a polypeptide of the invention, including mature forms, as measured in a 
particular biological assay, with or without dose dependency. A nucleic acid fragment
encoding a "biologically-active portion of NOVX" can be prepared by isolating a portion
of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, that encodes a polypeptide
having an NOVX biological activity (the biological activities of the NOVX proteins are
described below), expressing the encoded portion of NOVX protein (e.g., by recombinant
expression in vitro) and assessing the activity of the encoded portion of NOVX.

NOVX Nucleic Acid and Polypeptide Variants

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NO: 2n-1, wherein n is an integer between 1 and
101, due to degeneracy of the genetic code and thus encode the same NOVX proteins as
those encoded by the nucleotide sequences shown in SEQ ID NO: 2n-1, wherein n is an
integer between 1 and 101. In another embodiment, an isolated nucleic acid molecule of
the invention has a nucleotide sequence encoding a protein having an amino acid sequence
shown in SEQ ID NO: 2n, wherein n is an integer between 1 and 101.
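
Degeneracy of the genetic code, by which different nucleotide sequences encode the same protein, can be demonstrated by translating two sequences that differ only in synonymous codons. The sketch below builds the standard genetic code from the conventional T, C, A, G ordering; the function name `translate` and the two short example sequences are hypothetical and are not sequences of the invention.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence in frame 0, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    seq_a = "ATGGCTAAATAA"
    seq_b = "ATGGCCAAGTGA"   # differs from seq_a only in synonymous codons
    print(translate(seq_a), translate(seq_b))
    print(translate(seq_a) == translate(seq_b))
```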

In addition to the human NOVX nucleotide sequences shown in SEQ ID NO: 2n-1,
wherein n is an integer between 1 and 101, it will be appreciated by those skilled in the art
that DNA sequence polymorphisms that lead to changes in the amino acid sequences of
the NOVX polypeptides may exist within a population (e.g., the human population). Such
genetic polymorphism in the NOVX genes may exist among individuals within a
population due to natural allelic variation. As used herein, the terms "gene" and
"recombinant gene" refer to nucleic acid molecules comprising an open reading frame
(ORF) encoding an NOVX protein, preferably a vertebrate NOVX protein. Such natural
allelic variations can typically result in 1-5% variance in the nucleotide sequence of the
NOVX genes. Any and all such nucleotide variations and resulting amino acid
polymorphisms in the NOVX polypeptides, which are the result of natural allelic variation
and that do not alter the functional activity of the NOVX polypeptides, are intended to be 
within the scope of the invention. 

Moreover, nucleic acid molecules encoding NOVX proteins from other species,
and thus that have a nucleotide sequence that differs from the human SEQ ID NO: 2n-1,
wherein n is an integer between 1 and 101, are intended to be within the scope of the
invention. Nucleic acid molecules corresponding to natural allelic variants and
homologues of the NOVX cDNAs of the invention can be isolated based on their
homology to the human NOVX nucleic acids disclosed herein using the human cDNAs, or
a portion thereof, as a hybridization probe according to standard hybridization techniques
under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 2n-1, wherein n
is an integer between 1 and 101. In another embodiment, the nucleic acid is at least 10,
25, 50, 100, 250, 500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet
another embodiment, an isolated nucleic acid molecule of the invention hybridizes to the
coding region. As used herein, the term "hybridizes under stringent conditions" is
intended to describe conditions for hybridization and washing under which nucleotide
sequences at least 60% homologous to each other typically remain hybridized to each
other. 

Homologs (i.e., nucleic acids encoding NOVX proteins derived from species other 
than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate 
or high stringency hybridization with all or a portion of the particular human sequence as a 
probe using methods well known in the art for nucleic acid hybridization and cloning.

As used herein, the phrase "stringent hybridization conditions" refers to conditions 
under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to
no other sequences. Stringent conditions are sequence-dependent and will be different in
different circumstances. Longer sequences hybridize specifically at higher temperatures
than shorter sequences. Generally, stringent conditions are selected to be about 5°C lower
than the thermal melting point (Tm) for the specific sequence at a defined ionic strength
and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid
concentration) at which 50% of the probes complementary to the target sequence
hybridize to the target sequence at equilibrium. Since the target sequences are generally
present in excess, at Tm, 50% of the probes are occupied at equilibrium. Typically,
stringent conditions will be those in which the salt concentration is less than about 1.0 M
sodium ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at
pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or
oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers 
and oligonucleotides. Stringent conditions may also be achieved with the addition of 
destabilizing agents, such as formamide. 
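
The rules of thumb in the preceding paragraph can be collected into a minimal sketch. The Tm estimate below uses the Wallace rule (2°C per A/T, 4°C per G/C), which is an assumption introduced here and is not part of this disclosure; it is only a rough guide for short oligonucleotides. All function names are hypothetical, and treating any probe of 50 nt or fewer as "short" is likewise an assumption.

```python
SHORT_PROBE_MAX_NT = 50  # the text gives 10 nt to 50 nt as an example of "short"


def wallace_tm(seq: str) -> float:
    """Rough Tm estimate in degrees C (Wallace rule; an assumption, not from the text)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2.0 * at + 4.0 * gc


def stringent_temperature(tm_celsius: float) -> float:
    """Stringent conditions are selected about 5 degrees C below the Tm."""
    return tm_celsius - 5.0


def minimum_stringent_temperature(probe_length_nt: int) -> float:
    """At least about 30 C for short probes, at least about 60 C for longer ones."""
    return 30.0 if probe_length_nt <= SHORT_PROBE_MAX_NT else 60.0


def sodium_in_typical_range(molar: float) -> bool:
    """Typical stringent salt range: about 0.01 to 1.0 M sodium ion."""
    return 0.01 <= molar <= 1.0


if __name__ == "__main__":
    probe = "ATGCATGCATGCATGCATGC"  # hypothetical 20-mer
    tm = wallace_tm(probe)
    print(tm, stringent_temperature(tm), minimum_stringent_temperature(len(probe)))
```

In practice the Tm also depends on ionic strength, pH and any formamide present, none of which this sketch models.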

Stringent conditions are known to those skilled in the art and can be found in
Ausubel, et al., (eds.), Current Protocols in Molecular Biology, John Wiley &
Sons, N.Y. (1989), 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least
about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically
remain hybridized to each other. A non-limiting example of stringent hybridization
conditions is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl
(pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured
salmon sperm DNA at 65°C, followed by one or more washes in 0.2X SSC, 0.01% BSA at
50°C. An isolated nucleic acid molecule of the invention that hybridizes under stringent
conditions to the sequences SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101,
corresponds to a naturally-occurring nucleic acid molecule. As used herein, a
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a
nucleotide sequence that occurs in nature (e.g., encodes a natural protein).

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic
acid molecule comprising the nucleotide sequence of SEQ ID NO: 2n-1, wherein n is an
integer between 1 and 101, or fragments, analogs or derivatives thereof, under conditions
of moderate stringency is provided. A non-limiting example of moderate stringency
hybridization conditions is hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS
and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes in
1X SSC, 0.1% SDS at 37°C. Other conditions of moderate stringency that may be used
are well-known within the art. See, e.g., Ausubel, et al. (eds.), 1993, Current Protocols
in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, Gene Transfer
and Expression, A Laboratory Manual, Stockton Press, NY.

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid
molecule comprising the nucleotide sequences SEQ ID NO: 2n-1, wherein n is an integer
between 1 and 101, or fragments, analogs or derivatives thereof, under conditions of low
stringency, is provided. A non-limiting example of low stringency hybridization
conditions is hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm
DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC,
25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low
stringency that may be used are well known in the art (e.g., as employed for cross-species
hybridizations). See, e.g., Ausubel, et al. (eds.), 1993, Current Protocols in
Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, Gene Transfer and
Expression, A Laboratory Manual, Stockton Press, NY; Shilo and Weinberg, 1981,
Proc Natl Acad Sci USA 78: 6789-6792.

Conservative Mutations 

In addition to naturally-occurring allelic variants of NOVX sequences that may
exist in the population, the skilled artisan will further appreciate that changes can be
introduced by mutation into the nucleotide sequences SEQ ID NO: 2n-1, wherein n is an
integer between 1 and 101, thereby leading to changes in the amino acid sequences of the
encoded NOVX proteins, without altering the functional ability of said NOVX proteins.

For example, nucleotide substitutions leading to amino acid substitutions at
"non-essential" amino acid residues can be made in the sequence SEQ ID NO: 2n, wherein
n is an integer between 1 and 101. A "non-essential" amino acid residue is a residue that
can be altered from the wild-type sequences of the NOVX proteins without altering their
biological activity, whereas an "essential" amino acid residue is required for such
biological activity. For example, amino acid residues that are conserved among the
NOVX proteins of the invention are predicted to be particularly non-amenable to
alteration. Amino acids for which conservative substitutions can be made are well-known
within the art.

Another aspect of the invention pertains to nucleic acid molecules encoding
NOVX proteins that contain changes in amino acid residues that are not essential for
activity. Such NOVX proteins differ in amino acid sequence from SEQ ID NO: 2n-1,
wherein n is an integer between 1 and 101, yet retain biological activity. In one
embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding
a protein, wherein the protein comprises an amino acid sequence at least about 45%
homologous to the amino acid sequences SEQ ID NO: 2n, wherein n is an integer between
1 and 101. Preferably, the protein encoded by the nucleic acid molecule is at least about
60% homologous to SEQ ID NO: 2n, wherein n is an integer between 1 and 101; more
preferably at least about 70% homologous to SEQ ID NO: 2n, wherein n is an integer
between 1 and 101; still more preferably at least about 80% homologous to SEQ ID NO:
2n, wherein n is an integer between 1 and 101; even more preferably at least about 90%
homologous to SEQ ID NO: 2n, wherein n is an integer between 1 and 101; and most
preferably at least about 95% homologous to SEQ ID NO: 2n, wherein n is an integer
between 1 and 101.

An isolated nucleic acid molecule encoding an NOVX protein homologous to the
protein of SEQ ID NO: 2n, wherein n is an integer between 1 and 101, can be created by
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide
sequence of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, such that one
or more amino acid substitutions, additions or deletions are introduced into the encoded
protein.

Mutations can be introduced into SEQ ID NO: 2n-1, wherein n is an integer
between 1 and 101, by standard techniques, such as site-directed mutagenesis and
PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at
one or more predicted, non-essential amino acid residues. A "conservative amino acid
substitution" is one in which the amino acid residue is replaced with an amino acid residue
having a similar side chain. Families of amino acid residues having similar side chains
have been defined within the art. These families include amino acids with basic side
chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic
acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine,
tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline,
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine,
isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine).
Thus, a predicted non-essential amino acid residue in the NOVX protein is replaced with
another amino acid residue from the same side chain family. Alternatively, in another
embodiment, mutations can be introduced randomly along all or part of an NOVX coding
sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for
NOVX biological activity to identify mutants that retain activity. Following mutagenesis
of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, the encoded protein can be
expressed by any recombinant technology known in the art and the activity of the protein
can be determined.
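
The side chain families listed above lend themselves to a simple lookup. The following is a minimal sketch, assuming single-letter amino acid codes and treating a substitution as conservative when both residues share at least one of the listed families; the function and variable names are hypothetical.

```python
# Side chain families as listed in the text, written in single-letter codes.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),
    "acidic": set("DE"),
    "uncharged polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def shared_families(a: str, b: str) -> list:
    """Return the sorted names of families that contain both residues."""
    a, b = a.upper(), b.upper()
    return sorted(
        name for name, members in SIDE_CHAIN_FAMILIES.items()
        if a in members and b in members
    )


def is_conservative(a: str, b: str) -> bool:
    """True if a -> b replaces a residue with a different one from the same family."""
    return a.upper() != b.upper() and bool(shared_families(a, b))


if __name__ == "__main__":
    print(is_conservative("K", "R"), shared_families("K", "R"))
    print(is_conservative("D", "K"), shared_families("D", "K"))
```

Because some residues appear in more than one family (e.g., threonine is both uncharged polar and beta-branched), the sketch reports every shared family rather than a single label.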

The relatedness of amino acid families may also be determined based on side chain
interactions. Substituted amino acids may be fully conserved "strong" residues or fully
conserved "weak" residues. The "strong" group of conserved amino acid residues may be
any one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF,
HY, FYW, wherein the single letter amino acid codes are grouped by those amino acids
that may be substituted for each other. Likewise, the "weak" group of conserved residues
may be any one of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK,
NDEQHK, NEQHRK, HFY, wherein the letters within each group represent the single
letter amino acid code.
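
As an illustration of how the "strong" and "weak" groups might be applied, the sketch below classifies a single substitution. The groups are copied from the paragraph above; the function name, the precedence of "strong" over "weak", and the "identical"/"none" labels are assumptions made for this example.

```python
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND", "SNDEQK",
               "NDEQHK", "NEQHRK", "HFY"]


def classify_substitution(a: str, b: str) -> str:
    """Classify a -> b as 'identical', 'strong', 'weak' or 'none'.

    'strong' is checked first, so a pair found in both lists is reported
    as 'strong' (an assumption of this sketch).
    """
    a, b = a.upper(), b.upper()
    if a == b:
        return "identical"
    if any(a in group and b in group for group in STRONG_GROUPS):
        return "strong"
    if any(a in group and b in group for group in WEAK_GROUPS):
        return "weak"
    return "none"


if __name__ == "__main__":
    for pair in [("S", "T"), ("C", "S"), ("W", "D")]:
        print(pair, classify_substitution(*pair))
```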

In one embodiment, a mutant NOVX protein can be assayed for (i) the ability to
form protein:protein interactions with other NOVX proteins, other cell-surface proteins, or
biologically-active portions thereof; (ii) complex formation between a mutant NOVX
protein and an NOVX ligand; or (iii) the ability of a mutant NOVX protein to bind to an
intracellular target protein or biologically-active portion thereof (e.g., avidin proteins).

In yet another embodiment, a mutant NOVX protein can be assayed for the ability 
to regulate a specific biological function (e.g., regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid
molecules that are hybridizable to or complementary to the nucleic acid molecule
comprising the nucleotide sequence of SEQ ID NO: 2n-1, wherein n is an integer between
1 and 101, or fragments, analogs or derivatives thereof. An "antisense" nucleic acid
comprises a nucleotide sequence that is complementary to a "sense" nucleic acid encoding
a protein (e.g., complementary to the coding strand of a double-stranded cDNA molecule
or complementary to an mRNA sequence). In specific aspects, antisense nucleic acid
molecules are provided that comprise a sequence complementary to at least about 10, 25,
50, 100, 250 or 500 nucleotides or an entire NOVX coding strand, or to only a portion
thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and analogs
of an NOVX protein of SEQ ID NO: 2n, wherein n is an integer between 1 and 101, or
antisense nucleic acids complementary to an NOVX nucleic acid sequence of SEQ ID NO:
2n-1, wherein n is an integer between 1 and 101, are additionally provided.

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an NOVX protein. The 
term "coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues. In another embodiment, the antisense 
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a 
nucleotide sequence encoding the NOVX protein. The term "noncoding region" refers to 
5' and 3' sequences which flank the coding region that are not translated into amino acids 
(i.e., also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding the NOVX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson 
and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 
complementary to the entire coding region of NOVX mRNA, but more preferably is an 
oligonucleotide that is antisense to only a portion of the coding or noncoding region of 
NOVX mRNA. For example, the antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of NOVX mRNA. An antisense 
oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 
nucleotides in length. An antisense nucleic acid of the invention can be constructed using 
chemical synthesis or enzymatic ligation reactions using procedures known in the art. For 
example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically 
synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and sense nucleic acids (e.g.,
phosphorothioate derivatives and acridine substituted nucleotides can be used).

Examples of modified nucleotides that can be used to generate the antisense
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil,
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil,
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine,
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine,
2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine,
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine,
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil,
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v),
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically
using an expression vector into which a nucleic acid has been subcloned in an antisense
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense
orientation to a target nucleic acid of interest, described further in the following
subsection).
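
Designing an antisense oligonucleotide by Watson-Crick pairing against the region surrounding the translation start site, as discussed above, can be sketched as follows. This is an illustrative example only: the input sequence is hypothetical, the first AUG is assumed to be the start codon (which need not hold for a real transcript), and the oligonucleotide is returned as unmodified DNA written 5' to 3'.

```python
_RNA_TO_ANTISENSE_DNA = str.maketrans("ACGU", "TGCA")


def _normalise(seq: str) -> str:
    """Upper-case the sequence, read T as U, and reject other characters."""
    rna = seq.upper().replace("T", "U")
    if set(rna) - set("ACGU"):
        raise ValueError("sequence may contain only A, C, G, U (or T)")
    return rna


def antisense_dna(target_rna: str) -> str:
    """Return the DNA oligonucleotide (5'->3') complementary to target_rna."""
    return _normalise(target_rna).translate(_RNA_TO_ANTISENSE_DNA)[::-1]


def antisense_to_start_region(mrna: str, length: int = 20) -> str:
    """Antisense oligo of the given length spanning the first AUG.

    Assumes the first AUG is the translation start site (a simplification).
    """
    rna = _normalise(mrna)
    if length > len(rna):
        raise ValueError("oligonucleotide is longer than the mRNA")
    start = rna.find("AUG")
    if start < 0:
        raise ValueError("no AUG start codon found")
    # Centre the window on the start codon, then clip it to the sequence ends.
    left = max(0, start + 1 - length // 2)
    left = min(left, len(rna) - length)
    return antisense_dna(rna[left:left + length])


if __name__ == "__main__":
    toy_mrna = "GGCACCAUGGCUAGC"  # hypothetical sequence, not an NOVX transcript
    print(antisense_to_start_region(toy_mrna, length=9))
```

The length argument corresponds to the oligonucleotide lengths mentioned in the text (e.g., about 5 to 50 nucleotides); modified nucleotides are not modelled.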

The antisense nucleic acid molecules of the invention are typically administered to
a subject or generated in situ such that they hybridize with or bind to cellular mRNA
and/or genomic DNA encoding an NOVX protein to thereby inhibit expression of the
protein (e.g., by inhibiting transcription and/or translation). The hybridization can be by
conventional nucleotide complementarity to form a stable duplex, or, for example, in the
case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific
interactions in the major groove of the double helix. An example of a route of
administration of antisense nucleic acid molecules of the invention includes direct
injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified
to target selected cells and then administered systemically. For example, for systemic
administration, antisense molecules can be modified such that they specifically bind to
receptors or antigens expressed on a selected cell surface (e.g., by linking the antisense
nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or
antigens). The antisense nucleic acid molecules can also be delivered to cells using the
vectors described herein. To achieve sufficient nucleic acid molecules, vector constructs
in which the antisense nucleic acid molecule is placed under the control of a strong pol II
or pol III promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention is
an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms
specific double-stranded hybrids with complementary RNA in which, contrary to the usual
β-units, the strands run parallel to each other. See, e.g., Gaultier, et al., 1987. Nucl. Acids
Res. 15: 6625-6641. The antisense nucleic acid molecule can also comprise a
2'-O-methylribonucleotide (See, e.g., Inoue, et al., 1987. Nucl. Acids Res. 15: 6131-6148)
or a chimeric RNA-DNA analogue (See, e.g., Inoue, et al., 1987. FEBS Lett. 215:
327-330).

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified
bases, and nucleic acids whose sugar phosphate backbones are modified or derivatized.
These modifications are carried out at least in part to enhance the chemical stability of the
modified nucleic acid, such that they may be used, for example, as antisense binding
nucleic acids in therapeutic applications in a subject.

In one embodiment, an antisense nucleic acid of the invention is a ribozyme.
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in
Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave
NOVX mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme
having specificity for an NOVX-encoding nucleic acid can be designed based upon the
nucleotide sequence of an NOVX cDNA disclosed herein (i.e., SEQ ID NO: 2n-1, wherein
n is an integer between 1 and 101). For example, a derivative of a Tetrahymena L-19 IVS
RNA can be constructed in which the nucleotide sequence of the active site is
complementary to the nucleotide sequence to be cleaved in an NOVX-encoding mRNA.
See, e.g., U.S. Patent 4,987,071 to Cech, et al. and U.S. Patent 5,116,742 to Cech, et al.
NOVX mRNA can also be used to select a catalytic RNA having a specific ribonuclease
activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) Science
261:1411-1418.

Alternatively, NOVX gene expression can be inhibited by targeting nucleotide
sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the
NOVX promoter and/or enhancers) to form triple helical structures that prevent
transcription of the NOVX gene in target cells. See, e.g., Helene, 1991. Anticancer Drug
Des. 6: 569-84; Helene, et al., 1992. Ann. N. Y. Acad. Sci. 660: 27-36; Maher, 1992.
Bioessays 14: 807-15.

In various embodiments, the NOVX nucleic acids can be modified at the base
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization,
or solubility of the molecule. For example, the deoxyribose phosphate backbone of the
nucleic acids can be modified to generate peptide nucleic acids. See, e.g., Hyrup, et al.,
1996. Bioorg Med Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or
"PNAs" refer to nucleic acid mimics (e.g., DNA mimics) in which the deoxyribose
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural
nucleobases are retained. The neutral backbone of PNAs has been shown to allow for
specific hybridization to DNA and RNA under conditions of low ionic strength. The
synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis
protocols as described in Hyrup, et al., 1996, supra; Perry-O'Keefe, et al., 1996. Proc.
Natl. Acad. Sci. USA 93: 14670-14675.

PNAs of NOVX can be used in therapeutic and diagnostic applications. For
example, PNAs can be used as antisense or antigene agents for sequence-specific
modulation of gene expression by, e.g., inducing transcription or translation arrest or
inhibiting replication. PNAs of NOVX can also be used, for example, in the analysis of
single base pair mutations in a gene (e.g., PNA directed PCR clamping); as artificial
restriction enzymes when used in combination with other enzymes, e.g., S1 nucleases (See,
Hyrup, et al., 1996, supra); or as probes or primers for DNA sequence and hybridization
(See, Hyrup, et al., 1996, supra; Perry-O'Keefe, et al., 1996, supra).

In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug
delivery known in the art. For example, PNA-DNA chimeras of NOVX can be generated
that may combine the advantageous properties of PNA and DNA. Such chimeras allow
DNA recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the
DNA portion while the PNA portion would provide high binding affinity and specificity.
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of
base stacking, number of bonds between the nucleobases, and orientation (see, Hyrup, et
al., 1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in
Hyrup, et al., 1996, supra and Finn, et al., 1996. Nucl Acids Res 24: 3357-3363. For
example, a DNA chain can be synthesized on a solid support using standard
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g.,
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the
PNA and the 5' end of DNA. See, e.g., Mag, et al., 1989. Nucl Acid Res 17: 5973-5988.
PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule
with a 5' PNA segment and a 3' DNA segment. See, e.g., Finn, et al., 1996, supra.
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3'
PNA segment. See, e.g., Petersen, et al., 1975. Bioorg. Med. Chem. Lett. 5: 1119-11124.

In other embodiments, the oligonucleotide may include other appended groups
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating
transport across the cell membrane (see, e.g., Letsinger, et al., 1989. Proc. Natl. Acad. Sci.
U.S.A. 86: 6553-6556; Lemaitre, et al., 1987. Proc. Natl. Acad. Sci. 84: 648-652; PCT
Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT Publication No.
WO 89/10134). In addition, oligonucleotides can be modified with hybridization triggered
cleavage agents (see, e.g., Krol, et al., 1988. BioTechniques 6:958-976) or intercalating
agents (see, e.g., Zon, 1988. Pharm. Res. 5: 539-549). To this end, the oligonucleotide
may be conjugated to another molecule, e.g., a peptide, a hybridization triggered
cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, and the
like.

NOVX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the
amino acid sequence of NOVX polypeptides whose sequences are provided in SEQ ID
NO: 2n, wherein n is an integer between 1 and 101. The invention also includes a mutant
or variant protein any of whose residues may be changed from the corresponding residues
shown in SEQ ID NO: 2n, wherein n is an integer between 1 and 101, while still encoding
a protein that maintains its NOVX activities and physiological functions, or a functional
fragment thereof.

In general, an NOVX variant that preserves NOVX-like function includes any
variant in which residues at a particular position in the sequence have been substituted by
other amino acids, and further includes the possibility of inserting an additional residue or
residues between two residues of the parent protein as well as the possibility of deleting
one or more residues from the parent sequence. Any amino acid substitution, insertion, or
deletion is encompassed by the invention. In favorable circumstances, the substitution is a
conservative substitution as defined above.

One aspect of the invention pertains to isolated NOVX proteins, and
biologically-active portions thereof, or derivatives, fragments, analogs or homologs thereof.
Also provided are polypeptide fragments suitable for use as immunogens to raise anti-NOVX
antibodies. In one embodiment, native NOVX proteins can be isolated from cells or tissue
sources by an appropriate purification scheme using standard protein purification
techniques. In another embodiment, NOVX proteins are produced by recombinant DNA
techniques. Alternative to recombinant expression, an NOVX protein or polypeptide can
be synthesized chemically using standard peptide synthesis techniques.

An "isolated" or "purified" polypeptide or protein or biologically-active portion
thereof is substantially free of cellular material or other contaminating proteins from the
cell or tissue source from which the NOVX protein is derived, or substantially free from
chemical precursors or other chemicals when chemically synthesized. The language
"substantially free of cellular material" includes preparations of NOVX proteins in which
the protein is separated from cellular components of the cells from which it is isolated or
recombinantly-produced. In one embodiment, the language "substantially free of cellular
material" includes preparations of NOVX proteins having less than about 30% (by dry
weight) of non-NOVX proteins (also referred to herein as a "contaminating protein"),
more preferably less than about 20% of non-NOVX proteins, still more preferably less
than about 10% of non-NOVX proteins, and most preferably less than about 5% of
non-NOVX proteins. When the NOVX protein or biologically-active portion thereof is
recombinantly-produced, it is also preferably substantially free of culture medium, i.e.,
culture medium represents less than about 20%, more preferably less than about 10%, and
most preferably less than about 5% of the volume of the NOVX protein preparation.

The language "substantially free of chemical precursors or other chemicals"
includes preparations of NOVX proteins in which the protein is separated from chemical
precursors or other chemicals that are involved in the synthesis of the protein. In one
embodiment, the language "substantially free of chemical precursors or other chemicals"
includes preparations of NOVX proteins having less than about 30% (by dry weight) of
chemical precursors or non-NOVX chemicals, more preferably less than about 20%
chemical precursors or non-NOVX chemicals, still more preferably less than about 10%
chemical precursors or non-NOVX chemicals, and most preferably less than about 5%
chemical precursors or non-NOVX chemicals.

Biologically-active portions of NOVX proteins include peptides comprising amino
acid sequences sufficiently homologous to or derived from the amino acid sequences of
the NOVX proteins (e.g., the amino acid sequence shown in SEQ ID NO: 2n, wherein n is
an integer between 1 and 101) that include fewer amino acids than the full-length NOVX
proteins, and exhibit at least one activity of an NOVX protein. Typically,
biologically-active portions comprise a domain or motif with at least one activity of the
NOVX protein. A biologically-active portion of an NOVX protein can be a polypeptide
which is, for example, 10, 25, 50, 100 or more amino acid residues in length.
Moreover, other biologically-active portions, in which other regions of the protein are
deleted, can be prepared by recombinant techniques and evaluated for one or more of the
functional activities of a native NOVX protein.

In an embodiment, the NOVX protein has an amino acid sequence shown in SEQ ID
NO: 2n, wherein n is an integer between 1 and 101. In other embodiments, the NOVX
protein is substantially homologous to SEQ ID NO: 2n, wherein n is an integer between 1
and 101, and retains the functional activity of the protein of SEQ ID NO: 2n, wherein n is
an integer between 1 and 101, yet differs in amino acid sequence due to natural allelic
variation or mutagenesis, as described in detail, below. Accordingly, in another
embodiment, the NOVX protein is a protein that comprises an amino acid sequence at
least about 45% homologous to the amino acid sequence SEQ ID NO: 2n, wherein n is an
integer between 1 and 101, and retains the functional activity of the NOVX proteins of
SEQ ID NO: 2n, wherein n is an integer between 1 and 101.

Determining Homology Between Two or More Sequences

To determine the percent homology of two amino acid sequences or of two nucleic
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal
alignment with a second amino or nucleic acid sequence). The amino acid residues or
nucleotides at corresponding amino acid positions or nucleotide positions are then
compared. When a position in the first sequence is occupied by the same amino acid
residue or nucleotide as the corresponding position in the second sequence, then the
molecules are homologous at that position (i.e., as used herein amino acid or nucleic acid
"homology" is equivalent to amino acid or nucleic acid "identity").

The nucleic acid sequence homology may be determined as the degree of identity between
two sequences. The homology may be determined using computer programs known in the
art, such as GAP software provided in the GCG program package. See, Needleman and
Wunsch, 1970. J Mol Biol 48: 443-453. Using GCG GAP software with the following
settings for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP
extension penalty of 0.3, the coding region of the analogous nucleic acid sequences
referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%,
90%, 95%, 98%, or 99%, with the CDS (encoding) part of the DNA from the group
consisting of SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101.

The term "sequence identity" refers to the degree to which two polynucleotide or
polypeptide sequences are identical on a residue-by-residue basis over a particular region
of comparison. The term "percentage of sequence identity" is calculated by comparing
two optimally aligned sequences over that region of comparison, determining the number
of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of
nucleic acids) occurs in both sequences to yield the number of matched positions, dividing
the number of matched positions by the total number of positions in the region of
comparison (i.e., the window size), and multiplying the result by 100 to yield the
percentage of sequence identity. The term "substantial identity" as used herein denotes a
characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a
sequence that has at least 80 percent sequence identity, preferably at least 85 percent
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent
sequence identity as compared to a reference sequence over a comparison region.
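
The percentage-of-sequence-identity calculation described above can be written out as a minimal sketch. It assumes the two sequences have already been optimally aligned (e.g., by a program such as GAP) and are supplied as equal-length strings with "-" marking gaps; counting gap columns toward the window size is an assumption of this example, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Matched positions / window size * 100 for two pre-aligned sequences."""
    a, b = aligned_a.upper(), aligned_b.upper()
    if len(a) != len(b):
        raise ValueError("aligned sequences must have the same length")
    if not a:
        raise ValueError("comparison window is empty")
    # A position matches when both sequences carry the same residue (not a gap).
    matched = sum(1 for x, y in zip(a, b) if x == y and x != gap)
    return 100.0 * matched / len(a)


def is_substantially_identical(identity_percent: float) -> bool:
    """'Substantial identity' as used in the text: at least 80 percent."""
    return identity_percent >= 80.0


if __name__ == "__main__":
    pid = percent_identity("ACGTACGTAC", "ACGTTCGTAC")  # hypothetical sequences
    print(pid, is_substantially_identical(pid))
```

The same function applies to aligned amino acid sequences, since it only compares characters position by position.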

Chimeric and Fusion Proteins

The invention also provides NOVX chimeric or fusion proteins. As used herein,
an NOVX "chimeric protein" or "fusion protein" comprises an NOVX polypeptide
operatively-linked to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a






polypeptide having an amino acid sequence corresponding to an NOVX protein SEQ ID 
NO: 2n, wherein n is an integer between 1 and 101, whereas a "non-NOVX polypeptide" 
refers to a polypeptide having an amino acid sequence corresponding to a protein that is 
not substantially homologous to the NOVX protein, e.g., a protein that is different from 
the NOVX protein and that is derived from the same or a different organism. Within an
NOVX fusion protein, the NOVX polypeptide can correspond to all or a portion of an
NOVX protein. In one embodiment, an NOVX fusion protein comprises at least one
biologically-active portion of an NOVX protein. In another embodiment, an NOVX
fusion protein comprises at least two biologically-active portions of an NOVX protein. In
yet another embodiment, an NOVX fusion protein comprises at least three
biologically-active portions of an NOVX protein. Within the fusion protein, the term "operatively-
linked" is intended to indicate that the NOVX polypeptide and the non-NOVX polypeptide 
are fused in-frame with one another. The non-NOVX polypeptide can be fused to the 
N-terminus or C-terminus of the NOVX polypeptide. 
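
The requirement that the two polypeptides be fused in-frame can be illustrated at the level of the coding sequences. The following Python sketch is a simplified illustration only: it assumes each partner is supplied as a complete open reading frame written 5' to 3', and the function name and the short example sequences are hypothetical placeholders rather than NOVX or GST sequences.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(n_terminal_cds: str, c_terminal_cds: str) -> str:
    """Join two coding sequences so the second is read in the frame of the first."""
    first = n_terminal_cds.upper()
    second = c_terminal_cds.upper()
    for name, cds in (("N-terminal", first), ("C-terminal", second)):
        if len(cds) % 3 != 0:
            raise ValueError(f"{name} coding sequence is not a whole number of codons")
    # Drop a terminal stop codon from the upstream partner so that
    # translation continues into the downstream partner.
    if first[-3:] in STOP_CODONS:
        first = first[:-3]
    # An internal in-frame stop in the upstream partner would truncate the fusion.
    codons = [first[i:i + 3] for i in range(0, len(first), 3)]
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("upstream partner contains an internal in-frame stop codon")
    return first + second


# Hypothetical placeholder sequences (not real GST or NOVX coding sequences).
partner_cds = "ATGTCCCCTTAA"
novx_cds = "ATGGCAGGCTGA"
fusion_cds = fuse_in_frame(partner_cds, novx_cds)
```

Because both inputs are whole numbers of codons and the upstream stop codon is removed, every codon of the downstream partner keeps its original reading frame in the fused sequence. Swapping the argument order places the non-NOVX partner at the C-terminus instead.
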

In one embodiment, the fusion protein is a GST-NOVX fusion protein in which the
NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase)
sequences. Such fusion proteins can facilitate the purification of recombinant NOVX 
polypeptides. 

In another embodiment, the fusion protein is an NOVX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host
cells), expression and/or secretion of NOVX can be increased through use of a 
heterologous signal sequence. 

In yet another embodiment, the fusion protein is an NOVX-immunoglobulin fusion 
protein in which the NOVX sequences are fused to sequences derived from a member of 
the immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the
invention can be incorporated into pharmaceutical compositions and administered to a
subject to inhibit an interaction between an NOVX ligand and an NOVX protein on the
surface of a cell, to thereby suppress NOVX-mediated signal transduction in vivo. The
NOVX-immunoglobulin fusion proteins can be used to affect the bioavailability of an
NOVX cognate ligand. Inhibition of the NOVX ligand/NOVX interaction may be useful
therapeutically both for the treatment of proliferative and differentiative disorders and
for modulating (e.g., promoting or inhibiting) cell survival. Moreover, the
NOVX-immunoglobulin fusion proteins of the invention can be used as immunogens to
produce anti-NOVX antibodies in a subject, to purify NOVX ligands, and in screening
assays to identify molecules that inhibit the interaction of NOVX with an NOVX ligand.

An NOVX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different
polypeptide sequences are ligated together in-frame in accordance with conventional
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation,
restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends
as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic
ligation. In another embodiment, the fusion gene can be synthesized by conventional
techniques including automated DNA synthesizers. Alternatively, PCR amplification of
gene fragments can be carried out using anchor primers that give rise to complementary
overhangs between two consecutive gene fragments that can subsequently be annealed and
reamplified to generate a chimeric gene sequence (see, e.g., Ausubel, et al. (eds.)
Current Protocols in Molecular Biology, John Wiley & Sons, 1992). Moreover,
many expression vectors are commercially available that already encode a fusion moiety
(e.g., a GST polypeptide). An NOVX-encoding nucleic acid can be cloned into such an
expression vector such that the fusion moiety is linked in-frame to the NOVX protein. 

NOVX Agonists and Antagonists

The invention also pertains to variants of the NOVX proteins that function as either 
NOVX agonists (i.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein 
can be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX
protein). An agonist of the NOVX protein can retain substantially the same, or a subset of,
the biological activities of the naturally occurring form of the NOVX protein. An
antagonist of the NOVX protein can inhibit one or more of the activities of the naturally
occurring form of the NOVX protein by, for example, competitively binding to a
downstream or upstream member of a cellular signaling cascade which includes the
NOVX protein. Thus, specific biological effects can be elicited by treatment with a
variant of limited function. In one embodiment, treatment of a subject with a variant
having a subset of the biological activities of the naturally occurring form of the protein
has fewer side effects in a subject relative to treatment with the naturally occurring form of
the NOVX proteins.

Variants of the NOVX proteins that function as either NOVX agonists (i.e., 
mimetics) or as NOVX antagonists can be identified by screening combinatorial libraries 
of mutants (e.g., truncation mutants) of the NOVX proteins for NOVX protein agonist or
antagonist activity. In one embodiment, a variegated library of NOVX variants is
generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a
variegated gene library. A variegated library of NOVX variants can be produced by, for
example, enzymatically ligating a mixture of synthetic oligonucleotides into gene
sequences such that a degenerate set of potential NOVX sequences is expressible as
individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage
display) containing the set of NOVX sequences therein. There are a variety of methods
which can be used to produce libraries of potential NOVX variants from a degenerate
oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an
appropriate expression vector. Use of a degenerate set of genes allows for the provision,
in one mixture, of all of the sequences encoding the desired set of potential NOVX
sequences. Methods for synthesizing degenerate oligonucleotides are well-known within
the art. See, e.g., Narang, 1983. Tetrahedron 39: 3; Itakura, et al., 1984. Annu. Rev.
Biochem. 53: 323; Itakura, et al., 1984. Science 198: 1056; Ike, et al., 1983. Nucl. Acids
Res. 11: 477.
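
The way a single degenerate oligonucleotide specifies a whole set of sequences can be sketched in Python as follows. The sketch uses the standard IUPAC nucleotide ambiguity codes; the function names and the example oligonucleotides are illustrative assumptions and are not taken from the cited references.

```python
from itertools import product
from typing import List

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def library_size(oligo: str) -> int:
    """Number of distinct sequences specified by a degenerate oligonucleotide."""
    size = 1
    for base in oligo.upper():
        size *= len(IUPAC[base])
    return size


def expand_degenerate(oligo: str) -> List[str]:
    """List every non-degenerate sequence specified by a degenerate oligonucleotide."""
    choices = [IUPAC[base] for base in oligo.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical example: GCN followed by AAR (4 x 2 = 8 sequences).
variants = expand_degenerate("GCNAAR")
```

The library size grows multiplicatively with each degenerate position, so for longer oligonucleotides it is usually more practical to call `library_size` than to enumerate every member with `expand_degenerate`.
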

Polypeptide Libraries 

In addition, libraries of fragments of the NOVX protein coding sequences can be
used to generate a variegated population of NOVX fragments for screening and
subsequent selection of variants of an NOVX protein. In one embodiment, a library of
coding sequence fragments can be generated by treating a double-stranded PCR fragment
of an NOVX coding sequence with a nuclease under conditions wherein nicking occurs
only about once per molecule, denaturing the double-stranded DNA, renaturing the DNA
to form double-stranded DNA that can include sense/antisense pairs from different nicked
products, removing single-stranded portions from reformed duplexes by treatment with S1
nuclease, and ligating the resulting fragment library into an expression vector. By this
method, expression libraries can be derived which encode N-terminal and internal
fragments of various sizes of the NOVX proteins.

Various techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for
rapid screening of the gene libraries generated by the combinatorial mutagenesis of
NOVX proteins. The most widely used techniques, which are amenable to high
throughput analysis, for screening large gene libraries typically include cloning the gene
library into replicable expression vectors, transforming appropriate cells with the resulting
library of vectors, and expressing the combinatorial genes under conditions in which
detection of a desired activity facilitates isolation of the vector encoding the gene whose
product was detected. Recursive ensemble mutagenesis (REM), a new technique that
enhances the frequency of functional mutants in the libraries, can be used in combination
with the screening assays to identify NOVX variants. See, e.g., Arkin and Youvan, 1992.
Proc. Natl. Acad. Sci. USA 89: 7811-7815; Delagrave, et al., 1993. Protein Engineering
6:327-331.

NOVX Antibodies 

The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 

contain an antigen binding site that specifically binds (immunoreacts with) an antigen.
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single
chain, Fab, Fab' and F(ab')2 fragments, and an Fab expression library. In general, antibody
molecules obtained from humans relate to any of the classes IgG, IgM, IgA, IgE and IgD,
which differ from one another by the nature of the heavy chain present in the molecule.
Certain classes have subclasses as well, such as IgG1, IgG2, and others. Furthermore, in
humans, the light chain may be a kappa chain or a lambda chain. Reference herein to 
antibodies includes a reference to all such classes, subclasses and types of human antibody 
species. 

An isolated protein of the invention intended to serve as an antigen, or a portion or
fragment thereof, can be used as an immunogen to generate antibodies that
immunospecifically bind the antigen, using standard techniques for polyclonal and
monoclonal antibody preparation. The full-length protein can be used or, alternatively, the
invention provides antigenic peptide fragments of the antigen for use as immunogens. An
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid
sequence of the full length protein, such as an amino acid sequence shown in SEQ ID NO:
2n, wherein n is an integer between 1 and 101, and encompasses an epitope thereof such
that an antibody raised against the peptide forms a specific immune complex with the full
length protein or with any fragment that contains the epitope. Preferably, the antigenic
peptide comprises at least 10 amino acid residues, or at least 15 amino acid residues, or at
least 20 amino acid residues, or at least 30 amino acid residues. Preferred epitopes
encompassed by the antigenic peptide are regions of the protein that are located on its
surface; commonly these are hydrophilic regions.

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of NOVX that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human NOVX protein sequence will 
indicate which regions of a NOVX polypeptide are particularly hydrophilic and, therefore, 

are likely to encode surface residues useful for targeting antibody production. As a means
for targeting antibody production, hydropathy plots showing regions of hydrophilicity and
hydrophobicity may be generated by any method well known in the art, including, for
example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier
transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828;
Kyte and Doolittle 1982, J. Mol. Biol. 157: 105-142, each incorporated herein by
reference in their entirety. Antibodies that are specific for one or more domains within an
antigenic protein, or derivatives, fragments, analogs or homologs thereof, are also 
provided herein. 
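
A hydropathy profile of the kind referred to above can be sketched in Python as a sliding-window average of per-residue hydropathy values. The sketch below uses the Kyte-Doolittle scale without Fourier transformation; the window length of 7, the function names, and the example peptide are illustrative assumptions, and the example peptide is not an NOVX sequence.

```python
from typing import List, Tuple

# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 7) -> List[float]:
    """Mean hydropathy of every full window along the sequence."""
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


def most_hydrophilic_window(sequence: str, window: int = 7) -> Tuple[int, str]:
    """Start index (0-based) and residues of the lowest-scoring window."""
    profile = hydropathy_profile(sequence, window)
    start = min(range(len(profile)), key=profile.__getitem__)
    return start, sequence.upper()[start:start + window]


# Hypothetical peptide with a charged stretch flanked by hydrophobic residues.
example_peptide = "LLVVIIAKDEKRDEGLLVAI"
candidate = most_hydrophilic_window(example_peptide)
```

The lowest-scoring window is only a candidate for a surface-exposed region; whether it forms a useful epitope would still have to be established experimentally. A Hopp Woods analysis would follow the same pattern with a different value table.
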

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs, homologs or orthologs thereof (see, for example,
Antibodies: A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY, incorporated herein by reference). Some of 
these antibodies are discussed below.

Polyclonal Antibodies

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by one or more injections with 
the native protein, a synthetic variant thereof, or a derivative of the foregoing. An
appropriate immunogenic preparation can contain, for example, the naturally occurring
immunogenic protein, a chemically synthesized polypeptide representing the
immunogenic protein, or a recombinantly expressed immunogenic protein. Furthermore,
the protein may be conjugated to a second protein known to be immunogenic in the
mammal being immunized. Examples of such immunogenic proteins include but are not
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean
trypsin inhibitor. The preparation can further include an adjuvant. Various adjuvants used
to increase the immunological response include, but are not limited to, Freund's (complete
and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (e.g.,
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.),
adjuvants usable in humans such as Bacille Calmette-Guerin and Corynebacterium 
parvum, or similar immunostimulatory agents. Additional examples of adjuvants which 
can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can
be isolated from the mammal (e.g., from the blood) and further purified by well known
techniques, such as affinity chromatography using protein A or protein G, which provide
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be
immobilized on a column to purify the immune specific antibody by immunoaffinity
chromatography. Purification of immunoglobulins is discussed, for example, by D. 
Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
(April 17, 2000), pp. 25-28). 

Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition",
as used herein, refers to a population of antibody molecules that contain only one
molecular species of antibody molecule consisting of a unique light chain gene product
and a unique heavy chain gene product. In particular, the complementarity determining
regions (CDRs) of the monoclonal antibody are identical in all the molecules of the
population. MAbs thus contain an antigen binding site capable of immunoreacting with a
particular epitope of the antigen characterized by a unique binding affinity for it.

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a
mouse, hamster, or other appropriate host animal, is typically immunized with an
immunizing agent to elicit lymphocytes that produce or are capable of producing
antibodies that will specifically bind to the immunizing agent. Alternatively, the
lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment 
thereof or a fusion protein thereof. Generally, either peripheral blood lymphocytes are 
used if cells of human origin are desired, or spleen cells or lymph node cells are used if 

non-human mammalian sources are desired. The lymphocytes are then fused with an
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, Academic
Press, (1986) pp. 59-103]. Immortalized cell lines are usually transformed mammalian
cells, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or
mouse myeloma cell lines are employed. The hybridoma cells can be cultured in a
suitable culture medium that preferably contains one or more substances that inhibit the
growth or survival of the unfused, immortalized cells. For example, if the parental cells
lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient
cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high 
level expression of antibody by the selected antibody-producing cells, and are sensitive to 
a medium such as HAT medium. More preferred immortalized cell lines are murine 
myeloma lines, which can be obtained, for instance, from the Salk Institute Cell
Distribution Center, San Diego, California and the American Type Culture Collection,
Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also
have been described for the production of human monoclonal antibodies [Kozbor, J.
Immunol., 133:3001 (1984); Brodeur et al., Monoclonal Antibody Production Techniques
and Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed 
for the presence of monoclonal antibodies directed against the antigen. Preferably, the
binding specificity of monoclonal antibodies produced by the hybridoma cells is
determined by immunoprecipitation or by an in vitro binding assay, such as
radioimmunoassay (RIA) or enzyme-linked immunosorbent assay (ELISA). Such
techniques and assays are known in the art. The binding affinity of the monoclonal
antibody can, for example, be determined by the Scatchard analysis of Munson and
Pollard, Anal. Biochem., 107:220 (1980). It is an objective, especially important in
therapeutic applications of monoclonal antibodies, to identify antibodies having a high 
degree of specificity and a high binding affinity for the target antigen. 
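
For orientation, the principle behind a Scatchard analysis can be sketched in Python. The sketch is a plain linear Scatchard fit for a single class of binding sites and is not the specific procedure of Munson and Pollard; it assumes the relationship bound/free = Ka x (Bmax - bound), and the function name and the synthetic data are illustrative assumptions rather than measured values.

```python
from typing import Sequence, Tuple


def scatchard_fit(bound: Sequence[float], free: Sequence[float]) -> Tuple[float, float]:
    """Estimate (Ka, Bmax) from a linear fit of bound/free against bound.

    Assumes a single class of independent binding sites, so that
    bound/free = Ka * (Bmax - bound): slope = -Ka, x-intercept = Bmax.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired observations")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("bound values must not all be identical")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ka = -slope             # association constant, in 1/(concentration unit)
    bmax = intercept / ka   # binding capacity, in the concentration unit
    return ka, bmax


# Synthetic, noise-free data generated with Kd = 2.0 and Bmax = 10.0
# (arbitrary concentration units), for illustration only.
free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [10.0 * f / (2.0 + f) for f in free_conc]
ka_est, bmax_est = scatchard_fit(bound_conc, free_conc)
```

With real, noisy binding data a linear Scatchard fit can distort the error structure, which is one reason weighted nonlinear fitting is often preferred; the sketch is intended only to show how an affinity constant relates to the measured bound and free concentrations.
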

After the desired hybridoma cells are identified, the clones can be subcloned by
limiting dilution procedures and grown by standard methods (Goding, 1986). Suitable
culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium
and RPMI-1640 medium. Alternatively, the hybridoma cells can be grown in vivo as 
ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the
culture medium or ascites fluid by conventional immunoglobulin purification procedures
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such 
as those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal 
antibodies of the invention can be readily isolated and sequenced using conventional
procedures (e.g., by using oligonucleotide probes that are capable of binding specifically
to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells
of the invention serve as a preferred source of such DNA. Once isolated, the DNA can be
placed into expression vectors, which are then transfected into host cells such as simian
COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise
produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the
recombinant host cells. The DNA also can be modified, for example, by substituting the
coding sequence for human heavy and light chain constant domains in place of the
homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 812-13
(1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin
polypeptide can be substituted for the constant domains of an antibody of the invention, or
can be substituted for the variable domains of one antigen-combining site of an antibody 
of the invention to create a chimeric bivalent antibody. 

Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further 

comprise humanized antibodies or human antibodies. These antibodies are suitable for
administration to humans without engendering an immune response by the human against
the administered immunoglobulin. Humanized forms of antibodies are chimeric
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab',
F(ab')2 or other antigen-binding subsequences of antibodies) that are principally comprised
of the sequence of a human immunoglobulin, and contain minimal sequence derived from
a non-human immunoglobulin. Humanization can be performed following the method of
Winter and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al.,
Nature, 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by
substituting rodent CDRs or CDR sequences for the corresponding sequences of a human
antibody. (See also U.S. Patent No. 5,225,539.) In some instances, Fv framework
residues of the human immunoglobulin are replaced by corresponding non-human
residues. Humanized antibodies can also comprise residues which are found neither in the
recipient antibody nor in the imported CDR or framework sequences. In general, the
humanized antibody will comprise substantially all of at least one, and typically two,
variable domains, in which all or substantially all of the CDR regions correspond to those
of a non-human immunoglobulin and all or substantially all of the framework regions are
those of a human immunoglobulin consensus sequence. The humanized antibody
optimally also will comprise at least a portion of an immunoglobulin constant region (Fc),
typically that of a human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and
Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)).

Human Antibodies

Fully human antibodies essentially relate to antibody molecules in which the entire
sequence of both the light chain and the heavy chain, including the CDRs, arises from
human genes. Such antibodies are termed "human antibodies", or "fully human
antibodies" herein. Human monoclonal antibodies can be prepared by the trioma
technique; the human B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol
Today 4: 72) and the EBV hybridoma technique to produce human monoclonal antibodies
(see Cole, et al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R.
Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the practice of the
present invention and may be produced by using human hybridomas (see Cote, et al.,
1983. Proc Natl Acad Sci USA 80: 2026-2030) or by transforming human B-cells with
Epstein Barr Virus in vitro (see Cole, et al., 1985 In: Monoclonal Antibodies and
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991);
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the
endogenous immunoglobulin genes have been partially or completely inactivated. Upon
challenge, human antibody production is observed, which closely resembles that seen in
humans in all respects, including gene rearrangement, assembly, and antibody repertoire.
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806;
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10,
779-783 (1992)); Lonberg et al. (Nature 368, 856-859 (1994)); Morrison (Nature 368,
812-13 (1994)); Fishwild et al. (Nature Biotechnology 14, 845-51 (1996)); Neuberger
(Nature Biotechnology 14, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol.
13, 65-93 (1995)).

Human antibodies may additionally be produced using transgenic nonhuman 
animals which are modified so as to produce fully human antibodies rather than the 
animal's endogenous antibodies in response to challenge by an antigen. (See PCT 
publication WO94/02602). The endogenous genes encoding the heavy and light 

immunoglobulin chains in the nonhuman host have been incapacitated, and active loci
encoding human heavy and light chain immunoglobulins are inserted into the host's
genome. The human genes are incorporated, for example, using yeast artificial
chromosomes containing the requisite human DNA segments. An animal which provides
all the desired modifications is then obtained as progeny by crossbreeding intermediate
transgenic animals containing fewer than the full complement of the modifications. The
preferred embodiment of such a nonhuman animal is a mouse, and is termed the
Xenomouse™ as disclosed in PCT publications WO 96/33735 and WO 96/34096. This
animal produces B cells which secrete fully human immunoglobulins. The antibodies can
be obtained directly from the animal after immunization with an immunogen of interest,
as, for example, a preparation of a polyclonal antibody, or alternatively from immortalized
B cells derived from the animal, such as hybridomas producing monoclonal antibodies.
Additionally, the genes encoding the immunoglobulins with human variable regions can
be recovered and expressed to obtain the antibodies directly, or can be further modified to 
obtain analogs of antibodies such as, for example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 

Patent No. 5,939,598. It can be obtained by a method including deleting the J segment
genes from at least one endogenous heavy chain locus in an embryonic stem cell to
prevent rearrangement of the locus and to prevent formation of a transcript of a rearranged
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector
containing a gene encoding a selectable marker; and producing from the embryonic stem
cell a transgenic mouse whose somatic and germ cells contain the gene encoding the
selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is 
disclosed in U.S. Patent No. 5,916,771. It includes introducing an expression vector that
contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in
culture, introducing an expression vector containing a nucleotide sequence encoding a
light chain into another mammalian host cell, and fusing the two cells to form a hybrid 
cell. The hybrid cell expresses an antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically 
relevant epitope on an immunogen, and a correlative method for selecting an antibody that 

binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT
publication WO 99/53049.

Fab Fragments and Single Chain Antibodies

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. 
Patent No. 4,946,778). In addition, methods can be adapted for the construction of Fab
expression libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid
and effective identification of monoclonal Fab fragments with the desired specificity for a
protein or derivatives, fragments, analogs or homologs thereof. Antibody fragments that
contain the idiotypes to a protein antigen may be produced by techniques known in the art
including, but not limited to: (i) an F(ab')2 fragment produced by pepsin digestion of an
antibody molecule; (ii) an Fab fragment generated by reducing the disulfide bridges of an
F(ab')2 fragment; (iii) an Fab fragment generated by the treatment of the antibody molecule
with papain and a reducing agent and (iv) Fv fragments.

Bispecific Antibodies

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
that have binding specificities for at least two different antigens. In the present case, one 
of the binding specificities is for an antigenic protein of the invention. The second binding 
target is any other antigen, and advantageously is a cell-surface protein or receptor or
receptor subunit.

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas)
produce a potential mixture of ten different antibody molecules, of which only one has the
correct bispecific structure. The purification of the correct molecule is usually
accomplished by affinity chromatography steps. Similar procedures are disclosed in WO
93/08829, published 13 May 1993, and in Traunecker et al., EMBO J., 10:3655-3659
(1991).
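
The figure of ten different antibody molecules for a quadroma can be reproduced by simple enumeration. The Python sketch below assumes that each antibody consists of two arms, that each arm pairs one heavy chain with one light chain, and that the two arms of a molecule are interchangeable; the chain labels H1, H2, L1 and L2 and the function name are hypothetical.

```python
from itertools import combinations_with_replacement, product
from typing import List, Sequence, Tuple


def quadroma_species(
    heavy: Sequence[str] = ("H1", "H2"),
    light: Sequence[str] = ("L1", "L2"),
) -> Tuple[List[Tuple[str, str]], List[Tuple[str, str]]]:
    """Enumerate the antibody species formed by random chain assortment.

    Returns (all_species, correct_bispecific). Each species is an unordered
    pair of arms, and each arm is one heavy chain paired with one light chain.
    Assumption: heavy[i] and light[i] are the cognate (correctly matched) pair.
    """
    # Every possible heavy/light arm: H1L1, H1L2, H2L1, H2L2.
    arms = [h + l for h, l in product(heavy, light)]
    # Unordered pairs of arms, with repetition allowed.
    species = list(combinations_with_replacement(arms, 2))
    cognate_arms = {h + l for h, l in zip(heavy, light)}
    # The desired bispecific molecule carries one copy of each cognate arm.
    correct = [s for s in species if set(s) == cognate_arms]
    return species, correct


all_species, bispecific = quadroma_species()
```

Four possible arms taken two at a time with repetition give 4 x 5 / 2 = 10 species, and under the stated assumptions only the molecule carrying one copy of each cognate arm is the desired bispecific antibody.
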

Antibody variable domains with the desired binding specificities (antibody-antigen
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least 
part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain 
constant region (CH1) containing the site necessary for light-chain binding present in at
least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if
desired, the immunoglobulin light chain, are inserted into separate expression vectors, and
are co-transfected into a suitable host organism. For further details of generating
bispecific antibodies see, for example, Suresh et al., Methods in Enzymology, 121:210
(1986).

According to another approach described in WO 96/27011, the interface between a
pair of antibody molecules can be engineered to maximize the percentage of heterodimers
which are recovered from recombinant cell culture. The preferred interface comprises at
least a part of the CH3 region of an antibody constant domain. In this method, one or
more small amino acid side chains from the interface of the first antibody molecule are
replaced with larger side chains (e.g., tyrosine or tryptophan). Compensatory "cavities" of
identical or similar size to the large side chain(s) are created on the interface of the second
antibody molecule by replacing large amino acid side chains with smaller ones (e.g.,
alanine or threonine). This provides a mechanism for increasing the yield of the
heterodimer over other unwanted end-products such as homodimers.

Bispecific antibodies can be prepared as full length antibodies or antibody
fragments (e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific
antibodies from antibody fragments have been described in the literature. For example,
bispecific antibodies can be prepared using chemical linkage. Brennan et al., Science
229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to
generate F(ab')2 fragments. These fragments are reduced in the presence of the dithiol
complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular
disulfide formation. The Fab' fragments generated are then converted to thionitrobenzoate
(TNB) derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol
by reduction with mercaptoethylamine and is mixed with an equimolar amount of the
other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies
produced can be used as agents for the selective immobilization of enzymes.

Additionally, Fab' fragments can be directly recovered from E. coli and chemically
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992)
describe the production of a fully humanized bispecific antibody F(ab')2 molecule. Each
Fab' fragment was separately secreted from E. coli and subjected to directed chemical
coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well
as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor
targets.

Various techniques for making and isolating bispecific antibody fragments directly
from recombinant cell culture have also been described. For example, bispecific
antibodies have been produced using leucine zippers. Kostelny et al., J. Immunol.
148(5):1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins were
linked to the Fab' portions of two different antibodies by gene fusion. The antibody
homodimers were reduced at the hinge region to form monomers and then re-oxidized to
form the antibody heterodimers. This method can also be utilized for the production of
antibody homodimers. The "diabody" technology described by Holliger et al., Proc.
Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for
making bispecific antibody fragments. The fragments comprise a heavy-chain variable
domain (VH) connected to a light-chain variable domain (VL) by a linker which is too short
to allow pairing between the two domains on the same chain. Accordingly, the VH and VL
domains of one fragment are forced to pair with the complementary VL and VH domains of
another fragment, thereby forming two antigen-binding sites. Another strategy for making
bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been
reported. See, Gruber et al., J. Immunol. 152:5368 (1994).

Antibodies with more than two valencies are contemplated. For example, trispecific
antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991).

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic 
arm of an immunoglobulin molecule can be combined with an arm which binds to a
triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3,
CD28, or B7), or Fc receptors for IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and
FcγRIII (CD16) so as to focus cellular defense mechanisms to the cell expressing the
particular antigen. Bispecific antibodies can also be used to direct cytotoxic agents to cells
which express a particular antigen. These antibodies possess an antigen-binding arm and
an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DTPA,
DOTA, or TETA. Another bispecific antibody of interest binds the protein antigen
described herein and further binds tissue factor (TF).

Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such
antibodies have, for example, been proposed to target immune system cells to unwanted
cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro
using known methods in synthetic protein chemistry, including those involving
crosslinking agents. For example, immunotoxins can be constructed using a disulfide
exchange reaction or by forming a thioether bond. Examples of suitable reagents for this
purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for
example, in U.S. Patent No. 4,676,980.

Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing
interchain disulfide bond formation in this region. The homodimeric antibody thus
generated can have improved internalization capability and/or increased
complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC).
See Caron et al., J. Exp. Med., 176:1191-1195 (1992) and Shopes, J. Immunol.,
148:2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity can also
be prepared using heterobifunctional cross-linkers as described in Wolff et al., Cancer
Research, 53:2560-2565 (1993). Alternatively, an antibody can be engineered that has dual
Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. See
Stevenson et al., Anti-Cancer Drug Design, 3:219-230 (1989).

Immunoconjugates

The invention also pertains to immunoconjugates comprising an antibody
conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an
enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments
thereof), or a radioactive isotope (i.e., a radioconjugate).

Chemotherapeutic agents useful in the generation of such immunoconjugates have
been described above. Enzymatically active toxins and fragments thereof that can be used
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A
chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain,
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolacca americana proteins
(PAPI, PAPII, and PAP-S), Momordica charantia inhibitor, curcin, crotin, Saponaria
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the
trichothecenes. A variety of radionuclides are available for the production of
radioconjugated antibodies. Examples include 212Bi, 131I, 131In, 90Y, and 186Re.

Conjugates of the antibody and cytotoxic agent are made using a variety of
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol)
propionate (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as
dimethyl adipimidate HCl), active esters (such as disuccinimidyl suberate), aldehydes
(such as glutaraldehyde), bis-azido compounds (such as bis(p-azidobenzoyl)
hexanediamine), bis-diazonium derivatives (such as
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as toluene
2,6-diisocyanate), and bis-active fluorine compounds (such as
1,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as
described in Vitetta et al., Science, 238:1098 (1987). Carbon-14-labeled
1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an
exemplary chelating agent for conjugation of radionuclide to the antibody. See
WO94/11026.

In another embodiment, the antibody can be conjugated to a "receptor" (such as
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate
is administered to the patient, followed by removal of unbound conjugate from the
circulation using a clearing agent and then administration of a "ligand" (e.g., avidin) that is
in turn conjugated to a cytotoxic agent.

Immunoliposomes

The antibodies disclosed herein can also be formulated as immunoliposomes.
Liposomes containing the antibody are prepared by methods known in the art, such as
described in Epstein et al., Proc. Natl. Acad. Sci. USA, 82:3688 (1985); Hwang et al.,
Proc. Natl. Acad. Sci. USA, 77:4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545.
Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556.

Particularly useful liposomes can be generated by the reverse-phase evaporation
method with a lipid composition comprising phosphatidylcholine, cholesterol, and
PEG-derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through
filters of defined pore size to yield liposomes with the desired diameter. Fab' fragments of
the antibody of the present invention can be conjugated to the liposomes as described in
Martin et al., J. Biol. Chem., 257:286-288 (1982) via a disulfide-interchange reaction. A
chemotherapeutic agent (such as Doxorubicin) is optionally contained within the
liposome. See Gabizon et al., J. National Cancer Inst., 81(19):1484 (1989).

Diagnostic Applications of Antibodies Directed Against the Proteins of the Invention 

Antibodies directed against a protein of the invention may be used in methods
known within the art relating to the localization and/or quantitation of the protein (e.g., for
use in measuring levels of the protein within appropriate physiological samples, for use in
diagnostic methods, for use in imaging the protein, and the like). In a given embodiment,
antibodies against the proteins, or derivatives, fragments, analogs or homologs thereof,
that contain the antigen binding domain, are utilized as pharmacologically-active
compounds (see below).

An antibody specific for a protein of the invention can be used to isolate the 
protein by standard techniques, such as immunoaffinity chromatography or 
immunoprecipitation. Such an antibody can facilitate the purification of the natural
protein antigen from cells and of recombinantly produced antigen expressed in host cells.
Moreover, such an antibody can be used to detect the antigenic protein (e.g., in a cellular
lysate or cell supernatant) in order to evaluate the abundance and pattern of expression of
the antigenic protein. Antibodies directed against the protein can be used diagnostically to
monitor protein levels in tissue as part of a clinical testing procedure, e.g., to
determine the efficacy of a given treatment regimen. Detection can be facilitated by
coupling (i.e., physically linking) the antibody to a detectable substance. Examples of
detectable substances include various enzymes, prosthetic groups, fluorescent materials,
luminescent materials, bioluminescent materials, and radioactive materials. Examples of
suitable enzymes include horseradish peroxidase, alkaline phosphatase, β-galactosidase, or
acetylcholinesterase; examples of suitable prosthetic group complexes include
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material
includes luminol; examples of bioluminescent materials include luciferase, luciferin, and
aequorin; and examples of suitable radioactive material include 125I, 131I, 35S or 3H.

Antibody Therapeutics 

Antibodies of the invention, including polyclonal, monoclonal, humanized and
fully human antibodies, may be used as therapeutic agents. Such agents will generally be
employed to treat or prevent a disease or pathology in a subject. An antibody preparation,
preferably one having high specificity and high affinity for its target antigen, is
administered to the subject and will generally have an effect due to its binding with the
target. Such an effect may be one of two kinds, depending on the specific nature of the
interaction between the given antibody molecule and the target antigen in question. In the
first instance, administration of the antibody may abrogate or inhibit the binding of the
target with an endogenous ligand to which it naturally binds. In this case, the antibody
binds to the target and masks a binding site of the naturally occurring ligand, wherein the
ligand serves as an effector molecule. Thus the receptor mediates a signal transduction
pathway for which ligand is responsible.

Alternatively, the effect may be one in which the antibody elicits a physiological 
result by virtue of binding to an effector binding site on the target molecule. In this case 
the target, a receptor having an endogenous ligand which may be absent or defective in the
disease or pathology, binds the antibody as a surrogate effector ligand, initiating a
receptor-based signal transduction event by the receptor.

A therapeutically effective amount of an antibody of the invention relates generally 
to the amount needed to achieve a therapeutic objective. As noted above, this may be a 
binding interaction between the antibody and its target antigen that, in certain cases,
interferes with the functioning of the target, and in other cases, promotes a physiological
response. The amount required to be administered will furthermore depend on the binding
affinity of the antibody for its specific antigen, and will also depend on the rate at which
an administered antibody is depleted from the free volume of the subject to which it is
administered. Common ranges for therapeutically effective dosing of an antibody or 
antibody fragment of the invention may be, by way of nonlimiting example, from about 
0.1 mg/kg body weight to about 50 mg/kg body weight. Common dosing frequencies may 
range, for example, from twice daily to once a week. 
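
As an arithmetic illustration only, the nonlimiting per-kilogram range quoted above can be converted into an absolute amount for a given body weight. The Python sketch below assumes the two endpoints from the text (about 0.1 and about 50 mg/kg); the function name and the 70 kg example weight are hypothetical, and nothing here is a dosing recommendation.

```python
# Endpoints of the nonlimiting range quoted in the text, in mg per kg body weight.
LOW_MG_PER_KG = 0.1
HIGH_MG_PER_KG = 50.0


def dose_range_mg(body_weight_kg):
    """Return (low, high) absolute amounts in mg for one administration."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (LOW_MG_PER_KG * body_weight_kg, HIGH_MG_PER_KG * body_weight_kg)


# Hypothetical 70 kg subject, chosen purely for illustration.
low_mg, high_mg = dose_range_mg(70.0)
print(f"{low_mg:.1f} mg to {high_mg:.1f} mg per administration")
```

For the hypothetical 70 kg subject the range spans roughly 7 mg to 3500 mg per administration, which shows how wide the quoted interval is.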

Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a protein of the invention, as well as other
molecules identified by the screening assays disclosed herein, can be administered for the
treatment of various disorders in the form of pharmaceutical compositions. Principles and
considerations involved in preparing such compositions, as well as guidance in the choice
of components, are provided, for example, in Remington: The Science And Practice Of
Pharmacy, 19th ed. (Alfonso R. Gennaro, et al., editors) Mack Pub. Co., Easton, Pa.: 1995;
Drug Absorption Enhancement: Concepts, Possibilities, Limitations, And Trends,
Harwood Academic Publishers, Langhorne, Pa., 1994; and Peptide And Protein Drug
Delivery (Advances In Parenteral Sciences, Vol. 4), 1991, M. Dekker, New York.

If the antigenic protein is intracellular and whole antibodies are used as inhibitors, 
internalizing antibodies are preferred. However, liposomes can also be used to deliver the 
antibody, or an antibody fragment, into cells. Where antibody fragments are used, the 
smallest inhibitory fragment that specifically binds to the binding domain of the target 
protein is preferred. For example, based upon the variable-region sequences of an 
antibody, peptide molecules can be designed that retain the ability to bind the target 
protein sequence. Such peptides can be synthesized chemically and/or produced by 
recombinant DNA technology. See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA, 90: 
7889-7893 (1993). The formulation herein can also contain more than one active 
compound as necessary for the particular indication being treated, preferably those with 
complementary activities that do not adversely affect each other. Alternatively, or in 
addition, the composition can comprise an agent that enhances its function, such as, for 
example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. 
Such molecules are suitably present in combination in amounts that are effective for the 
purpose intended. 

The active ingredients can also be entrapped in microcapsules prepared, for 
example, by coacervation techniques or by interfacial polymerization, for example,
hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate)
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

The formulations to be used for in vivo administration must be sterile. This is
readily accomplished by filtration through sterile filtration membranes.

Sustained-release preparations can be prepared. Suitable examples of sustained- 
release preparations include semipermeable matrices of solid hydrophobic polymers 
containing the antibody, which matrices are in the form of shaped articles, e.g., films, or
microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for
example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S.
Pat. No. 3,773,919), copolymers of L-glutamic acid and γ-ethyl-L-glutamate,
non-degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers
such as the LUPRON DEPOT™ (injectable microspheres composed of lactic acid-glycolic
acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While polymers
such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for 
over 100 days, certain hydrogels release proteins for shorter time periods. 

ELISA Assay 

An agent for detecting an analyte protein is an antibody capable of binding to an
analyte protein, preferably an antibody with a detectable label. Antibodies can be
polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof
(e.g., Fab or F(ab')2) can be used. The term "labeled", with regard to the probe or antibody,
is intended to encompass direct labeling of the probe or antibody by coupling (i.e.,
physically linking) a detectable substance to the probe or antibody, as well as indirect
labeling of the probe or antibody by reactivity with another reagent that is directly labeled.
Examples of indirect labeling include detection of a primary antibody using a
fluorescently-labeled secondary antibody and end-labeling of a DNA probe with biotin
such that it can be detected with fluorescently-labeled streptavidin. The term "biological
sample" is intended to include tissues, cells and biological fluids isolated from a subject,
as well as tissues, cells and fluids present within a subject. Included within the usage of
the term "biological sample", therefore, is blood and a fraction or component of blood
including blood serum, blood plasma, or lymph. That is, the detection method of the
invention can be used to detect an analyte mRNA, protein, or genomic DNA in a
biological sample in vitro as well as in vivo. For example, in vitro techniques for detection
of an analyte mRNA include Northern hybridizations and in situ hybridizations. In vitro
techniques for detection of an analyte protein include enzyme linked immunosorbent
assays (ELISAs), Western blots, immunoprecipitations, and immunofluorescence. In vitro
techniques for detection of an analyte genomic DNA include Southern hybridizations.
Procedures for conducting immunoassays are described, for example in "ELISA: Theory
and Practice: Methods in Molecular Biology", Vol. 42, J. R. Crowther (Ed.) Humana Press,
Totowa, NJ, 1995; "Immunoassay", E. Diamandis and T. Christopoulos, Academic Press,
Inc., San Diego, CA, 1996; and "Practice and Theory of Enzyme Immunoassays", P.
Tijssen, Elsevier Science Publishers, Amsterdam, 1985. Furthermore, in vivo techniques
for detection of an analyte protein include introducing into a subject a labeled
anti-analyte protein antibody. For example, the antibody can be labeled with a radioactive
marker whose presence and location in a subject can be detected by standard imaging
techniques.

NOVX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an NOVX protein, or derivatives, fragments, analogs or
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule
capable of transporting another nucleic acid to which it has been linked. One type of
vector is a "plasmid", which refers to a circular double stranded DNA loop into which
additional DNA segments can be ligated. Another type of vector is a viral vector, wherein
additional DNA segments can be ligated into the viral genome. Certain vectors are
capable of autonomous replication in a host cell into which they are introduced (e.g.,
bacterial vectors having a bacterial origin of replication and episomal mammalian
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the
genome of a host cell upon introduction into the host cell, and thereby are replicated along
with the host genome. Moreover, certain vectors are capable of directing the expression of
genes to which they are operatively-linked. Such vectors are referred to herein as
"expression vectors". In general, expression vectors of utility in recombinant DNA
techniques are often in the form of plasmids. In the present specification, "plasmid" and
"vector" can be used interchangeably as the plasmid is the most commonly used form of
vector. However, the invention is intended to include such other forms of expression
vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and
adeno-associated viruses), which serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the
invention in a form suitable for expression of the nucleic acid in a host cell, which means
that the recombinant expression vectors include one or more regulatory sequences,
selected on the basis of the host cells to be used for expression, that are operatively-linked
to the nucleic acid sequence to be expressed. Within a recombinant expression vector,
"operably-linked" is intended to mean that the nucleotide sequence of interest is linked to
the regulatory sequence(s) in a manner that allows for expression of the nucleotide
sequence (e.g., in an in vitro transcription/translation system or in a host cell when the
vector is introduced into the host cell).

The term "regulatory sequence" is intended to include promoters, enhancers and
other expression control elements (e.g., polyadenylation signals). Such regulatory
sequences are described, for example, in Goeddel, Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory
sequences include those that direct constitutive expression of a nucleotide sequence in
many types of host cell and those that direct expression of the nucleotide sequence only in
certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by
those skilled in the art that the design of the expression vector can depend on such factors
as the choice of the host cell to be transformed, the level of expression of protein desired,
etc. The expression vectors of the invention can be introduced into host cells to thereby
produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic
acids as described herein (e.g., NOVX proteins, mutant forms of NOVX proteins, fusion
proteins, etc.).

The recombinant expression vectors of the invention can be designed for 
expression of NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX 
proteins can be expressed in bacterial cells such as Escherichia coli, insect cells (using
baculovirus expression vectors), yeast cells or mammalian cells. Suitable host cells are
discussed further in Goeddel, Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, Calif. (1990). Alternatively, the
recombinant expression vector can be transcribed and translated in vitro, for example
using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with
vectors containing constitutive or inducible promoters directing the expression of either
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion
vectors typically serve three purposes: (i) to increase expression of recombinant protein;
(ii) to increase the solubility of the recombinant protein; and (iii) to aid in the purification
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion
moiety and the recombinant protein to enable separation of the recombinant protein from
the fusion moiety subsequent to purification of the fusion protein. Suitable proteolytic
enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and
enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc;
Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, Mass.)
and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST),
maltose E binding protein, or protein A, respectively, to the target recombinant protein.

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc
(Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990)
60-89).
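
The fusion-and-cleavage design described above (fusion moiety, protease site at the junction, then the recombinant protein) can be sketched at the sequence level in Python. This is a simplified illustration under stated assumptions: the recognition motifs used (IEGR for Factor Xa, LVPR for thrombin, DDDDK for enterokinase, each cut after its last residue) are commonly cited consensus sites rather than sequences given in the text, and the tag and target sequences are hypothetical placeholders.

```python
# Commonly cited consensus recognition motifs (an assumption, not from the text).
# Each protease is modeled as cutting immediately after the motif's last residue.
CLEAVAGE_SITES = {
    "factor_xa": "IEGR",
    "thrombin": "LVPR",
    "enterokinase": "DDDDK",
}


def build_fusion(tag, protease, target):
    """Join fusion moiety, protease site and target protein, N- to C-terminus."""
    return tag + CLEAVAGE_SITES[protease] + target


def cleave(fusion, protease):
    """Split a fusion protein after the first occurrence of the protease motif.

    Returns (tag_with_site, released_target). If the motif is absent the
    fusion is returned uncut. Real proteases may also cut at secondary sites,
    which this sketch ignores.
    """
    site = CLEAVAGE_SITES[protease]
    start = fusion.find(site)
    if start < 0:
        return fusion, ""
    cut = start + len(site)
    return fusion[:cut], fusion[cut:]


# Hypothetical placeholder sequences, for illustration only.
fusion = build_fusion("MSPILGYWK", "factor_xa", "MKTAYIAKQR")
tag_part, released = cleave(fusion, "factor_xa")
print(tag_part, released)
```

Placing the site at the junction means the released protein carries no residues from the fusion moiety in this model, which is the separation step the passage describes.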

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein. See, e.g., Gottesman, Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, Calif. (1990) 119-128. Another strategy
is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression
vector so that the individual codons for each amino acid are those preferentially utilized in
E. coli (see, e.g., Wada, et al., 1992. Nucl. Acids Res. 20: 2111-2118). Such alteration of
nucleic acid sequences of the invention can be carried out by standard DNA synthesis
techniques.
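
The codon-substitution strategy can be illustrated with a minimal Python sketch that back-translates a protein sequence using one chosen codon per amino acid. The codon table below is an illustrative assumption: each codon does encode the stated amino acid in the standard genetic code, but the choice of which synonymous codon counts as "preferred" is a placeholder and not measured usage data from the cited reference. The function name is hypothetical.

```python
# One codon per amino acid. Every entry is a valid standard-code codon for that
# residue; the choice among synonymous codons is an illustrative placeholder,
# not authoritative E. coli usage data.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def back_translate(protein):
    """Return a DNA coding sequence that uses the table's codon for each residue."""
    codons = []
    for residue in protein.upper():
        if residue not in PREFERRED_CODON:
            raise ValueError(f"unsupported residue: {residue!r}")
        codons.append(PREFERRED_CODON[residue])
    return "".join(codons)


print(back_translate("MKL"))
```

The encoded protein is unchanged by construction; only the synonymous codon choice differs, which is why the altered sequence can simply be made by DNA synthesis.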

In another embodiment, the NOVX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerevisiae include pYepSec1
(Baldari, et al., 1987. EMBO J. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell
30: 933-943), pJRY88 (Schultz et al., 1987. Gene 54: 113-123), pYES2 (Invitrogen
Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.).

Alternatively, NOVX can be expressed in insect cells using baculovirus expression
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells
(e.g., SF9 cells) include the pAc series (Smith, et al., 1983. Mol. Cell. Biol. 3: 2156-2165)
and the pVL series (Luckow and Summers, 1989. Virology 170: 31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian
expression vectors include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC
(Kaufman, et al., 1987. EMBO J. 6: 187-195). When used in mammalian cells, the
expression vector's control functions are often provided by viral regulatory elements. For
example, commonly used promoters are derived from polyoma, adenovirus 2,
cytomegalovirus, and simian virus 40. For other suitable expression systems for both
prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al.,
Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.

In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type (e.g.,
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific
regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert, et al.,
1987. Genes Dev. 1: 268-277), lymphoid-specific promoters (Calame and Eaton, 1988.
Adv. Immunol. 43: 235-275), in particular promoters of T cell receptors (Winoto and
Baltimore, 1989. EMBO J. 8: 729-733) and immunoglobulins (Banerji, et al., 1983. Cell
33: 729-740; Queen and Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86:
5473-5477), pancreas-specific promoters (Edlund, et al., 1985. Science 230: 912-916), and
mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316
and European Application Publication No. 264,166). Developmentally-regulated
promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990.
Science 249: 374-379) and the α-fetoprotein promoter (Camper and Tilghman, 1989.
Genes Dev. 3: 537-546).

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation.
That is, the DNA molecule is operatively-linked to a regulatory sequence in a manner that
allows for expression (by transcription of the DNA molecule) of an RNA molecule that is
antisense to NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid
cloned in the antisense orientation can be chosen that direct the continuous expression of
the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or
enhancers, or regulatory sequences can be chosen that direct constitutive, tissue specific or
cell type specific expression of antisense RNA. The antisense expression vector can be in
the form of a recombinant plasmid, phagemid or attenuated virus in which antisense
nucleic acids are produced under the control of a high efficiency regulatory region, the
activity of which can be determined by the cell type into which the vector is introduced.
For a discussion of the regulation of gene expression using antisense genes see, e.g.,
Weintraub, et al., "Antisense RNA as a molecular tool for genetic analysis,"
Reviews-Trends in Genetics, Vol. 1(1) 1986.

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and
"recombinant host cell" are used interchangeably herein. It is understood that such terms
refer not only to the particular subject cell but also to the progeny or potential progeny of 
such a cell. Because certain modifications may occur in succeeding generations due to 
either mutation or environmental influences, such progeny may not, in fact, be identical to 
the parent cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein
can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells
(such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are 
known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated
transfection, lipofection, or electroporation. Suitable methods for transforming or
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A
Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Various selectable
markers include those that confer resistance to drugs, such as G418, hygromycin and
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell
on the same vector as that encoding NOVX or can be introduced on a separate vector.
Cells stably transfected with the introduced nucleic acid can be identified by drug
selection (e.g., cells that have incorporated the selectable marker gene will survive, while
the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
can be used to produce (i.e., express) NOVX protein. Accordingly, the invention further 
provides methods for producing NOVX protein using the host cells of the invention. In 
one embodiment, the method comprises culturing the host cell of the invention (into which a
recombinant expression vector encoding NOVX protein has been introduced) in a suitable 
medium such that NOVX protein is produced. In another embodiment, the method further 
comprises isolating NOVX protein from the medium or the host cell. 

Transgenic NOVX Animals

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which NOVX protein-coding sequences have been 
introduced. Such host cells can then be used to create non-human transgenic animals in
which exogenous NOVX sequences have been introduced into their genome or
homologous recombinant animals in which endogenous NOVX sequences have been
altered. Such animals are useful for studying the function and/or activity of NOVX
protein and for identifying and/or evaluating modulators of NOVX protein activity. As
used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more
preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal
includes a transgene. Other examples of transgenic animals include non-human primates,
sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that
is integrated into the genome of a cell from which a transgenic animal develops and that
remains in the genome of the mature animal, thereby directing the expression of an
encoded gene product in one or more cell types or tissues of the transgenic animal. As
used herein, a "homologous recombinant animal" is a non-human animal, preferably a
mammal, more preferably a mouse, in which an endogenous NOVX gene has been altered
by homologous recombination between the endogenous gene and an exogenous DNA 
molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior 
to development of the animal. 

A transgenic animal of the invention can be created by introducing
NOVX-encoding nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by
microinjection, retroviral infection) and allowing the oocyte to develop in a
pseudopregnant female foster animal. The human NOVX cDNA sequences SEQ ID NO:
2n-1, wherein n is an integer between 1 and 101, can be introduced as a transgene into the
genome of a non-human animal. Alternatively, a non-human homologue of the human
NOVX gene, such as a mouse NOVX gene, can be isolated based on hybridization to the
human NOVX cDNA (described further supra) and used as a transgene. Intronic
sequences and polyadenylation signals can also be included in the transgene to increase
the efficiency of expression of the transgene. A tissue-specific regulatory sequence(s) can
be operably-linked to the NOVX transgene to direct expression of NOVX protein to
particular cells. Methods for generating transgenic animals via embryo manipulation and
microinjection, particularly animals such as mice, have become conventional in the art and
are described, for example, in U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and
Hogan, 1986. In: Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y. Similar methods are used for production of other
transgenic animals. A transgenic founder animal can be identified based upon the
presence of the NOVX transgene in its genome and/or expression of NOVX mRNA in
tissues or cells of the animals. A transgenic founder animal can then be used to breed
additional animals carrying the transgene. Moreover, transgenic animals carrying a
transgene encoding NOVX protein can further be bred to other transgenic animals
carrying other transgenes.
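
The numbering scheme "SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101" used above can be made concrete with a short sketch. It assumes that the range of n is inclusive at both ends; the function name is purely illustrative and is not part of the disclosure.

```python
def nucleic_acid_seq_ids(n_min=1, n_max=101):
    """Return the SEQ ID NO values of the form 2n-1 for n in [n_min, n_max].

    Assumption: "between 1 and 101" is read as inclusive of both 1 and 101.
    """
    return [2 * n - 1 for n in range(n_min, n_max + 1)]


seq_ids = nucleic_acid_seq_ids()
# The identifiers are the odd integers from 1 through 201.
print(seq_ids[:3], seq_ids[-1], len(seq_ids))
```

Under that reading, the cDNA sequences referred to are the 101 odd-numbered sequence identifiers.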

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of an NOVX gene into which a deletion, addition or substitution has been
introduced to thereby alter, e.g., functionally disrupt, the NOVX gene. The NOVX gene
can be a human gene (e.g., the cDNA of SEQ ID NO: 2n-1, wherein n is an integer
between 1 and 101), but more preferably, is a non-human homologue of a human NOVX
gene. For example, a mouse homologue of human NOVX gene of SEQ ID NO: 2n-1,
wherein n is an integer between 1 and 101, can be used to construct a homologous
recombination vector suitable for altering an endogenous NOVX gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous NOVX gene is functionally disrupted (i.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous
recombination, the endogenous NOVX gene is mutated or otherwise altered but still
encodes functional protein (e.g., the upstream regulatory region can be altered to thereby
alter the expression of the endogenous NOVX protein). In the homologous recombination
vector, the altered portion of the NOVX gene is flanked at its 5'- and 3'-termini by
additional nucleic acid of the NOVX gene to allow for homologous recombination to
occur between the exogenous NOVX gene carried by the vector and an endogenous
NOVX gene in an embryonic stem cell. The additional flanking NOVX nucleic acid is of
sufficient length for successful homologous recombination with the endogenous gene.
Typically, several kilobases of flanking DNA (both at the 5'- and 3'-termini) are included
in the vector. See, e.g., Thomas, et al., 1987. Cell 51: 503 for a description of homologous
recombination vectors. The vector is then introduced into an embryonic stem cell line (e.g.,
by electroporation) and cells in which the introduced NOVX gene has
homologously-recombined with the endogenous NOVX gene are selected. See, e.g., Li, et al.,
1992. Cell 69: 915.

The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to
form aggregation chimeras. See, e.g., Bradley, 1987. In: Teratocarcinomas and
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp.
113-152. A chimeric embryo can then be implanted into a suitable pseudopregnant female
foster animal and the embryo brought to term. Progeny harboring the
homologously-recombined DNA in their germ cells can be used to breed animals in which all cells of the
animal contain the homologously-recombined DNA by germline transmission of the
transgene. Methods for constructing homologous recombination vectors and homologous
recombinant animals are described further in Bradley, 1991. Curr. Opin. Biotechnol. 2:
823-829; PCT International Publication Nos.: WO 90/11354; WO 91/01140; WO 92/0968;
and WO 93/04169.

In another embodiment, transgenic non-human animals can be produced that
contain selected systems that allow for regulated expression of the transgene. One
example of such a system is the cre/loxP recombinase system of bacteriophage P1. For a
description of the cre/loxP recombinase system, see, e.g., Lakso, et al., 1992. Proc. Natl.
Acad. Sci. USA 89: 6232-6236. Another example of a recombinase system is the FLP
recombinase system of Saccharomyces cerevisiae. See, O'Gorman, et al., 1991. Science
251: 1351-1355. If a cre/loxP recombinase system is used to regulate expression of the
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 
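
The mating that yields "double" transgenic animals can be illustrated with a small enumeration of offspring classes. This is a minimal sketch under stated assumptions that the text does not specify: each parent is hemizygous for a single-copy transgene, the two transgenes segregate independently, and inheritance is Mendelian. The labels "Cre" and "Target" are hypothetical placeholders for the recombinase transgene and the selected-protein transgene.

```python
from fractions import Fraction
from itertools import product


def offspring_classes(parent_a=("Cre", None), parent_b=("Target", None)):
    """Enumerate offspring of a cross between two hemizygous transgenic parents.

    Assumption: each parent transmits either its transgene or no transgene
    (None) with equal probability, independently of the other parent.
    """
    counts = {}
    for a, b in product(parent_a, parent_b):
        inherited = frozenset(t for t in (a, b) if t is not None)
        counts[inherited] = counts.get(inherited, 0) + 1
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


classes = offspring_classes()
double = classes[frozenset({"Cre", "Target"})]
print(double)  # expected fraction of "double" transgenic offspring: 1/4
```

Under these assumptions one quarter of the progeny would carry both transgenes; different zygosity or linked insertion sites would change that fraction.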

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et al., 1997. Nature 385: 810-813. In brief,
a cell (e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit
the growth cycle and enter G0 phase. The quiescent cell can then be fused, e.g., through
the use of electrical pulses, to an enucleated oocyte from an animal of the same species
from which the quiescent cell is isolated. The reconstructed oocyte is then cultured such
that it develops to morula or blastocyst and then transferred to a pseudopregnant female
foster animal. The offspring borne of this female foster animal will be a clone of the
animal from which the cell (e.g., the somatic cell) is isolated. 

Pharmaceutical Compositions 

The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies 
(also referred to herein as "active compounds") of the invention, and derivatives,
fragments, analogs and homologs thereof, can be incorporated into pharmaceutical
compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. As 
used herein, "pharmaceutically acceptable carrier" is intended to include any and all 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with pharmaceutical administration. 
Suitable carriers are described in the most recent edition of Remington's Pharmaceutical 
Sciences, a standard reference text in the field, which is incorporated herein by reference. 
Preferred examples of such carriers or diluents include, but are not limited to, water, 
saline, Ringer's solutions, dextrose solution, and 5% human serum albumin. Liposomes
and non-aqueous vehicles such as fixed oils may also be used. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof in 
the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal
(i.e., topical), transmucosal, and rectal administration. Solutions or suspensions used for
parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid (EDTA); 
buffers such as acetates, citrates or phosphates, and agents for the adjustment of tonicity 
such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in 
ampoules, disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL
(BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the
composition must be sterile and should be fluid to the extent that easy syringeability
exists. It must be stable under the conditions of manufacture and storage and must be
preserved against the contaminating action of microorganisms such as bacteria and fungi.
The carrier can be a solvent or dispersion medium containing, for example, water, ethanol,
polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the
like), and suitable mixtures thereof. The proper fluidity can be maintained, for example,
by the use of a coating such as lecithin, by the maintenance of the required particle size in
the case of dispersion and by the use of surfactants. Prevention of the action of
microorganisms can be achieved by various antibacterial and antifungal agents, for
example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many
cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols
such as mannitol and sorbitol, or sodium chloride in the composition. Prolonged absorption of the
injectable compositions can be brought about by including in the composition an agent
which delays absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., an NOVX protein or anti-NOVX antibody) in the required amount in an appropriate
solvent with one or a combination of ingredients enumerated above, as required, followed
by filtered sterilization. Generally, dispersions are prepared by incorporating the active
compound into a sterile vehicle that contains a basic dispersion medium and the required
other ingredients from those enumerated above. In the case of sterile powders for the
preparation of sterile injectable solutions, methods of preparation are vacuum drying and
freeze-drying that yield a powder of the active ingredient plus any additional desired
ingredient from a previously sterile-filtered solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. They can
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral
therapeutic administration, the active compound can be incorporated with excipients and
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is
applied orally and swished and expectorated or swallowed. Pharmaceutically compatible
binding agents, and/or adjuvant materials can be included as part of the composition. The
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth
or gelatin; an excipient such as starch or lactose; a disintegrating agent such as alginic
acid, Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a
glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin;
or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable propellant,
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For
transmucosal or transdermal administration, penetrants appropriate to the barrier to be
permeated are used in the formulation. Such penetrants are generally known in the art,
and include, for example, for transmucosal administration, detergents, bile salts, and
fusidic acid derivatives. Transmucosal administration can be accomplished through the
use of nasal sprays or suppositories. For transdermal administration, the active
compounds are formulated into ointments, salves, gels, or creams as generally known in
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will
protect the compound against rapid elimination from the body, such as a controlled release
formulation, including implants and microencapsulated delivery systems. Biodegradable,
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides,
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation
of such formulations will be apparent to those skilled in the art. The materials can also be
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be
prepared according to methods known to those skilled in the art, for example, as described
in U.S. Patent No. 4,522,811.

It is especially advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as used
herein refers to physically discrete units suited as unitary dosages for the subject to be
treated; each unit containing a predetermined quantity of active compound calculated to
produce the desired therapeutic effect in association with the required pharmaceutical
carrier. The specification for the dosage unit forms of the invention is dictated by and
directly dependent on the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations inherent in the art of compounding 
such an active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470)
or by stereotactic injection (see, e.g., Chen, et al., 1994. Proc. Natl. Acad. Sci. USA 91:
3054-3057). The pharmaceutical preparation of the gene therapy vector can include the
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene
delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the
pharmaceutical preparation can include one or more cells that produce the gene delivery 
system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy 
applications), to detect NOVX mRNA (e.g., in a biological sample) or a genetic lesion in 
an NOVX gene, and to modulate NOVX activity, as described further, below. In addition, 
the NOVX proteins can be used to screen drugs or compounds that modulate the NOVX 
protein activity or expression as well as to treat disorders characterized by insufficient or 
excessive production of NOVX protein or production of NOVX protein forms that have 
decreased or aberrant activity compared to NOVX wild-type protein (e.g., diabetes
(regulates insulin release); obesity (binds and transports lipids); metabolic disturbances
associated with obesity, the metabolic syndrome X as well as anorexia and wasting
disorders associated with chronic diseases and various cancers, and infectious
disease (possesses anti-microbial activity) and the various dyslipidemias). In addition, the
anti-NOVX antibodies of the invention can be used to detect and isolate NOVX proteins
and modulate NOVX activity. In yet a further aspect, the invention can be used in methods 
to influence appetite, absorption of nutrients and the disposition of metabolic substrates in 
both a positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 

Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides,
peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a 
stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein 
activity. The invention also includes compounds identified in the screening assays 
described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of an 
NOVX protein or polypeptide or biologically-active portion thereof. The test compounds 
of the invention can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 
non-peptide oligomer or small molecule libraries of compounds. See, e.g., Lam, 1997. 
Anticancer Drug Design 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 
molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, 
carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical 
and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art 
and can be screened with any of the assays of the invention. 
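
The molecular-weight definition of a "small molecule" given above can be expressed as a simple check. This is only a sketch: the text says "about" 5 kD and "about" 4 kD, so the hard cutoffs of 5000 and 4000 daltons below are an assumption made for illustration, and the function and constant names are hypothetical.

```python
# Assumed hard cutoffs standing in for "about 5 kD" and "about 4 kD".
SMALL_MOLECULE_LIMIT_DA = 5000.0
PREFERRED_LIMIT_DA = 4000.0


def classify_by_mass(mass_da):
    """Classify a compound by molecular weight (in daltons)."""
    if mass_da <= 0:
        raise ValueError("molecular weight must be positive")
    if mass_da < PREFERRED_LIMIT_DA:
        return "small molecule (most preferred range)"
    if mass_da < SMALL_MOLECULE_LIMIT_DA:
        return "small molecule"
    return "not a small molecule under this definition"


for mass in (180.16, 4500.0, 6000.0):
    print(mass, "->", classify_by_mass(mass))
```

Because the definition is stated in terms of mass only, short peptides and oligonucleotides below the limit qualify alongside conventional organic compounds.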

Examples of methods for the synthesis of molecular libraries can be found in the
art, for example in: DeWitt, et al., 1993. Proc. Natl. Acad. Sci. U.S.A. 90: 6909; Erb, et al.,
1994. Proc. Natl. Acad. Sci. U.S.A. 91: 11422; Zuckermann, et al., 1994. J. Med. Chem.
37: 2678; Cho, et al., 1993. Science 261: 1303; Carell, et al., 1994. Angew. Chem. Int. Ed.
Engl. 33: 2059; Carell, et al., 1994. Angew. Chem. Int. Ed. Engl. 33: 2061; and Gallop, et
al., 1994. J. Med. Chem. 37: 1233.

Libraries of compounds may be presented in solution (e.g., Houghten, 1992.
Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips
(Fodor, 1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores
(Ladner, U.S. Patent No. 5,233,409), plasmids (Cull, et al., 1992. Proc. Natl. Acad. Sci. USA
89: 1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990.
Science 249: 404-406; Cwirla, et al., 1990. Proc. Natl. Acad. Sci. U.S.A. 87: 6378-6382;
Felici, 1991. J. Mol. Biol. 222: 301-310; Ladner, U.S. Patent No. 5,233,409).

In one embodiment, an assay is a cell-based assay in which a cell which expresses
a membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the
cell surface is contacted with a test compound and the ability of the test compound to bind
to an NOVX protein is determined. The cell, for example, can be of mammalian origin or a
yeast cell. Determining the ability of the test compound to bind to the NOVX protein can
be accomplished, for example, by coupling the test compound with a radioisotope or
enzymatic label such that binding of the test compound to the NOVX protein or
biologically-active portion thereof can be determined by detecting the labeled compound
in a complex. For example, test compounds can be labeled with 125I, 35S, 14C, or 3H, either
directly or indirectly, and the radioisotope detected by direct counting of radioemission or
by scintillation counting. Alternatively, test compounds can be enzymatically-labeled
with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the
enzymatic label detected by determination of conversion of an appropriate substrate to
product. In one embodiment, the assay comprises contacting a cell which expresses a
membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the
cell surface with a known compound which binds NOVX to form an assay mixture,
contacting the assay mixture with a test compound, and determining the ability of the test
compound to interact with an NOVX protein, wherein determining the ability of the test
compound to interact with an NOVX protein comprises determining the ability of the test
compound to preferentially bind to NOVX protein or a biologically-active portion thereof
as compared to the known compound.

In another embodiment, an assay is a cell-based assay comprising contacting a cell
expressing a membrane-bound form of NOVX protein, or a biologically-active portion
thereof, on the cell surface with a test compound and determining the ability of the test
compound to modulate (e.g., stimulate or inhibit) the activity of the NOVX protein or
biologically-active portion thereof. Determining the ability of the test compound to 
modulate the activity of NOVX or a biologically-active portion thereof can be 
accomplished, for example, by determining the ability of the NOVX protein to bind to or 
interact with an NOVX target molecule. As used herein, a "target molecule" is a molecule 
with which an NOVX protein binds or interacts in nature, for example, a molecule on the 
surface of a cell which expresses an NOVX interacting protein, a molecule on the surface 
of a second cell, a molecule in the extracellular milieu, a molecule associated with the 
internal surface of a cell membrane or a cytoplasmic molecule. An NOVX target 
molecule can be a non-NOVX molecule or an NOVX protein or polypeptide of the 
invention. In one embodiment, an NOVX target molecule is a component of a signal 
transduction pathway that facilitates transduction of an extracellular signal (e.g., a signal
generated by binding of a compound to a membrane-bound NOVX molecule) through the 
cell membrane and into the cell. The target, for example, can be a second intercellular 
protein that has catalytic activity or a protein that facilitates the association of downstream 
signaling molecules with NOVX. 

Determining the ability of the NOVX protein to bind to or interact with an NOVX 
target molecule can be accomplished by one of the methods described above for 
determining direct binding. In one embodiment, determining the ability of the NOVX 
protein to bind to or interact with an NOVX target molecule can be accomplished by 
determining the activity of the target molecule. For example, the activity of the target 
molecule can be determined by detecting induction of a cellular second messenger of the 
target (i.e., intracellular Ca2+, diacylglycerol, IP3, etc.), detecting catalytic/enzymatic
activity of the target on an appropriate substrate, detecting the induction of a reporter gene
(comprising an NOVX-responsive regulatory element operatively linked to a nucleic acid 
encoding a detectable marker, e.g., luciferase), or detecting a cellular response, for 
example, cell survival, cellular differentiation, or cell proliferation. 

WO 03/010327 PCT/US02/14199

In yet another embodiment, an assay of the invention is a cell-free assay
comprising contacting an NOVX protein or biologically-active portion thereof with a test
compound and determining the ability of the test compound to bind to the NOVX protein
or biologically-active portion thereof. Binding of the test compound to the NOVX protein
can be determined either directly or indirectly as described above. In one such
embodiment, the assay comprises contacting the NOVX protein or biologically-active
portion thereof with a known compound which binds NOVX to form an assay mixture,
contacting the assay mixture with a test compound, and determining the ability of the test
compound to interact with an NOVX protein, wherein determining the ability of the test
compound to interact with an NOVX protein comprises determining the ability of the test
compound to preferentially bind to NOVX or biologically-active portion thereof as
compared to the known compound.

In still another embodiment, an assay is a cell-free assay comprising contacting 
NOVX protein or biologically-active portion thereof with a test compound and 
determining the ability of the test compound to modulate (e.g. stimulate or inhibit) the 
activity of the NOVX protein or biologically-active portion thereof. Determining the
ability of the test compound to modulate the activity of NOVX can be accomplished, for
example, by determining the ability of the NOVX protein to bind to an NOVX target
molecule by one of the methods described above for determining direct binding. In an
alternative embodiment, determining the ability of the test compound to modulate the
activity of NOVX protein can be accomplished by determining the ability of the NOVX
protein to further modulate an NOVX target molecule. For example, the catalytic/enzymatic
activity of the target molecule on an appropriate substrate can be determined as described,
supra.

In yet another embodiment, the cell-free assay comprises contacting the NOVX 
protein or biologically-active portion thereof with a known compound which binds NOVX
protein to form an assay mixture, contacting the assay mixture with a test compound, and
determining the ability of the test compound to interact with an NOVX protein, wherein 
determining the ability of the test compound to interact with an NOVX protein comprises 
determining the ability of the NOVX protein to preferentially bind to or modulate the 
activity of an NOVX target molecule. 

The cell-free assays of the invention are amenable to use of either the soluble form
or the membrane-bound form of NOVX protein. In the case of cell-free assays comprising
the membrane-bound form of NOVX protein, it may be desirable to utilize a solubilizing
agent such that the membrane-bound form of NOVX protein is maintained in solution.
Examples of such solubilizing agents include non-ionic detergents such as
n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide,
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®,
Isotridecypoly(ethylene glycol ether)n, N-dodecyl-N,N-dimethyl-3-ammonio-1-propane
sulfonate, 3-(3-cholamidopropyl)dimethylamminiol-1-propane sulfonate (CHAPS), or
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-1-propane sulfonate (CHAPSO).

In more than one embodiment of the above assay methods of the invention, it may 
be desirable to immobilize either NOVX protein or its target molecule to facilitate
separation of complexed from uncomplexed forms of one or both of the proteins, as well
as to accommodate automation of the assay. Binding of a test compound to NOVX
protein, or interaction of NOVX protein with a target molecule in the presence and
absence of a candidate compound, can be accomplished in any vessel suitable for
containing the reactants. Examples of such vessels include microtiter plates, test tubes,
and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided that
adds a domain that allows one or both of the proteins to be bound to a matrix. For
example, GST-NOVX fusion proteins or GST-target fusion proteins can be adsorbed onto
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized
microtiter plates, which are then combined with the test compound or the test compound and
either the non-adsorbed target protein or NOVX protein, and the mixture is incubated
under conditions conducive to complex formation (e.g., at physiological conditions for salt
and pH). Following incubation, the beads or microtiter plate wells are washed to remove
any unbound components, the matrix is immobilized in the case of beads, and complex
formation is determined either directly or indirectly, for example, as described, supra. Alternatively,
the complexes can be dissociated from the matrix, and the level of NOVX protein binding
or activity determined using standard techniques.

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the NOVX protein or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated
NOVX protein or target molecules can be prepared from biotin-NHS
(N-hydroxy-succinimide) using techniques well-known within the art (e.g., biotinylation
kit, Pierce Chemicals, Rockford, Ill.), and immobilized in the wells of streptavidin-coated
96 well plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein
or target molecules, but which do not interfere with binding of the NOVX protein to its 
target molecule, can be derivatized to the wells of the plate, and unbound target or NOVX 
protein trapped in the wells by antibody conjugation. Methods for detecting such 
complexes, in addition to those described above for the GST-immobilized complexes, 
include immunodetection of complexes using antibodies reactive with the NOVX protein 
or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic 
activity associated with the NOVX protein or target molecule. 

In another embodiment, modulators of NOVX protein expression are identified in 
a method wherein a cell is contacted with a candidate compound and the expression of 
NOVX mRNA or protein in the cell is determined. The level of expression of NOVX 
mRNA or protein in the presence of the candidate compound is compared to the level of 
expression of NOVX mRNA or protein in the absence of the candidate compound. The 
candidate compound can then be identified as a modulator of NOVX mRNA or protein 
expression based upon this comparison. For example, when expression of NOVX mRNA 
or protein is greater (i.e., statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 
NOVX mRNA or protein expression. Alternatively, when expression of NOVX mRNA or 
protein is less (statistically significantly less) in the presence of the candidate compound 
than in its absence, the candidate compound is identified as an inhibitor of NOVX mRNA 
or protein expression. The level of NOVX mRNA or protein expression in the cells can be 
determined by methods described herein for detecting NOVX mRNA or protein. 
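
By way of illustration only, the comparison described above can be sketched computationally. The following Python sketch assumes replicate expression measurements in arbitrary units (the values used in the usage note are hypothetical) and applies an exact two-sided permutation test on the difference of means as one possible way of judging statistical significance. The function names, the 0.05 threshold, and the choice of test are assumptions made for this sketch and are not specified herein.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation test on the difference of group means."""
    pooled = list(treated) + list(control)
    n_treated = len(treated)
    observed = abs(mean(treated) - mean(control))
    total = 0
    extreme = 0
    # Enumerate every way of relabeling the pooled measurements.
    for idx in combinations(range(len(pooled)), n_treated):
        chosen = set(idx)
        group_a = [pooled[i] for i in idx]
        group_b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point rounding.
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


def classify_candidate(treated, control, alpha=0.05):
    """Label a candidate compound from expression with and without it.

    alpha is an assumed significance threshold, not a value from the text.
    """
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    return "stimulator" if mean(treated) > mean(control) else "inhibitor"
```

For example, `classify_candidate([9.1, 8.7, 9.4, 9.0], [4.2, 4.5, 3.9, 4.1])` labels the hypothetical compound a stimulator, because every relabeling of the eight measurements other than the observed split and its mirror image gives a smaller difference in means.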

In yet another aspect of the invention, the NOVX proteins can be used as "bait
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317;
Zervos, et al., 1993. Cell 72: 223-232; Madura, et al., 1993. J. Biol. Chem. 268:
12046-12054; Bartel, et al., 1993. Biotechniques 14: 920-924; Iwabuchi, et al., 1993.
Oncogene 8: 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to 
or interact with NOVX ("NOVX-binding proteins" or "NOVX-bp") and modulate NOVX 
activity. Such NOVX-binding proteins are also likely to be involved in the propagation of 
signals by the NOVX proteins as, for example, upstream or downstream elements of the 
NOVX pathway. 






The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for 
NOVX is fused to a gene encoding the DNA binding domain of a known transcription
factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library of DNA
sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene that
codes for the activation domain of the known transcription factor. If the "bait" and the
"prey" proteins are able to interact, in vivo, forming an NOVX-dependent complex, the
DNA-binding and activation domains of the transcription factor are brought into close
proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) that is
operably linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected and cell colonies containing the functional 
transcription factor can be isolated and used to obtain the cloned gene that encodes the 
protein which interacts with NOVX. 

The invention further pertains to novel agents identified by the aforementioned
screening assays and uses thereof for treatments as described herein.

Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the
corresponding complete gene sequences) can be used in numerous ways as polynucleotide
reagents. By way of example, and not of limitation, these sequences can be used to: (i)
map their respective genes on a chromosome and, thus, locate gene regions associated
with genetic disease; (ii) identify an individual from a minute biological sample (tissue
typing); and (iii) aid in forensic identification of a biological sample. Some of these
applications are described in the subsections, below.

Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the NOVX
sequences, SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101, or fragments or
derivatives thereof, can be used to map the location of the NOVX genes, respectively, on a
chromosome. The mapping of the NOVX sequences to chromosomes is an important first
step in correlating these sequences with genes associated with disease. 

Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the NOVX sequences. Computer analysis of the
NOVX sequences can be used to rapidly select primers that do not span more than one
exon in the genomic DNA, since spanning exons would complicate the amplification process. These primers
can then be used for PCR screening of somatic cell hybrids containing individual human 
chromosomes. Only those hybrids containing the human gene corresponding to the 
NOVX sequences will yield an amplified fragment. 
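
The computer analysis mentioned above can be pictured with a minimal Python sketch. It assumes that exon boundaries are already known as half-open coordinate pairs on the cDNA sequence (the coordinates in the usage note are hypothetical) and simply enumerates candidate primer windows of 15-25 bp that lie wholly inside one exon. The function names and the coordinate convention are assumptions for illustration; a practical primer design would also weigh melting temperature, specificity, and similar criteria not modeled here.

```python
def single_exon_primer_windows(exon_bounds, primer_len=20):
    """List (start, end) half-open windows of primer_len inside a single exon.

    exon_bounds: list of (start, end) half-open cDNA coordinates, one per exon
    (an assumed input format for this sketch).
    """
    if not 15 <= primer_len <= 25:
        raise ValueError("primer length is expected to be 15-25 bp")
    windows = []
    for exon_start, exon_end in exon_bounds:
        # The last valid start leaves room for a full-length primer in this exon.
        for start in range(exon_start, exon_end - primer_len + 1):
            windows.append((start, start + primer_len))
    return windows


def spans_exon_junction(window, exon_bounds):
    """True if the window is not fully contained in any one exon."""
    start, end = window
    return not any(s <= start and end <= e for s, e in exon_bounds)
```

With hypothetical exons at `[(0, 30), (30, 50)]` and a 20 bp primer, the first exon contributes eleven candidate windows and the second contributes one, while a window such as `(20, 40)` is rejected because it crosses the exon junction.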

Somatic cell hybrids are prepared by fusing somatic cells from different mammals
(e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide,
they gradually lose human chromosomes in random order, but retain the mouse
chromosomes. By using media in which mouse cells cannot grow, because they lack a
particular enzyme, but in which human cells can, the one human chromosome that
contains the gene encoding the needed enzyme will be retained. By using various media,
panels of hybrid cell lines can be established. Each cell line in a panel contains either a
single human chromosome or a small number of human chromosomes, and a full set of
mouse chromosomes, allowing easy mapping of individual genes to specific human
chromosomes. See, e.g., D'Eustachio, et al., 1983. Science 220: 919-924. Somatic cell
hybrids containing only fragments of human chromosomes can also be produced by using
human chromosomes with translocations and deletions.
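
The logic of assigning a gene to a chromosome from a hybrid panel can be sketched as a concordance check. The Python sketch below assumes a panel described as a mapping from each hybrid cell line to the set of human chromosomes it retains, together with the PCR outcome for each line; the panel contents, line names, and function name are hypothetical. A chromosome is reported only if its presence or absence agrees with the PCR result in every hybrid.

```python
def assign_chromosome(panel, pcr_positive):
    """Return human chromosomes fully concordant with the PCR results.

    panel: dict mapping hybrid name -> set of human chromosomes retained.
    pcr_positive: dict mapping hybrid name -> True if an amplified
    fragment was obtained from that hybrid.
    """
    candidates = set().union(*panel.values())
    concordant = []
    for chrom in sorted(candidates):
        # Concordant means: fragment amplified exactly when the chromosome is present.
        if all((chrom in panel[h]) == pcr_positive[h] for h in panel):
            concordant.append(chrom)
    return concordant
```

For a hypothetical three-line panel in which lines H1 and H2 both retain chromosome 7 and both give an amplified fragment, while line H3 lacks chromosome 7 and gives none, the function returns only chromosome 7.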

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day 
using a single thermal cycler. Using the NOVX sequences to design oligonucleotide
primers, sub-localization can be achieved with panels of fragments from specific
chromosomes.

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose division has been blocked in
metaphase by a chemical like colcemid that disrupts the mitotic spindle. The
chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A pattern
of light and dark bands develops on each chromosome, so that the chromosomes can be
identified individually. The FISH technique can be used with a DNA sequence as short as
500 or 600 bases. However, clones larger than 1,000 bases have a higher likelihood of
binding to a unique chromosomal location with sufficient signal intensity for simple
detection. Preferably 1,000 bases, and more preferably 2,000 bases, will suffice to get
good results in a reasonable amount of time. For a review of this technique, see, Verma, et
al., Human Chromosomes: A Manual of Basic Techniques (Pergamon Press, New
York 1988).

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for
marking multiple sites and/or multiple chromosomes. Reagents corresponding to
noncoding regions of the genes actually are preferred for mapping purposes. Coding
sequences are more likely to be conserved within gene families, thus increasing the chance 
of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical
position of the sequence on the chromosome can be correlated with genetic map data.
Such data are found, e.g., in McKusick, Mendelian Inheritance in Man (available
on-line through Johns Hopkins University Welch Medical Library). The relationship
between genes and disease, mapped to the same chromosomal region, can then be
identified through linkage analysis (co-inheritance of physically adjacent genes), described
in, e.g., Egeland, et al., 1987. Nature, 325: 783-787.

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the NOVX gene, can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease.
Comparison of affected and unaffected individuals generally involves first looking for
structural alterations in the chromosomes, such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR based on that DNA sequence. 
Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

Tissue Typing

The NOVX sequences of the invention can also be used to identify individuals 
from minute biological samples. In this technique, an individual's genomic DNA is
digested with one or more restriction enzymes, and probed on a Southern blot to yield
unique bands for identification. The sequences of the invention are useful as additional 
DNA markers for RFLP ("restriction fragment length polymorphisms," described in U.S. 
Patent No. 5,272,057). 

Furthermore, the sequences of the invention can be used to provide an alternative
technique that determines the actual base-by-base DNA sequence of selected portions of
an individual's genome. Thus, the NOVX sequences described herein can be used to
prepare two PCR primers from the 5'- and 3'-termini of the sequences. These primers can
then be used to amplify an individual's DNA and subsequently sequence it.

Panels of corresponding DNA sequences from individuals, prepared in this
manner, can provide unique individual identifications, as each individual will have a
unique set of such DNA sequences due to allelic differences. The sequences of the
invention can be used to obtain such identification sequences from individuals and from
tissue. The NOVX sequences of the invention uniquely represent portions of the human
genome. Allelic variation occurs to some degree in the coding regions of these sequences,
and to a greater degree in the noncoding regions. It is estimated that allelic variation
between individual humans occurs with a frequency of about once per each 500 bases.
Much of the allelic variation is due to single nucleotide polymorphisms (SNPs), which
include restriction fragment length polymorphisms (RFLPs).

Each of the sequences described herein can, to some degree, be used as a standard
against which DNA from an individual can be compared for identification purposes.
Because greater numbers of polymorphisms occur in the noncoding regions, fewer
sequences are necessary to differentiate individuals. The noncoding sequences can
comfortably provide positive individual identification with a panel of perhaps 10 to 1,000
primers that each yield a noncoding amplified sequence of 100 bases. If predicted coding
sequences, such as those in SEQ ID NO: 2n-1, wherein n is an integer between 1 and 101,
are used, a more appropriate number of primers for positive individual identification
would be 500-2,000.
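
As a rough worked illustration of the panel sizes mentioned above, the following Python sketch estimates how many variable sites a panel would be expected to cover. It assumes, purely for simplicity, that the estimate of about one allelic difference per 500 bases applies uniformly to every amplified sequence; the text notes that noncoding regions vary more than coding regions, so this single rate is an assumption of the sketch, as is the function name.

```python
def expected_variant_sites(n_primers, amplicon_len=100, bases_per_variant=500):
    """Expected number of variable sites across a panel of amplified sequences.

    Assumes one allelic difference per `bases_per_variant` bases, applied
    uniformly (a simplification made only for this sketch).
    """
    return n_primers * amplicon_len / bases_per_variant


for n in (10, 1000):
    print(n, "primers:", expected_variant_sites(n), "expected variable sites")
```

Under this simplification a panel of 10 primers, each yielding 100 bases, covers 1,000 bases and about 2 expected variable sites, whereas a panel of 1,000 primers covers 100,000 bases and about 200.
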
Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for
prognostic (predictive) purposes to thereby treat an individual prophylactically.
Accordingly, one aspect of the invention relates to diagnostic assays for determining
NOVX protein and/or nucleic acid expression as well as NOVX activity, in the context of
a biological sample (e.g., blood, serum, cells, tissue) to thereby determine whether an
individual is afflicted with a disease or disorder, or is at risk of developing a disorder,
associated with aberrant NOVX expression or activity. The disorders include metabolic
disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated cachexia,
cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune
disorders, and hematopoietic disorders, and the various dyslipidemias, metabolic
disturbances associated with obesity, the metabolic syndrome X and wasting disorders
associated with chronic diseases and various cancers. The invention also provides for
prognostic (or predictive) assays for determining whether an individual is at risk of
developing a disorder associated with NOVX protein, nucleic acid expression or activity.
For example, mutations in an NOVX gene can be assayed in a biological sample. Such
assays can be used for prognostic or predictive purpose to thereby prophylactically treat an
individual prior to the onset of a disorder characterized by or associated with NOVX
protein, nucleic acid expression, or biological activity.

Another aspect of the invention provides methods for determining NOVX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate 
therapeutic or prophylactic agents for that individual (referred to herein as
"pharmacogenomics"). Pharmacogenomics allows for the selection of agents (e.g., drugs)
for therapeutic or prophylactic treatment of an individual based on the genotype of the 
individual (e.g., the genotype of the individual examined to determine the ability of the 
individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g.,
drugs, compounds) on the expression or activity of NOVX in clinical trials.

These and other agents are described in further detail in the following sections. 

Diagnostic Assays 

An exemplary method for detecting the presence or absence of NOVX in a 
biological sample involves obtaining a biological sample from a test subject and
contacting the biological sample with a compound or an agent capable of detecting NOVX
protein or nucleic acid (e.g., mRNA, genomic DNA) that encodes NOVX protein such that
the presence of NOVX is detected in the biological sample. An agent for detecting NOVX
mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to NOVX
mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length
NOVX nucleic acid, such as the nucleic acid of SEQ ID NO: 2n-1, wherein n is an integer
between 1 and 101, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50,
100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under
stringent conditions to NOVX mRNA or genomic DNA. Other suitable probes for use in
the diagnostic assays of the invention are described herein. 

An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or
more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or
F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, is intended
to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect
labeling include detection of a primary antibody using a fluorescently-labeled secondary
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 
fluorescently-labeled streptavidin. The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a subject, as well as tissues, cells and 
fluids present within a subject. That is, the detection method of the invention can be used
to detect NOVX mRNA, protein, or genomic DNA in a biological sample in vitro as well
as in vivo. For example, in vitro techniques for detection of NOVX mRNA include 
Northern hybridizations and in situ hybridizations. In vitro techniques for detection of 
NOVX protein include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations, and immunofluorescence. In vitro techniques for detection of
NOVX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques
for detection of NOVX protein include introducing into a subject a labeled anti-NOVX 
antibody. For example, the antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test
subject. Alternatively, the biological sample can contain mRNA molecules from the test
subject or genomic DNA molecules from the test subject. A preferred biological sample is
a peripheral blood leukocyte sample isolated by conventional means from a subject.

In another embodiment, the methods further involve obtaining a control biological
sample from a control subject, contacting the control sample with a compound or agent
capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and
comparing the presence of NOVX protein, mRNA or genomic DNA in the control sample
with the presence of NOVX protein, mRNA or genomic DNA in the test sample.

The invention also encompasses kits for detecting the presence of NOVX in a 
biological sample. For example, the kit can comprise: a labeled compound or agent 
capable of detecting NOVX protein or mRNA in a biological sample; means for
determining the amount of NOVX in the sample; and means for comparing the amount of
NOVX in the sample with a standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 
NOVX protein or nucleic acid. 

Prognostic Assays

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
NOVX expression or activity. For example, the assays described herein, such as the
preceding diagnostic assays or the following assays, can be utilized to identify a subject
having or at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. Alternatively, the prognostic assays can be utilized to identify a 
subject having or at risk for developing a disease or disorder. Thus, the invention provides 
a method for identifying a disease or disorder associated with aberrant NOVX expression
or activity in which a test sample is obtained from a subject and NOVX protein or nucleic
acid (e.g., mRNA, genomic DNA) is detected, wherein the presence of NOVX protein or 
nucleic acid is diagnostic for a subject having or at risk of developing a disease or disorder 
associated with aberrant NOVX expression or activity. As used herein, a "test sample" 
refers to a biological sample obtained from a subject of interest. For example, a test
sample can be a biological fluid (e.g., serum), cell sample, or tissue.

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) to
treat a disease or disorder associated with aberrant NOVX expression or activity. For
example, such methods can be used to determine whether a subject can be effectively 
treated with an agent for a disorder. Thus, the invention provides methods for determining 
whether a subject can be effectively treated with an agent for a disorder associated with 
aberrant NOVX expression or activity in which a test sample is obtained and NOVX 
protein or nucleic acid is detected (e.g., wherein the presence of NOVX protein or nucleic 
acid is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with aberrant NOVX expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an 
NOVX gene, thereby determining if a subject with the lesioned gene is at risk for a 
disorder characterized by aberrant cell proliferation and/or differentiation. In various 
embodiments, the methods include detecting, in a sample of cells from the subject, the 
presence or absence of a genetic lesion characterized by at least one of an alteration 
affecting the integrity of a gene encoding an NOVX protein, or the misexpression of the
NOVX gene. For example, such genetic lesions can be detected by ascertaining the
existence of at least one of: (i) a deletion of one or more nucleotides from an NOVX gene;
(ii) an addition of one or more nucleotides to an NOVX gene; (iii) a substitution of one or
more nucleotides of an NOVX gene; (iv) a chromosomal rearrangement of an NOVX
gene; (v) an alteration in the level of a messenger RNA transcript of an NOVX gene; (vi)
aberrant modification of an NOVX gene, such as of the methylation pattern of the
genomic DNA; (vii) the presence of a non-wild-type splicing pattern of a messenger RNA
transcript of an NOVX gene; (viii) a non-wild-type level of an NOVX protein; (ix) allelic
loss of an NOVX gene; and (x) inappropriate post-translational modification of an NOVX
protein. As described herein, there are a large number of assay techniques known in the
art which can be used for detecting lesions in an NOVX gene. A preferred biological 
sample is a peripheral blood leukocyte sample isolated by conventional means from a 
subject. However, any biological sample containing nucleated cells may be used, 
including, for example, buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer 
in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain 
reaction (LCR) (see, e.g., Landegren, et al., 1988. Science 241: 1077-1080; and
Nakazawa, et al., 1994. Proc. Natl. Acad. Sci. USA 91: 360-364), the latter of which can
be particularly useful for detecting point mutations in the NOVX gene (see, Abravaya, et
al., 1995. Nucl. Acids Res. 23: 675-682). This method can include the steps of collecting a
sample of cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from
the cells of the sample, contacting the nucleic acid sample with one or more primers that
specifically hybridize to an NOVX gene under conditions such that hybridization and
amplification of the NOVX gene (if present) occurs, and detecting the presence or absence
of an amplification product, or detecting the size of the amplification product and
comparing the length to a control sample. It is anticipated that PCR and/or LCR may be
desirable to use as a preliminary amplification step in conjunction with any of the
techniques used for detecting mutations described herein.

Alternative amplification methods include: self sustained sequence replication (see,
Guatelli, et al., 1990. Proc. Natl. Acad. Sci. USA 87: 1874-1878), transcriptional
amplification system (see, Kwoh, et al., 1989. Proc. Natl. Acad. Sci. USA 86: 1173-1177),
Qβ Replicase (see, Lizardi, et al., 1988. BioTechnology 6: 1197), or any other nucleic acid
amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 

In an alternative embodiment, mutations in an NOVX gene from a sample cell can
be identified by alterations in restriction enzyme cleavage patterns. For example, sample
and control DNA is isolated, amplified (optionally), digested with one or more restriction
endonucleases, and fragment length sizes are determined by gel electrophoresis and
compared. Differences in fragment length sizes between sample and control DNA
indicate mutations in the sample DNA. Moreover, sequence-specific
ribozymes (see, e.g., U.S. Patent No. 5,493,531) can be used to score for the presence of
specific mutations by development or loss of a ribozyme cleavage site.
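
The restriction pattern comparison described above can be mimicked in silico. The Python sketch below is a simplified illustration only: it treats each DNA as a plain string, cuts at the first base of every occurrence of a recognition sequence rather than at the enzyme's actual cleavage position, and compares the sorted fragment lengths of sample and control. The sequences in the usage note and the function names are hypothetical.

```python
def digest(sequence, site):
    """Return sorted fragment lengths after cutting at each occurrence of site.

    Simplification: the cut is placed at the first base of the recognition
    sequence, not at the enzyme's true cleavage offset.
    """
    cuts = []
    pos = sequence.find(site)
    while pos != -1:
        cuts.append(pos)
        pos = sequence.find(site, pos + 1)
    bounds = [0] + cuts + [len(sequence)]
    # Skip zero-length fragments (e.g., a site at the very start).
    return sorted(b - a for a, b in zip(bounds, bounds[1:]) if b > a)


def restriction_pattern_differs(sample, control, site):
    """True if sample and control yield different fragment length patterns."""
    return digest(sample, site) != digest(control, site)
```

For a hypothetical 22-base control containing two GAATTC sites, `digest` gives fragments of 4, 8, and 10 bases; a sample in which a single base change destroys the second site gives fragments of 4 and 18 bases, so the two patterns differ.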

In other embodiments, genetic mutations in NOVX can be identified by 
hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays
containing hundreds or thousands of oligonucleotide probes. See, e.g., Cronin, et al.,
1996. Human Mutation 7: 244-255; Kozal, et al., 1996. Nat. Med. 2: 753-759. For
example, genetic mutations in NOVX can be identified in two dimensional arrays
containing light-generated DNA probes as described in Cronin, et al., supra. Briefly, a
first hybridization array of probes can be used to scan through long stretches of DNA in a
sample and control to identify base changes between the sequences by making linear
arrays of sequential overlapping probes. This step allows the identification of point
mutations. This is followed by a second hybridization array that allows the
characterization of specific mutations by using smaller, specialized probe arrays
complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other complementary 
to the mutant gene. 
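
As a rough illustration of the two array designs just described, the following Python sketch builds a linear set of sequential overlapping probes and a wild-type/mutant probe pair for one position. The function names, probe length and the 10-nt target are hypothetical choices made for this sketch and are not taken from Cronin, et al.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def tile_probes(target: str, probe_len: int, step: int = 1) -> list[str]:
    """Probes complementary to sequential overlapping windows of target."""
    return [
        reverse_complement(target[i:i + probe_len])
        for i in range(0, len(target) - probe_len + 1, step)
    ]


def mutation_probe_pair(wild_type: str, position: int, mutant_base: str, probe_len: int) -> tuple[str, str]:
    """Return (wild-type probe, mutant probe) covering a 0-based position."""
    half = probe_len // 2
    start = max(0, min(position - half, len(wild_type) - probe_len))
    mutant = wild_type[:position] + mutant_base + wild_type[position + 1:]
    return (
        reverse_complement(wild_type[start:start + probe_len]),
        reverse_complement(mutant[start:start + probe_len]),
    )


target = "ACGTACGTAC"  # hypothetical target
probes = tile_probes(target, probe_len=4)
wt_probe, mut_probe = mutation_probe_pair(target, position=5, mutant_base="T", probe_len=4)
print(len(probes), wt_probe, mut_probe)
```

Real arrays use much longer probes; the short length here only keeps the example readable.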

In yet another embodiment, any of a variety of sequencing reactions known in the
art can be used to directly sequence the NOVX gene and detect mutations by comparing
the sequence of the sample NOVX with the corresponding wild-type (control) sequence.
Examples of sequencing reactions include those based on techniques developed by Maxam
and Gilbert, 1977. Proc. Natl. Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl. Acad.
Sci. USA 74: 5463. It is also contemplated that any of a variety of automated sequencing
procedures can be utilized when performing the diagnostic assays (see, e.g., Naeve, et al.,
1995. Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT
International Publication No. WO 94/16101; Cohen, et al., 1996. Adv. Chromatography
36: 127-162; and Griffin, et al., 1993. Appl. Biochem. Biotechnol. 38: 147-159).
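
Once sample and control reads are available, the comparison step amounts to a position-by-position scan. The sketch below assumes the two sequences are already aligned without gaps and have equal length; the function name is hypothetical, the wild-type string is the first 12 nucleotides of the NOV1a open reading frame shown in Table 1A, and the sample variant is invented for illustration.

```python
def find_point_differences(wild_type: str, sample: str) -> list[tuple[int, str, str]]:
    """List (1-based position, wild-type base, sample base) for each mismatch."""
    if len(wild_type) != len(sample):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [
        (i + 1, w, s)
        for i, (w, s) in enumerate(zip(wild_type, sample))
        if w != s
    ]


wild_type = "ATGGTCAGCTCC"  # first 12 nt of the NOV1a ORF (Table 1A)
sample = "ATGGTCAGTTCC"     # hypothetical sample read with one substitution
print(find_point_differences(wild_type, sample))
```

Insertions and deletions would require a gapped alignment first, which this sketch does not attempt.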

Other methods for detecting mutations in the NOVX gene include methods in
which protection from cleavage agents is used to detect mismatched bases in RNA/RNA
or RNA/DNA heteroduplexes. See, e.g., Myers, et al., 1985. Science 230: 1242. In
general, the art technique of "mismatch cleavage" starts by providing heteroduplexes
formed by hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence
with potentially mutant RNA or DNA obtained from a tissue sample. The double-stranded
duplexes are treated with an agent that cleaves single-stranded regions of the duplex, such
as those that exist due to basepair mismatches between the control and sample strands.
For instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids
treated with S1 nuclease to enzymatically digest the mismatched regions. In other
embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with
hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched
regions. After digestion of the mismatched regions, the resulting material is then
separated by size on denaturing polyacrylamide gels to determine the site of mutation.
See, e.g., Cotton, et al., 1988. Proc. Natl. Acad. Sci. USA 85: 4397; Saleeba, et al., 1992.
Methods Enzymol. 217: 286-295. In an embodiment, the control DNA or RNA can be
labeled for detection.

In still another embodiment, the mismatch cleavage reaction employs one or more
proteins that recognize mismatched base pairs in double-stranded DNA (so-called "DNA
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations
in NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E.
coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells
cleaves T at G/T mismatches. See, e.g., Hsu, et al., 1994. Carcinogenesis 15: 1657-1662.
According to an exemplary embodiment, a probe based on an NOVX sequence, e.g., a
wild-type NOVX sequence, is hybridized to a cDNA or other DNA product from a test
cell(s). The duplex is treated with a DNA mismatch repair enzyme, and the cleavage
products, if any, can be detected from electrophoresis protocols or the like. See, e.g., U.S.
Patent No. 5,459,039.

In other embodiments, alterations in electrophoretic mobility will be used to
identify mutations in NOVX genes. For example, single strand conformation
polymorphism (SSCP) may be used to detect differences in electrophoretic mobility
between mutant and wild-type nucleic acids. See, e.g., Orita, et al., 1989. Proc. Natl.
Acad. Sci. USA 86: 2766; Cotton, 1993. Mutat. Res. 285: 125-144; Hayashi, 1992. Genet.
Anal. Tech. Appl. 9: 73-79. Single-stranded DNA fragments of sample and control NOVX
nucleic acids will be denatured and allowed to renature. The secondary structure of
single-stranded nucleic acids varies according to sequence, and the resulting alteration in
electrophoretic mobility enables the detection of even a single base change. The DNA
fragments may be labeled or detected with labeled probes. The sensitivity of the assay
may be enhanced by using RNA (rather than DNA), in which the secondary structure is
more sensitive to a change in sequence. In one embodiment, the subject method utilizes
heteroduplex analysis to separate double-stranded heteroduplex molecules on the basis of
changes in electrophoretic mobility. See, e.g., Keen, et al., 1991. Trends Genet. 7: 5.

In yet another embodiment, the movement of mutant or wild-type fragments in
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing
gradient gel electrophoresis (DGGE). See, e.g., Myers, et al., 1985. Nature 313: 495.
When DGGE is used as the method of analysis, DNA will be modified to ensure that it
does not completely denature, for example by adding a GC clamp of approximately 40 bp
of high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient
is used in place of a denaturing gradient to identify differences in the mobility of control
and sample DNA. See, e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753.

Examples of other techniques for detecting point mutations include, but are not
limited to, selective oligonucleotide hybridization, selective amplification, or selective
primer extension. For example, oligonucleotide primers may be prepared in which the
known mutation is placed centrally and then hybridized to target DNA under conditions
that permit hybridization only if a perfect match is found. See, e.g., Saiki, et al., 1986.
Nature 324: 163; Saiki, et al., 1989. Proc. Natl. Acad. Sci. USA 86: 6230. Such
allele-specific oligonucleotides are hybridized to PCR-amplified target DNA, or to a number of
different mutations when the oligonucleotides are attached to the hybridizing membrane
and hybridized with labeled target DNA.

Alternatively, allele-specific amplification technology that depends on selective
PCR amplification may be used in conjunction with the instant invention.
Oligonucleotides used as primers for specific amplification may carry the mutation of
interest in the center of the molecule (so that amplification depends on differential
hybridization; see, e.g., Gibbs, et al., 1989. Nucl. Acids Res. 17: 2437-2448) or at the
extreme 3'-terminus of one primer where, under appropriate conditions, mismatch can
prevent or reduce polymerase extension (see, e.g., Prossner, 1993. Tibtech. 11: 238). In
addition, it may be desirable to introduce a novel restriction site in the region of the
mutation to create cleavage-based detection. See, e.g., Gasparini, et al., 1992. Mol. Cell
Probes 6: 1. It is anticipated that in certain embodiments amplification may also be
performed using Taq ligase. See, e.g., Barany, 1991. Proc. Natl. Acad.
Sci. USA 88: 189. In such cases, ligation will occur only if there is a perfect match at the
3'-terminus of the 5' sequence, making it possible to detect the presence of a known
mutation at a specific site by looking for the presence or absence of amplification.
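
The 3'-terminal discrimination described above can be modeled as a simple match test. In this hedged Python sketch the target is written 5' to 3' on the strand that has the same sequence as the primers, extension is treated as all-or-nothing, and the function names, primers and targets are hypothetical.

```python
def three_prime_matches(primer: str, target: str, start: int) -> bool:
    """True if the primer's 3'-terminal base matches the target base it aligns to."""
    end = start + len(primer)
    return end <= len(target) and primer[-1] == target[end - 1]


def call_allele(target: str, start: int, wt_primer: str, mutant_primer: str) -> str:
    """Infer the allele from which allele-specific primer would be extended."""
    wt = three_prime_matches(wt_primer, target, start)
    mut = three_prime_matches(mutant_primer, target, start)
    if wt and not mut:
        return "wild-type"
    if mut and not wt:
        return "mutant"
    return "undetermined"


# Hypothetical primers differing only at the 3'-terminal base.
wt_primer = "ATGGTCAGC"
mutant_primer = "ATGGTCAGT"

print(call_allele("ATGGTCAGCTCC", 0, wt_primer, mutant_primer))
print(call_allele("ATGGTCAGTTCC", 0, wt_primer, mutant_primer))
```

In practice a 3' mismatch reduces rather than abolishes extension, so the presence or absence of product is read against controls.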

The methods described herein may be performed, for example, by utilizing
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody
reagent described herein, which may be conveniently used, e.g., in clinical settings to
diagnose patients exhibiting symptoms or family history of a disease or illness involving
an NOVX gene.

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which
NOVX is expressed may be utilized in the prognostic assays described herein. However,
any biological sample containing nucleated cells may be used, including, for example,
buccal mucosal cells.

Pharmacogenomics 

Agents, or modulators, that have a stimulatory or inhibitory effect on NOVX
activity (e.g., NOVX gene expression), as identified by a screening assay described herein,
can be administered to individuals to treat (prophylactically or therapeutically) disorders.
The disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia,
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease,
Parkinson's Disorder, immune disorders, and hematopoietic disorders, and the various
dyslipidemias, metabolic disturbances associated with obesity, the metabolic syndrome X
and wasting disorders associated with chronic diseases and various cancers. In
conjunction with such treatment, the pharmacogenomics (i.e., the study of the relationship
between an individual's genotype and that individual's response to a foreign compound or
drug) of the individual may be considered. Differences in metabolism of therapeutics can
lead to severe toxicity or therapeutic failure by altering the relation between dose and
blood concentration of the pharmacologically active drug. Thus, the pharmacogenomics
of the individual permits the selection of effective agents (e.g., drugs) for prophylactic or
therapeutic treatments based on a consideration of the individual's genotype. Such
pharmacogenomics can further be used to determine appropriate dosages and therapeutic
regimens. Accordingly, the activity of NOVX protein, expression of NOVX nucleic acid,
or mutation content of NOVX genes in an individual can be determined to thereby select
appropriate agent(s) for therapeutic or prophylactic treatment of the individual.

Pharmacogenomics deals with clinically significant hereditary variations in the
response to drugs due to altered drug disposition and abnormal action in affected persons.
See, e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol. Physiol., 23: 983-985; Linder, 1997.
Clin. Chem., 43: 254-266. In general, two types of pharmacogenetic conditions can be
differentiated: genetic conditions transmitted as a single factor altering the way drugs act
on the body (altered drug action), and genetic conditions transmitted as single factors
altering the way the body acts on drugs (altered drug metabolism). These
pharmacogenetic conditions can occur either as rare defects or as polymorphisms. For
example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited
enzymopathy in which the main clinical complication is hemolysis after ingestion of
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of
fava beans.

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT
2) and cytochrome P450 enzymes CYP2D6
and CYP2C19) has provided an explanation as to why some patients do not obtain the 
expected drug effects or show exaggerated drug response and serious toxicity after taking 
the standard and safe dose of a drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). 
The prevalence of PM is different among different populations. For example, the gene 
coding for CYP2D6 is highly polymorphic and several mutations have been identified in 
PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 
and CYP2C19 quite frequently experience exaggerated drug response and side effects 
when they receive standard doses. If a metabolite is the active therapeutic moiety, PM 
show no therapeutic response, as demonstrated for the analgesic effect of codeine 
mediated by its CYP2D6-formed metabolite morphine. At the other extreme are the
so-called ultra-rapid metabolizers who do not respond to standard doses. Recently, the
molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
amplification. 
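
The relationship between genotype and metabolizer phenotype outlined above can be expressed as a toy lookup. The sketch below is a deliberate simplification: the copy-number thresholds (zero functional copies for PM, more than two for ultra-rapid) are assumptions made for illustration, not values stated in the text, and the function name is hypothetical.

```python
def metabolizer_phenotype(functional_copies: int) -> str:
    """Map the number of functional CYP2D6 gene copies to a phenotype label.

    Thresholds are illustrative assumptions only.
    """
    if functional_copies < 0:
        raise ValueError("copy number cannot be negative")
    if functional_copies == 0:
        return "poor metabolizer (PM)"      # no functional CYP2D6
    if functional_copies <= 2:
        return "extensive metabolizer (EM)"
    return "ultra-rapid metabolizer"        # gene amplification


for copies in (0, 2, 3):
    print(copies, metabolizer_phenotype(copies))
```

Clinical genotype-to-phenotype translation relies on curated allele function data rather than a bare copy count.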

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or mutation 
content of NOVX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding 
drug-metabolizing enzymes to the identification of an individual's drug responsiveness 
phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency
when treating a subject with an NOVX modulator, such as a modulator identified by one
of the exemplary screening assays described herein.

Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or
activity of NOVX (e.g., the ability to modulate aberrant cell proliferation and/or
differentiation) can be applied not only in basic drug screening, but also in clinical trials.
For example, the effectiveness of an agent determined by a screening assay as described
herein to increase NOVX gene expression, protein levels, or upregulate NOVX activity,
can be monitored in clinical trials of subjects exhibiting decreased NOVX gene
expression, protein levels, or downregulated NOVX activity. Alternatively, the
effectiveness of an agent determined by a screening assay to decrease NOVX gene
expression, protein levels, or downregulate NOVX activity, can be monitored in clinical
trials of subjects exhibiting increased NOVX gene expression, protein levels, or
upregulated NOVX activity. In such clinical trials, the expression or activity of NOVX
and, preferably, other genes that have been implicated in, for example, a cellular
proliferation or immune disorder can be used as a "read out" or markers of the immune
responsiveness of a particular cell.

By way of example, and not of limitation, genes, including NOVX, that are
modulated in cells by treatment with an agent (e.g., compound, drug or small molecule)
that modulates NOVX activity (e.g., identified in a screening assay as described herein)
can be identified. Thus, to study the effect of agents on cellular proliferation disorders, for
example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the
levels of expression of NOVX and other genes implicated in the disorder. The levels of
gene expression (i.e., a gene expression pattern) can be quantified by Northern blot
analysis or RT-PCR, as described herein, or alternatively by measuring the amount of
protein produced, by one of the methods as described herein, or by measuring the levels of
activity of NOVX or other genes. In this manner, the gene expression pattern can serve as
a marker, indicative of the physiological response of the cells to the agent. Accordingly,
this response state may be determined before, and at various points during, treatment of
the individual with the agent.

In one embodiment, the invention provides a method for monitoring the
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, protein,
peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate identified
by the screening assays described herein) comprising the steps of (i) obtaining a
pre-administration sample from a subject prior to administration of the agent; (ii) detecting
the level of expression of an NOVX protein, mRNA, or genomic DNA in the
pre-administration sample; (iii) obtaining one or more post-administration samples from
the subject; (iv) detecting the level of expression or activity of the NOVX protein, mRNA,
or genomic DNA in the post-administration samples; (v) comparing the level of expression
or activity of the NOVX protein, mRNA, or genomic DNA in the pre-administration
sample with the NOVX protein, mRNA, or genomic DNA in the post-administration
sample or samples; and (vi) altering the administration of the agent to the subject
accordingly. For example, increased administration of the agent may be desirable to
increase the expression or activity of NOVX to higher levels than detected, i.e., to increase
the effectiveness of the agent. Alternatively, decreased administration of the agent may be
desirable to decrease expression or activity of NOVX to lower levels than detected, i.e., to
decrease the effectiveness of the agent.
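
The comparison in step (v) can be sketched as a ratio of post-administration to pre-administration levels. The Python below is a minimal illustration: the function name, the use of the mean of the post-administration samples, and the 10% tolerance band are all assumptions introduced here, and the expression values are invented.

```python
from statistics import mean


def compare_expression(pre_level: float, post_levels: list[float], tolerance: float = 0.10) -> str:
    """Classify the change in NOVX level after administration of an agent.

    pre_level   -- level in the pre-administration sample (arbitrary units)
    post_levels -- levels in one or more post-administration samples
    tolerance   -- fractional change treated as no change (assumed value)
    """
    if pre_level <= 0 or not post_levels:
        raise ValueError("need a positive baseline and at least one post sample")
    ratio = mean(post_levels) / pre_level
    if ratio > 1 + tolerance:
        return "increased"
    if ratio < 1 - tolerance:
        return "decreased"
    return "unchanged"


print(compare_expression(100.0, [150.0, 170.0]))
print(compare_expression(100.0, [60.0, 40.0]))
print(compare_expression(100.0, [105.0, 95.0]))
```

How the result feeds into step (vi) depends on whether the agent is intended to raise or lower NOVX expression or activity.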

Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a
subject at risk of (or susceptible to) a disorder or having a disorder associated with
aberrant NOVX expression or activity. The disorders include cardiomyopathy,
atherosclerosis, hypertension, congenital heart defects, aortic stenosis, atrial septal defect
(ASD), atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis,
subaortic stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis,
scleroderma, obesity, transplantation, adrenoleukodystrophy, congenital adrenal
hyperplasia, prostate cancer, neoplasm, adenocarcinoma, lymphoma, uterus cancer,
fertility, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura,
immunodeficiencies, graft versus host disease, AIDS, bronchial asthma, Crohn's disease,
multiple sclerosis, Albright Hereditary Osteodystrophy, and other diseases,
disorders and conditions of the like.

These methods of treatment will be discussed more fully, below. 



Diseases and Disorders

Diseases and disorders that are characterized by increased (relative to a subject not
suffering from the disease or disorder) levels or biological activity may be treated with
Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that
may be utilized include, but are not limited to: (i) an aforementioned peptide, or analogs,
derivatives, fragments or homologs thereof; (ii) antibodies to an aforementioned peptide;
(iii) nucleic acids encoding an aforementioned peptide; (iv) administration of antisense
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion
within the coding sequences of an aforementioned peptide) that are
utilized to "knockout" endogenous function of an aforementioned peptide by homologous
recombination (see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators
(i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the
invention or antibodies specific to a peptide of the invention) that alter the interaction
between an aforementioned peptide and its binding partner.

Diseases and disorders that are characterized by decreased (relative to a subject not
suffering from the disease or disorder) levels or biological activity may be treated with
Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that
may be utilized include, but are not limited to, an aforementioned peptide, or analogs,
derivatives, fragments or homologs thereof; or an agonist that increases bioavailability.

Increased or decreased levels can be readily detected by quantifying peptide and/or
RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in
vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or
mRNAs of an aforementioned peptide). Methods that are well-known within the art
include, but are not limited to, immunoassays (e.g., by Western blot analysis,
immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, and the like).

Prophylactic Methods

In one aspect, the invention provides a method for preventing, in a subject, a
disease or condition associated with an aberrant NOVX expression or activity, by
administering to the subject an agent that modulates NOVX expression or at least one
NOVX activity. Subjects at risk for a disease that is caused or contributed to by aberrant
NOVX expression or activity can be identified by, for example, any or a combination of
diagnostic or prognostic assays as described herein. Administration of a prophylactic
agent can occur prior to the manifestation of symptoms characteristic of the NOVX
aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its
progression. Depending upon the type of NOVX aberrancy, for example, an NOVX
agonist or NOVX antagonist agent can be used for treating the subject. The appropriate
agent can be determined based on screening assays described herein. The prophylactic
methods of the invention are further discussed in the following subsections.

Therapeutic Methods

Another aspect of the invention pertains to methods of modulating NOVX
expression or activity for therapeutic purposes. The modulatory method of the invention
involves contacting a cell with an agent that modulates one or more of the activities of
NOVX protein associated with the cell. An agent that modulates NOVX protein
activity can be an agent as described herein, such as a nucleic acid or a protein, a
naturally-occurring cognate ligand of an NOVX protein, a peptide, an NOVX
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or
more NOVX protein activities. Examples of such stimulatory agents include active NOVX
protein and a nucleic acid molecule encoding NOVX that has been introduced into the
cell. In another embodiment, the agent inhibits one or more NOVX protein activities.
Examples of such inhibitory agents include antisense NOVX nucleic acid molecules and
anti-NOVX antibodies. These modulatory methods can be performed in vitro (e.g., by
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the agent
to a subject). As such, the invention provides methods of treating an individual afflicted
with a disease or disorder characterized by aberrant expression or activity of an NOVX
protein or nucleic acid molecule. In one embodiment, the method involves administering
an agent (e.g., an agent identified by a screening assay described herein), or combination
of agents that modulates (e.g., up-regulates or down-regulates) NOVX expression or
activity. In another embodiment, the method involves administering an NOVX protein or
nucleic acid molecule as therapy to compensate for reduced or aberrant NOVX expression
or activity.

Stimulation of NOVX activity is desirable in situations in which NOVX is
abnormally downregulated and/or in which increased NOVX activity is likely to have a
beneficial effect. One example of such a situation is where a subject has a disorder
characterized by aberrant cell proliferation and/or differentiation (e.g., cancer or immune
associated disorders). Another example of such a situation is where the subject has a
gestational disease (e.g., preeclampsia).

Determination of the Biological Effect of the Therapeutic

In various embodiments of the invention, suitable in vitro or in vivo assays are 
performed to determine the effect of a specific Therapeutic and whether its administration 
is indicated for treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic
exerts the desired effect upon the cell type(s). Compounds for use in therapy may be
tested in suitable animal model systems including, but not limited to, rats, mice, chicken,
cows, monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in
vivo testing, any of the animal model systems known in the art may be used prior to
administration to human subjects.

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

The NOVX nucleic acids and proteins of the invention are useful in potential
prophylactic and therapeutic applications implicated in a variety of disorders including,
but not limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia,
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease,
Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various
dyslipidemias, metabolic disturbances associated with obesity, the metabolic syndrome X
and wasting disorders associated with chronic diseases and various cancers.

As an example, a cDNA encoding the NOVX protein of the invention may be
useful in gene therapy, and the protein may be useful when administered to a subject in
need thereof. By way of non-limiting example, the compositions of the invention will
have efficacy for treatment of patients suffering from: metabolic disorders, diabetes,
obesity, infectious disease, anorexia, cancer-associated cachexia, cancer,
neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune
disorders, hematopoietic disorders, and the various dyslipidemias.

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of
the invention, or fragments thereof, may also be useful in diagnostic applications, wherein
the presence or amount of the nucleic acid or the protein is to be assessed. A further use
could be as an anti-bacterial molecule (i.e., some peptides have been found to possess
anti-bacterial properties). These materials are further useful in the generation of antibodies
which immunospecifically bind to the novel substances of the invention for use in
therapeutic or diagnostic methods.

Sequence Analyses 

The sequence of NOVX was derived by laboratory cloning of cDNA fragments, by
in silico prediction of the sequence. cDNA fragments covering either the full length of the
DNA sequence, or part of the sequence, or both, were cloned. In silico prediction was
based on sequences available in CuraGen's proprietary sequence databases or in the public
human sequence databases, and provided either the full length DNA sequence, or some
portion thereof.

The laboratory cloning was performed using one or more of the methods 
summarized below: 

SeqCalling™ Technology: cDNA was derived from various human samples
representing multiple tissue types, normal and diseased states, physiological states, and
developmental states from different donors. Samples were obtained as whole tissue,
primary cells or tissue cultured primary cells or cell lines. Cells and cell lines may have
been treated with biological or chemical agents that regulate gene expression, for example,
growth factors, chemokines or steroids. The cDNA thus derived was then sequenced using
CuraGen Corporation's SeqCalling technology which is disclosed in full in U.S. Ser. Nos.
09/417,386 filed Oct. 13, 1999, and 09/614,505 filed July 11, 2000. Sequence traces were
evaluated manually and edited for corrections if appropriate. cDNA sequences from all
samples were assembled together, sometimes including public human sequences, using
bioinformatics programs to produce a consensus sequence for each assembly. Each
assembly is included in CuraGen Corporation's database. Sequences were included as
components for assembly when the extent of identity with another component was at least
95% over 50 bp. Each assembly represents a gene or portion thereof and includes
information on variants, such as splice forms, single nucleotide polymorphisms (SNPs),
insertions, deletions and other sequence variations.
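
The inclusion criterion of at least 95% identity over 50 bp can be illustrated with a sliding-window check. This sketch assumes the two components are already aligned without gaps; the function name and its parameters are hypothetical and do not describe the SeqCalling software itself.

```python
def meets_identity_criterion(a: str, b: str, window: int = 50, min_identity: float = 0.95) -> bool:
    """True if any ungapped window of `window` aligned positions has identity >= min_identity."""
    n = min(len(a), len(b))
    if n < window:
        return False
    matches = [x == y for x, y in zip(a, b)]
    needed = min_identity * window
    count = sum(matches[:window])  # matches in the first window
    if count >= needed:
        return True
    for i in range(window, n):
        # Slide the window one position to the right.
        count += matches[i] - matches[i - window]
        if count >= needed:
            return True
    return False


reference = "A" * 60
two_mismatches = "A" * 20 + "C" + "A" * 9 + "C" + "A" * 29
three_mismatches = "A" * 20 + "C" + "A" * 4 + "C" + "A" * 4 + "C" + "A" * 29

print(meets_identity_criterion(reference, two_mismatches))    # 48/50 identical
print(meets_identity_criterion(reference, three_mismatches))  # at most 47/50 identical
```

With a 50-position window the threshold works out to at least 48 matching positions.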

Variant sequences are also included in this application. A variant sequence can
include a single nucleotide polymorphism (SNP). A SNP can, in some instances, be
referred to as a "cSNP" to denote that the nucleotide sequence containing the SNP
originates as a cDNA. A SNP can arise in several ways. For example, a SNP may be due
to a substitution of one nucleotide for another at the polymorphic site. Such a substitution
can be either a transition or a transversion. A SNP can also arise from a deletion of a
nucleotide or an insertion of a nucleotide, relative to a reference allele. In this case, the
polymorphic site is a site at which one allele bears a gap with respect to a particular
nucleotide in another allele. SNPs occurring within genes may result in an alteration of the
amino acid encoded by the gene at the position of the SNP. Intragenic SNPs may also be
silent, when a codon including a SNP encodes the same amino acid as a result of the
redundancy of the genetic code. SNPs occurring outside the region of a gene, or in an
intron within a gene, do not result in changes in any amino acid sequence of a protein but
may result in altered regulation of the expression pattern. Examples include alteration in
temporal expression, physiological response regulation, cell type expression regulation,
intensity of expression, and stability of transcribed message.
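
The distinction between a transition and a transversion follows from base chemistry: a transition exchanges a purine for a purine or a pyrimidine for a pyrimidine, while a transversion exchanges one class for the other. A minimal Python sketch, with a hypothetical function name, is:

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref: str, alt: str) -> str:
    """Classify a single-base substitution as a transition or a transversion."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or {ref, alt} - (PURINES | PYRIMIDINES):
        raise ValueError("expected two different bases from A, C, G, T")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


print(classify_substitution("A", "G"))
print(classify_substitution("A", "C"))
```

Insertions and deletions fall outside this classification and would need to be handled separately.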

Presented information includes that associated with genomic clones, public genes
and ESTs sharing sequence identity with the disclosed sequence and CuraGen
Corporation's Electronic Northern bioinformatic tool.

EXAMPLES 

Example A: Polynucleotide and Polypeptide Sequences, and Homology Data 

The NOV1 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 1A.

Table 1A. NOV1 Sequence Analysis

SEQ ID NO: 1    750 bp
NOV1a, CG100126-01 DNA Sequence


TCCAAACCCAAGAACTTCACTCTCTTGGAAACCCACCCAGATCCTCCCCGTTCTGACA 
CCATGGTCAGCTCCTGTTGTGGCTCCGTGTGCTCTGACCAGGGCTGCGGCCAAGACCT 
CTGTCAGGAGACCTGCTGCCGCCCCAGCTGCTGTGAGACCACCTGCTGCAGGACCACC 
TGCTGCCGCCCCAGCTGTTGTGTATCCAGCTGCTGCAGGCCCCAGTGCTGCCAGTCTG 
TGTGCTGCCAACCCACTTGTTCCCGCCCCAGCTGCTGTCAGACCACCTGTTGCAGGAC 

CCTGTGTGCTGCCAACCCACCTGCTGTCGCCCCAGCTGCTGTGAGACGACCTGCTGCC 
ACCCTAGGTGCTGCATCTCCAGCTGCTGTCGCCCCAGCTGCTGTGTGTCCAGCTGCTG 




CTGTGTCATCTCCAGCTGCCCCCGCCCCTTGTGCTGTGCCTCCTCTTGCTGCTGAGCC 
CACTGCCCTGGCTTATCTCCCCCTTCACCACTGGCCCACAGATGTAGACCCTTC 




ORF Start: ATG at 61    ORF Stop: TGA at 691

SEQ ID NO: 2    210 aa    MW at 22433.0kD
NOV1a, CG100126-01 Protein Sequence


MVSSCCGSVCSDQGCGQDLCQETCCRPSCCETTCCRTTCCRPSCCVSSCCRPQCCQSV 
CCQPTCSRPSCCQTTCCRTTCYRPSCCVSSCCRPQCCQPVCCQPTCCRPSCCETTCCH 
PRCCISSCCRPSCCVSSCCKPQCCQSVCCQPNCCRPSCSISSCCRPSCCESSCCRPCC 
CVRPVCGRVSCHTTCYRPTCVISSCPRPLCCASSCC 
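
The ORF coordinates and protein length in Table 1A can be cross-checked with a little arithmetic: (691 - 61) / 3 = 210 codons, which agrees with the stated 210 aa. The Python sketch below also translates the first two printed lines of SEQ ID NO: 1 with the standard genetic code; the helper names are hypothetical, and only the 116-nt prefix of the sequence is used.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons ordered T, C, A, G at each position.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def orf_codon_count(start: int, stop: int) -> int:
    """Codons between the first base of the start codon and the first base of the stop codon (1-based)."""
    span = stop - start
    if span % 3:
        raise ValueError("start and stop codons are not in the same frame")
    return span // 3


def translate(dna: str, start: int) -> str:
    """Translate from a 1-based start position until a stop codon or the end of dna."""
    protein = []
    for i in range(start - 1, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines of SEQ ID NO: 1 as printed in Table 1A (116 of 750 nt).
dna_prefix = (
    "TCCAAACCCAAGAACTTCACTCTCTTGGAAACCCACCCAGATCCTCCCCGTTCTGACA"
    "CCATGGTCAGCTCCTGTTGTGGCTCCGTGTGCTCTGACCAGGGCTGCGGCCAAGACCT"
)

print(orf_codon_count(61, 691))
print(translate(dna_prefix, 61))
```

The translated prefix should agree with the opening residues of SEQ ID NO: 2.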



Further analysis of the NOV1a protein yielded the following properties shown in
Table 1B.

Table 1B. Protein Sequence Properties NOV1a

Psort analysis:    0.6500 probability located in cytoplasm; 0.1000 probability located in
                   mitochondrial matrix space; 0.1000 probability located in lysosome (lumen);
                   0.0000 probability located in endoplasmic reticulum (membrane)

SignalP analysis:  No Known Signal Sequence Predicted



A search of the NOV1a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 1C.

Table 1C. Geneseq Results for NOV1a

Geneseq     Protein/Organism/Length                        NOV1a Residues/  Identities/Similarities  Expect
Identifier  [Patent #, Date]                               Match Residues   for the Matched Region   Value

AAE02058    Human four disulfide core domain               1..209           149/218 (68%)            e-102
            (FDCD)-containing protein - Homo sapiens,      1..204           167/218 (76%)
            230 aa. [WO200140249-A1, 07-JUN-2001]

ABG29368    Novel human diagnostic protein #29359 -        1..210           134/210 (63%)            2e-94
            Homo sapiens, 206 aa.                          12..206          158/210 (74%)
            [WO200175067-A2, 11-OCT-2001]

ABG29368    Novel human diagnostic protein #29359 -        1..210           134/210 (63%)            2e-94
            Homo sapiens, 206 aa.                          12..206          158/210 (74%)
            [WO200175067-A2, 11-OCT-2001]

ABB12277    Human hair keratin associated protein          1..120           119/120 (99%)            8e-83
            homologue, SEQ ID NO:2647 - Homo sapiens,      1..120           119/120 (99%)
            120 aa. [WO200157188-A2, 09-AUG-2001]

AAM79986    Human protein SEQ ID NO 3632 - Homo sapiens,   2..210           110/227 (48%)            2e-70
            301 aa. [WO200157190-A2, 09-AUG-2001]          80..301          141/227 (61%)



In a BLAST search of public sequence databases, the NOV1a protein was found to
have homology to the proteins shown in the BLASTP data in Table 1D.
Table 1D. Public BLASTP Results for NOV1a

Protein Accession Number | Protein/Organism/Length | NOV1a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q9BYQ8 | KERATIN ASSOCIATED PROTEIN 4.9 - Homo sapiens (Human), 191 aa (fragment). | 20..210 / 1..191 | 191/191 (100%); 191/191 (100%) | e-137
Q9BQ66 | KERATIN ASSOCIATED PROTEIN 4.12 (SIMILAR TO RIKEN CDNA 1110054P19 GENE) - Homo sapiens (Human), 201 aa. | 1..210 / 1..201 | 183/210 (87%); 191/210 (90%) | e-128
Q9BYR0 | KERATIN ASSOCIATED PROTEIN 4.7 - Homo sapiens (Human), 210 aa. | 1..210 / 1..210 | 179/215 (83%); 188/215 (87%) | e-127
Q9BYQ5 | KERATIN ASSOCIATED PROTEIN 4.15 - Homo sapiens (Human), 193 aa (fragment). | 13..210 / 1..193 | 171/207 (82%); 175/207 (83%) | e-114
Q9BYQ6 | KERATIN ASSOCIATED PROTEIN 4.14 - Homo sapiens (Human), 195 aa. | 1..210 / 1..195 | 161/210 (76%); 172/210 (81%) | e-112



PFam analysis predicts that the NOV1a protein contains the domains shown in the
Table 1E.



Table 1E. Domain Analysis of NOV1a

Pfam Domain | NOV1a Match Region | Identities/Similarities for the Matched Region | Expect Value
Keratin_B2 | 2..108 | 57/177 (32%); 92/177 (52%) | 6.5e-12
Keratin_B2 | 109..210 | 54/177 (31%); 82/177 (46%) | 3.4e-05



Example 2. 



The NOV2 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 2A.

Table 2A. NOV2 Sequence Analysis



SEQ ID NO: 3
NOV2a, CG100146-01 DNA Sequence



CCAAACACACTAAAATAAATATGAGGTCATCAATCTTTTG
ACCTATTCAGACTGTTAGTATTATGTATTTACTTCAAATTTTAGCAGTTATATTTTAA
CTTGATTGATTTTTCCTCAGATATAAGTATGAGAAATGACAGAAAGAAACAACAACTG
GAAAAGAAGCATTGCATAAGACCAGGATGTCTCTGAAATGGACGTCAGTCTTTCTGCT
GATACAGCTCAGTTTTTACTTTAGCTCTGGGAGTTGTGGAAAGGTGCTGGTATGGCCC 
ACAGAATACAGCCTTTGGATGAATATGAAGACAATCCTGAAAGAACTTGTTCAGAGAG 
GTCATGAGGTGACTGTACTGGCATCTTCAGCTTCCATTCTTTTTGATCCCAACGACTC 
ATCCACTCTTAAACTTGAAGTTTATCCTACATCTTTAACTAAAAATGAATTTGAGAAT 
ATCATCATGCAATTGGTTAAGAGATGGATATATGGTGTTTCAAAAAGATGCATTTTGG 
iCCTTTCACAAGAACAAGAAATCCTGTGGGCAATATTATGACATAATTAGAAACTT 
CTGTAAAGATGTAGTTTCAAATAAGAAACTTATGAAAAAACTACAAGAGTCAAGATTT 
GACATCGTTTTTGCAGATGCTTATTTACCCTGTGGTGAGCTGCTGGCTGAGCTATTTA 
ACATACCCTTTGTGTACAGTCTCCGCTTCTCTGTTGGCTACACAGTTGAGAAGAATGG 
TGGAGGATTTCTGTTCCCTCCTTCCTATGTACCTGTTGTTATGTCAGAATTAAGTGAT 
CAAATGACTTTCATGGAGAGGGTAAAAAATATGATCTATGTGCTTTACTTTGACTTTT 
GGTTCGAAATATTTGACATGAAGAAGTGGGATCAGTTTTATAGTGAAGTTCTAGGAAG 
ACCCACTACATTATCTGAGACAATGGGGAAAGCTGACGTATGGCTTATTCGAAACTAC 
TGGGATTTTCAATTTCCTCACCCACTCTTACCAAATGTTGAGTTCGTTGGAGGACTCC 
ACTGCAAACCTGCCAAACCCCTACCGAAGGAAATGGAAGAGTTTGTCCAGAGCTCTGG 
AGAAAATGGTGTTGTGGTGTTTTCTCTGGGGTCGATGGTCAGTAACATGTCAGAAGAA 
AGTGCCAACATGATTGCATCAGCCCTTGCCCAGATCCCACAAAAGGTTCTATGGAGAT 
TTGATGGCAAGAAGCCAAATACTTTAGGTTCCAATACTCGACTGTACAAGTGGATACC 
CCAGAATGACCTTCTAGGTCATCCAAAGACCAAAGCTTTTATAACTCATGGTGGAACC 



CGGATCAACATGATAACATTGCTCACATGAAAGCCAAGGGAGCAGCCCTCAGTGTGGA 
CATCAGGACCATGTCAAGTAGAGATTTGCTCAATGCATTGAAGTCAGTCATTAATGAC 
CCTATCTATAAAGAGAATATCATGAAATTATCAAGAATTCATCATGATCAACCAATGA 
AGCCCCTGGATCGAGCAGTCTTCTGGATTGAGTTTGTCATGCGCCACAAAGGAGCCAA 
GCACCTTCGAGTCGCAGCCCACAACCTCACCTGGTTCCAGTACCACTCTTTGGATGTG 



TCTGAGATTTGAAGCTGGGGAATTCCGTTTATTGAAGATTCAGGTTAACCTGAATCAA 
GTTAACCCAGTCTCAAATGCTCACTTATCCTTTCTTCTTGTGACAAATCTCTCCTCTC 
CTGGATTGCCAAAGAAAATTCAAATTATTCTTCAATTAGTCAGGATGATTTGACTATC 
AGCAGTTCATAGTACCCATCTTCATAACTAAGCCACCTAGGGATCCAGCAGAAAAAAA 
AGGGATGAGGGGAGTCATCATACAGGAGGTGGATCATTTACCAGGATCCACACTTCCT 
ACAAAGCGGTTGTAATGTTAAATAACAAAACTGTTTTTTATTCCAATCTTCACATAAA 
ACAGGAATAATTGTATACTTTCTTACTAATGTGTTCCATGGAGTTTTTCCTCCAAGAA 
GTGGCTTAGGGGAAAATGAGCCCCAGTAATGCTTTGTGGCATCCAATCCTTCTACCCC 
GACCCTTTGACTTTCTGCCCCAGCCCCTCTTAGTTCTCCTAGAATTAGGACTAAGGTT 
AAGTGCCCTCTTGGGATATGACTTCCTTCCCTTCCTCTTGATACAAAAAGAGCCTATT 
ACCAACCCTCATACACACAAGAGTTCCCTTCCTAGTTGCAGACTCTTCTGCTCCAGCT 
GGACTCCCCTAGCTCTGGACTCCCACTAGATCACACAGGGGTCCCTGCATGTCAGTAA 
ACTTTGGATGACCTTGGGAGACCAAAAAATGGAATATCATTTTTTGATCTAAACAAAA 
TAGTTTCCTGATTTAACACTGGCCAGGAAGGTGGGCTGCACCCTCAGTCTCTCTCTCC 
CATCATGGTTTTCACATGATATCAAAGGACTCTCATAACAGTCTGATTCTTATGAGTT 
GGGCATCCTGTGTTTCCCTTTJ 
ATGAAACCTATTCACTCTGGACTTGGGATGGCTCTTCTCCATCTTCCCAAGTCTGAGC 
TGGAGCCTCCATGCCCAACTCTC 



ORF Start: ATG at 201 



ORF Stop: TAG at 1791 



530 aa | MW at 60923.7kD
NOV2a, CG100146-01 Protein Sequence



MSLKWTSVFLLIQLSFYFSSGSCGKVLVWPTEYSLWMNMKTILKELVQRGHEVTVLAS
SASILFDPKDSSTLKLEVYPTSLTKNEFENIIMQLVKRWIYGVSKRCILVTFHKNKKS
CGQYYDIIRNFCKDVVSNKKLMKKLQESRFDIVFADAYLPCGELLAELFNIPFVYSLR
FSVGYTVEKNGGGFLFPPSYVPVVMSELSDQMTFMERVKNMIYVLYFDFWFEIFDMKK
WDQFYSEVLGRPTTLSETMGKADVWLIRNYWDFQFPHPLLPNVEFVGGLHCKPAKPLP
KEMEEFVQSSGENGVVVFSLGSMVSNMSEESANMIASALAQIPQKVLWRFDGKKPNTL
GSNTRLYKWIPQNDLLGHPKTKAFITHGGTNGIYEAIYHGIPMVGIPLFADQHDNIAH 
MKAKGAALSVDIRTMSSRDLLNALKSVINDPIYKENIMKLSRIHHDQPMKPLDRAVFW 
IEFVMRHKGAKHLRVAAHNLTWFQYHSLDVIAFLLACVATMIFMITKCCLFCFRKFAR 
KAKKGKND 
Further analysis of the NOV2a protein yielded the following properties shown in
Table 2B. 



Table 2B. Protein Sequence Properties NOV2a 


PSort analysis: 


0.8200 probability located in endoplasmic reticulum (membrane); 0.4600 probability 
located in plasma membrane; 0.2000 probability located in lysosome (membrane); 
0.1790 probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 25 and 26 



A search of the NOV2a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 2C.



Table 2C. Geneseq Results for NOV2a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV2a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAY78935 | Human UDP-glucuronosyltransferase 2B15 amino acid sequence - Homo sapiens, 530 aa. [WO200006776-A1, 10-FEB-2000] | 1..530 / 1..530 | 447/530 (84%); 485/530 (91%) | 0.0
AAW47126 | Uridine diphospho-glucuronosyltransferase 2B17 (UGT2B17) enzyme - Homo sapiens, 530 aa. [WO9744466-A1, 27-NOV-1997] | 1..530 / 1..530 | 445/530 (83%); 480/530 (89%) | 0.0
ABG05523 | Novel human diagnostic protein #5514 - Homo sapiens, 533 aa. [WO200175067-A2, 11-OCT-2001] | 1..530 / 5..533 | 442/530 (83%); 477/530 (89%) | 0.0
ABG05523 | Novel human diagnostic protein #5514 - Homo sapiens, 533 aa. [WO200175067-A2, 11-OCT-2001] | 1..530 / 5..533 | 442/530 (83%); 477/530 (89%) | 0.0
ABG05524 | Novel human diagnostic protein #5515 - Homo sapiens, 532 aa. [WO200175067-A2, 11-OCT-2001] | 1..530 / 4..532 | 446/531 (83%); 479/531 (89%) | 0.0
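
The identity percentages listed in Table 2C appear to be truncated rather than rounded: for example, 445/530 is about 83.96% and is listed as 83%. The sketch below reproduces the listed identity values under that assumption. It is only an observation about the numbers in the table, not a statement about how the search software computes them; `percent_truncated` and `TABLE_2C_IDENTITIES` are hypothetical names.

```python
def percent_truncated(matches: int, aligned: int) -> int:
    """Percentage of matching positions, truncated to a whole number."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return (100 * matches) // aligned


# (identical residues, aligned length, listed percentage) from Table 2C.
TABLE_2C_IDENTITIES = [
    (447, 530, 84),
    (445, 530, 83),
    (442, 530, 83),
    (446, 531, 83),
]

if __name__ == "__main__":
    for matches, aligned, listed in TABLE_2C_IDENTITIES:
        computed = percent_truncated(matches, aligned)
        print(f"{matches}/{aligned}: computed {computed}%, listed {listed}%")
```

Integer division avoids floating-point edge cases near whole-number boundaries.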



In a BLAST search of public sequence databases, the NOV2a protein was found to
have homology to the proteins shown in the BLASTP data in Table 2D.
Table 2D. Public BLASTP Results for NOV2a

Protein Accession Number | Protein/Organism/Length | NOV2a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
P54855 | UDP-glucuronosyltransferase 2B15 (UDPGT) (UDPGTH-3) (HLUG4) - Homo sapiens (Human), 530 aa. | 1..530 / 1..530 | 447/530 (84%); 485/530 (91%) | 0.0
O75795 | UDP-glucuronosyltransferase 2B17 precursor, microsomal (EC 2.4.1.17) (UDPGT) (C19-steroid specific UDP-glucuronosyltransferase) - Homo sapiens (Human), 530 aa. | 1..530 / 1..530 | 445/530 (83%); 480/530 (89%) | 0.0
P36537 | UDP-glucuronosyltransferase 2B10 precursor, microsomal (EC 2.4.1.17) (UDPGT) - Homo sapiens (Human), 528 aa. | 1..530 / 1..528 | 451/530 (85%); 484/530 (91%) | 0.0
P16662 | UDP-glucuronosyltransferase 2B7 precursor, microsomal (EC 2.4.1.17) (UDPGT) (3,4-catechol estrogen specific) (UDPGTH-2) - Homo sapiens (Human), 529 aa. | 1..530 / 1..529 | 442/530 (83%); 477/530 (89%) | 0.0
O02663 | UDP-glucuronosyltransferase 2B9 precursor, microsomal (EC 2.4.1.17) (UDPGT) - Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey), 529 aa. | 1..530 / 1..529 | 441/530 (83%); 479/530 (90%) | 0.0



PFam analysis predicts that the NOV2a protein contains the domains shown in the 
Table 2E. 



Table 2E. Domain Analysis of NOV2a

Pfam Domain | NOV2a Match Region | Identities/Similarities for the Matched Region | Expect Value
UDPGT | 24..528 | 356/507 (70%); 491/507 (97%) | 0



Example 3. 



The NOV3 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 3A.

Table 3A. NOV3 Sequence Analysis




SEQ ID NO: 5 | 2310 bp
NOV3a, CG100179-01 DNA Sequence


TGGCCGGCGATAAGGGTTTCACCTTCAGGACTGGAGCTCCACAAGATTCAACCGTTAT 
ATCAACCTCCCCGGCCCAGGTCTTACCATCACAGCGTCACAAACTCCAGGTCGCCTAG 
GCGCTGCGCAGGAAGCGCTTGCCAGCCCCGGACTTCTGCGCGCGCTGCATGCCCATTG 
GATGTGCTCATTGCCACCCCAGCCAATCCCGAAACTCGCTCGGACGCTGACAGAAGAC 
TGCGCCGCTGCTTTGGGATTGGTAGCTGAGTTTTGTGTCGCGCCTTTTCTGACGATGC 
GAACAACATGGCGGCGGAAAGTGGTAGCGATTTTCAGCAGAGACGTAGAAGGCGCCGG 
GACCCGGAGGAACCGGAAAAAACAGAACTCAGCGAAAGAGAGCTGGCAGTAGCAGTGG 
CGGTGTCCCAGGAGAACGATGAGGAGAACGAAGAGCGCTGGGTTGGACCTTTACCTGT 
GGAGGCAACACTGGCCAAGAAGAGGAAAGTCTTAGAGTTTGAAAGAGTCTATCTTGAT 
AATCTCCCCAGTGCATCCATGTATGAGCGCAGTTACATGCATAGAGATGTTATCACCC 
ATGTGGTATGCACCAAGACAGATTTTATTATTACTGCCAGTCATGATGGACATGTCAA 
GTTCTGGAAAAAAATAGAAGAGGGAATTGAATTTGTTAAACATTTTCGTAGTCACCTG 
GGTGTTATTGAGAGTATTGCAGTTAGCTCTGAGGGAGCATTGTTCTGTTCTGTGGGTG 
ATGATAAAGCAATGAAGGTGTTTGATGTAGTGAACTTTGACATGATCAACATGCTGAA 
ACTTGGGTATTTTCCTGGACAGTGTGAGTGGATCTATTGCCCAGGGGATGCAATTTCT 
TCAGTTGCTGCTTCCGAAAAGAGTACAGGAAAAATTTTCATTTATGATGGCCGAGGAG 
ATAACCAGCCACTTCATATTTTTGACAAACTCCATACATCACCTCTTACTCAGATACG 
GCTAAACCCAGTTTACAAAGCAGTAGTGTCTTCTGACAAATCTGGGATGATTGAATAC 
TGGACTGGGCCTCCTCATGAATATAAATTCCCCAAAAATGTGAACTGGGAATATAAAA 
CTGACACTGATTTATATGAATTTGCCAAGTGTAAGGCTTATCCAACCAGCGTATGTTT 
TTCACCAGATGGGAAGAAAATAGCTACTATTGGTTCTGATAGAAAAGTTAGAATTTTC 
AGATTTGTAACTGGAAAACTCATGAGAGTCTTTGATGAATCACTAAGCATGTTTACTG 
AACTGCAACAGATGAGGCAACAGTTACCAGACATGGAATTTGGCCGACGAATGGCTGT 
AGAACGTGAGTTGGAGAAGGTTGATGCTGTAAGATTAATTAATATAGTTTTTGATGAA 
ACTGGACACTTCGTGCTGTATGGAACAATGCTGGGCATTAAAGTTATAAATGTAGAGA 
CAAACAGGTGTGTGCGGATTTTAGGCAAACAAGAAAATATTAGAGTGATGCAATTGGC 
TTTGTTCCAGGGGATAGCCAAAAAGCATCGTGCTGCAACTACTATAGAAATGAAAGCT 
TCTGAAAATCCTGTTCTTCAGAATATTCAAGCTGACCCAACAATAGTCTGTACATCAT 
TCAAAAAGAATAGATTTTATATGTTTACCAAACGAGAACCAGAAGATACGAAAAGTGC 
AGATTCTGATCGAGATGTTTTTAATGAGAAACCTTCTAAAGAAGAAGTCATGGCAGCT 
ACTCAAGCTGAAGGACCTAAACGAGTTTCGGACAGTGCCATTATCCACACCAGCATGG 
GAGACATTCACACCAAACTTTTTCCTGTTGAGTGCCCTAAGACAGTGGAAAACTTCTG 
TGTTCACAGCAGAAATGGTTATTATAATGGGCATACATTTCACCGTATAATTAAGGGC 
TTTATGATTCAGACTGGAGATCCAACAGGTACTGGTATGGGAGGAGAAAGCATATGGG 
GAGGAGAATTTGAAGATGAATTTCATTCAACATTACGACATGACAGGCCGTACACACT 
CAGCATGGCTAACGCGGGATCAAATACTAATGGATCCCAGTTTTTCATAACGGTAGTA 
CCAACGCCTTGGCTTGATAATAAGCATACAGTATTTGGACGAGTGACTAAAGGAATGG 
AAGTTGTACAGAGGATCTCCAACGTCAAAGTCAATCCCAAAACAGATAAGCCCTATGA 
GGATGTCAGCATCATAAATATTACTGTCAAGTAAAATAAGATTTGTTTTAATGTACTT 
GCAAATAAAAATACAATATTAAACAGATTATTTTACATTAGGAAGCTT 




ORF Start: ATG at 298 


ORF Stop: TAA at 2236 




SEQ ID NO: 6 


646 aa | MW at 73574.2kD 


NOV3a, 
CGI 00 179-01 Protein Sequence 


MAAESGSDFQQRRRRRRDPEEPEKTELSERELAVAVAVSQENDEENEERWVGPLPVEA 
TLAKKRKVLEFERVYLDNLPSASMYERSYMHRDVITHVVCTKTDFIITASHDGHVKFW
KKIEEGIEFVKHFRSHLGVIESIAVSSEGALFCSVGDDKAMKVFDVVNFDMINMLKLG
YFPGQCEWIYCPGDAISSVAASEKSTGKIFIYDGRGDNQPLHIFDKLHTSPLTQIRLN
PVYKAVVSSDKSGMIEYWTGPPHEYKFPKNVNWEYKTDTDLYEFAKCKAYPTSVCFSP
DGKKIATIGSDRKVRIFRFVTGKLMRVFDESLSMFTELQQMRQQLPDMEFGRRMAVER
ELEKVDAVRLINIVFDETGHFVLYGTMLGIKVINVETNRCVRILGKQENIRVMQLALF
QGIAKKHRAATTIEMKASENPVLQNIQADPTIVCTSFKKNRFYMFTKREPEDTKSADS
DRDVFNEKPSKEEVMAATQAEGPKRVSDSAIIHTSMGDIHTKLFPVECPKTVENFCVH
SRNGYYNGHTFHRIIKGFMIQTGDPTGTGMGGESIWGGEFEDEFHSTLRHDRPYTLSM
ANAGSNTNGSQFFITVVPTPWLDNKHTVFGRVTKGMEVVQRISNVKVNPKTDKPYEDV
SIINITVK 



Further analysis of the NOV3a protein yielded the following properties shown in 
Table 3B. 
Table 3B. Protein Sequence Properties NOV3a


PSort analysis: 


0.7000 probability located in nucleus; 0.4969 probability located in microbody 
(peroxisome); 0.3600 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV3a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 3C.





Table 3C. Geneseq Results for NOV3a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV3a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAM79775 | Human protein SEQ ID NO 3421 - Homo sapiens, 660 aa. [WO200157190-A2, 09-AUG-2001] | 1..646 / 15..660 | 644/646 (99%); 645/646 (99%) | 0.0
AAM78791 | Human protein SEQ ID NO 1453 - Homo sapiens, 565 aa. [WO200157190-A2, 09-AUG-2001] | 82..646 / 1..565 | 565/565 (100%); 565/565 (100%) | 0.0
AAM43579 | Human polypeptide SEQ ID NO 257 - Homo sapiens, 483 aa. [WO200155308-A2, 02-AUG-2001] | 165..646 / 2..483 | 478/482 (99%); 478/482 (99%) | 0.0
AAM43649 | Human polypeptide SEQ ID NO 327 - Homo sapiens, 420 aa. [WO200155308-A2, 02-AUG-2001] | 227..646 / 1..420 | 420/420 (100%); 420/420 (100%) | 0.0
ABB59961 | Drosophila melanogaster polypeptide SEQ ID NO 6675 - Drosophila melanogaster, 342 aa. [WO200171042-A2, 27-SEP-2001] | 7..343 / 2..335 | 178/342 (52%); 241/342 (70%) | e-100



In a BLAST search of public sequence databases, the NOV3a protein was found to
have homology to the proteins shown in the BLASTP data in Table 3D.
Table 3D. Public BLASTP Results for NOV3a

Protein Accession Number | Protein/Organism/Length | NOV3a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q96BP3 | HYPOTHETICAL 73.6 KDA PROTEIN - Homo sapiens (Human), 646 aa. | 1..646 / 1..646 | 646/646 (100%); 646/646 (100%) | 0.0
Q15002 | KIAA0073 PROTEIN - Homo sapiens (Human), 645 aa (fragment). | 2..646 / 1..645 | 645/645 (100%); 645/645 (100%) | 0.0
Q960Q8 | LD39850P - Drosophila melanogaster (Fruit fly), 637 aa. | 7..643 / 2..63S | 376/642 (58%); 487/642 (75%) | 0.0
Q8W4D0 | HYPOTHETICAL 71.0 KDA PROTEIN - Arabidopsis thaliana (Mouse-ear cress), 631 aa. | 25..645 / 10..628 | 334/633 (52%); 446/633 (69%) | 0.0
Q9LXM7 | HYPOTHETICAL 70.7 KDA PROTEIN - Arabidopsis thaliana (Mouse-ear cress), 629 aa. | 25..645 / 10..626 | 334/633 (52%); 444/633 (69%) | 0.0



PFam analysis predicts that the NOV3a protein contains the domains shown in the 
Table 3E. 



Table 3E. Domain Analysis of NOV3a

Pfam Domain | NOV3a Match Region | Identities/Similarities for the Matched Region | Expect Value
WD40 | 125..161 | 7/37 (19%); 29/37 (78%) | 0.27
WD40 | 272..308 | 13/37 (35%); 30/37 (81%) | 0.94
pro_isomerase | 501..643 | 80/164 (49%); 115/164 (70%) | 1.2e-61



Example 4. 



The NOV4 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 4A. 






WO 03/010327 



PCT/US02/14199 



Table 4A. NOV4 Sequence Analysis 




SEQ ID NO: 7 | 1060 bp
NOV4a, CG100212-01 DNA Sequence


CAATGATTGTTCAAAGAGTGGTATTGAATTCTCGACCTGGTAAAAATGGTAATCCAGT 
GGCAGAGAATTTCCGAATGGAAGAAGTCTATTTACCAGATAATATTAATGAAGGACAA 
GTACAAGTTAGAACTCTTTATCTTTCTGTGGATCCTTACATGCGTTGTAGAATGAATG 
AAGACACTGGCACTGATTATATAACACCTTGGCAGCTATCTCAAGTCGTTGATGGAGG 
AGGTATTGGAATTATAGAAGAAAGCAAACACACAAATTTGACTAAAGGCGATTTTGTG 
ACTTCTTTCTATTGGCCCTGGCAAACCAAGGTTATTCTGGATGGAAATAGCCTTGAAA 
AGGTGGACCCACAACTTGTGGATGGACACCTTTCATATTTTCTTGGAGCTATAGGTAT 
GCCTGGTTTGACTTCCTTGATTGGGATACAGGAAAAAGGTCATATAACTGCTGGATCT 
AATAAGACAATGGTTGTCAGTGGGGCCGCAGGTGCCTGTGGATCTGTGGCTGGGCAGA 
TTGGCCATTTCTTAGGTTGTTCCAGAGTGGTGGGAATTTGTGGAACACATGAGAAATG 
CATCCTCTTGACCTCAGAACTGGGCTTTGATGCTGCAATTAATTATAAAAAAGACAAT 
GTGGCAGAACAGCTCCGTGAATCATGCCCAGCTGGAGTGGATGTTTATTTTGACAATG
TTGGTGGTAACATCAGTGATACAGTGATAAGTCAGATGAATGAGAACAGCCACATCAT 
CCTGTGTGGTCAAATTTCTCAGTACAACAAAGATGTGCCTTATCCTCCCCCGCTATCC 
CCTGCTATAGAGGCAATCCAGAAAGAAAGAAACATCACAAGGGAAAGATTTCTGGTAT 
TAAATTATAAAGACAAATTTGAGCCTGGCATTCTACAGCTGAGTCAGTGGTTTAAAGA 
AGGAAAGCTAAAGGTAAAAGAGACGGTAATAAATGGGTTGGAAAACATGGGAGCATTC 
CAGTCCATGATGACAGGAGGTAACATTGGAAAGCAGATAGTTTGCATTTCAGAAGAAA 
TCTCTTTGTAATTGCT




ORF Start: ATG at 3 


ORF Stop: TAA at 1053




SEQ ID NO: 8 | 350 aa | MW at 38413.5kD
NOV4a, CG100212-01 Protein Sequence


MIVQRVVLNSRPGKNGNPVAENFRMEEVYLPDNINEGQVQVRTLYLSVDPYMRCRMNE
DTGTDYITPWQLSQVVDGGGIGIIEESKHTNLTKGDFVTSFYWPWQTKVILDGNSLEK
VDPQLVDGHLSYFLGAIGMPGLTSLIGIQEKGHITAGSNKTMVVSGAAGACGSVAGQI
GHFLGCSRVVGICGTHEKCILLTSELGFDAAINYKKDNVAEQLRESCPAGVDVYFDNV
GGNISDTVISQMNENSHIILCGQISQYNKDVPYPPPLSPAIEAIQKERNITRERFLVL 
NYKDKFEPGILQLSQWFKEGKLKVKETVINGLENMGAFQSMMTGGNIGKQIVCISEEI 
SL 




SEQ ID NO: 9 | 853 bp
NOV4b, CG100212-02 DNA Sequence


CAATGATTGTTCAAAGAGTGGTATTGAATTCTCGACCTGGTAAAAATGGTAATCCAGT 
GGCAGAGAATTTCCGAATGGAAGAAGTCTATTTACCAGATAATATTAATGAAGGACAA 
GTACAAGTTAGAACTCTTTATCTTTCTGTGGATCCTTACATGCGTTGTAGAATGAATG 
AAGACACTGGCACTGATTATATAACACCTTGGCAGCTATCTCAAGTCGTTGATGGAGG 
AGGTATTGGAATTATAGAAGAAAGCAAACACACAAATTTGACTAAAGGCGATTTTGTG 
ACTTCTTTCTATTGGCCCTGGCAAACCAAGGTTATTCTGGATGGAAATAGCCTTGAAA 
AGGTAGACCCACAACTTGTGGATGGACACCTTTCATATTTTCTTGGAGCTATAGGTAT 
GCCTGGTTTGACTTCCTTGATTGGGATACAGGAAAAAGGTCATATAACTGCTGGATCT 
AATAAGACAATGGTTGTCAGTGGGGCCGCAGGTGCCTGTGGATCTGTGGCTGGGCAGA 
TGAATGAGAACAGCCACATCATCCTGTGTGGTCAAATTTCTCAGTACAACAAAGATGT 




ACAAGGGAAAGATTTCTGGTATTAAATTATAAAGACAAATTTGAGCCTGGCATTCTAC 
AGCTGAGTCAGTGGTTTAAAGAAGGAAGGCTAAAGATTAAAGAGACGGTAATAAATGG 
GTTGGAAAACATGGGAGCTGCATTCCAGTCCATGATGACAGGAGGTAACATTGGAAAG 
CAGATAGTTTGCATTTCAGAAGAAATCTCTTTGTAATTGCT




ORF Start: ATG at 3 


ORF Stop: TAA at 846 




SEQ ID NO: 10 | 281 aa | MW at 31055.3kD
NOV4b, CG100212-02 Protein Sequence


MIVQRVVLNSRPGKNGNPVAENFRMEEVYLPDNINEGQVQVRTLYLSVDPYMRCRMNE
DTGTDYITPWQLSQVVDGGGIGIIEESKHTNLTKGDFVTSFYWPWQTKVILDGNSLEK
VDPQLVDGHLSYFLGAIGMPGLTELIGIQEKGHITAGSNKTMVVSGAAGACGSVAGQM
NENSHIILCGQISQYNKDVPYPPPLSPAIEAIQKERNITRERFLVLNYKDKFEPGILQ
LSQWFKEGRLKIKETVINGLENMGAAFQSMMTGGNIGKQIVCISEEISL



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 4B. 
Table 4B. Comparison of NOV4a against NOV4b.

Protein Sequence | NOV4a Residues/Match Residues | Identities/Similarities for the Matched Region
NOV4b | 1..183 / 1..186 | 154/186 (82%); 158/186 (84%)



Further analysis of the NOV4a protein yielded the following properties shown in 
Table 4C. 



Table 4C. Protein Sequence Properties NOV4a 


PSort analysis: 


0.6500 probability located in cytoplasm; 0.1572 probability located in lysosome
(lumen); 0.1000 probability located in mitochondrial matrix space; 0.0000
probability located in endoplasmic reticulum (membrane)


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV4a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 4D.



Table 4D. Geneseq Results for NOV4a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV4a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAG64386 | Human alcohol dehydrogenase 39 - Homo sapiens, 351 aa. [WO200155404-A1, 02-AUG-2001] | 1..350 / 1..351 | 349/351 (99%); 350/351 (99%) | 0.0
AAU33607 | Pseudomonas aeruginosa cellular proliferation protein #51 - Pseudomonas aeruginosa, 345 aa. [WO200170955-A2, 27-SEP-2001] | 4..347 / 9..344 | 131/347 (37%); 202/347 (57%) | 4e-58
AAU33607 | Pseudomonas aeruginosa cellular proliferation protein #51 - Pseudomonas aeruginosa, 345 aa. [WO200170955-A2, 27-SEP-2001] | 4..347 / 9..344 | 131/347 (37%); 202/347 (57%) | 4e-58
AAU34567 | E. coli cellular proliferation protein #148 - Escherichia coli, 376 aa. [WO200170955-A2, 27-SEP-2001] | 4..345 / 40..373 | 133/350 (38%); 203/350 (58%) | 2e-56
AAU34567 | E. coli cellular proliferation protein #148 - Escherichia coli, 376 aa. [WO200170955-A2, 27-SEP-2001] | 4..345 / 40..373 | 133/350 (38%); 203/350 (58%) | 2e-56
In a BLAST search of public sequence databases, the NOV4a protein was found to
have homology to the proteins shown in the BLASTP data in Table 4E.



Table 4E. Public BLASTP Results for NOV4a

Protein Accession Number | Protein/Organism/Length | NOV4a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q8VDQ1 | SIMILAR TO RIKEN CDNA B830026H24 GENE - Mus musculus (Mouse), 351 aa. | 1..350 / 1..351 | 308/351 (87%); 328/351 (92%) | 0.0
Q9D1W8 | B830026H24RIK PROTEIN - Mus musculus (Mouse), 351 aa. | 1..350 / 1..351 | 304/351 (86%); 327/351 (92%) | 0.0
Q09593 | HYPOTHETICAL 59.0 KDA PROTEIN M106.3 IN CHROMOSOME II - Caenorhabditis elegans, 529 aa. | 6..345 / 27..374 | 161/348 (46%); 218/348 (62%) | 6e-83
O34812 | YFMJ PROTEIN - Bacillus subtilis, 339 aa. | 4..346 / 6..337 | 146/346 (42%); 208/346 (59%) | 7e-61
Q9HR85 | QUINONE OXIDOREDUCTASE - Halobacterium sp. (strain NRC-1), 380 aa. | 7..346 / 53..380 | 144/343 (41%); 200/343 (57%) | 6e-60



PFam analysis predicts that the NOV4a protein contains the domains shown in the 
Table 4F. 



Table 4F. Domain Analysis of NOV4a

Pfam Domain | NOV4a Match Region | Identities/Similarities for the Matched Region | Expect Value
adh_zinc | 22..344 | 77/472 (16%); 216/472 (46%) | 3e-16



Example 5.



The NOV5 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 5A. 
Table 5A. NOV5 Sequence Analysis

SEQ ID NO: 11 | 978 bp
NOV5a, CG100222-01 DNA Sequence


GATGATTCATGCTGAGATGGACCCTGAAGAAGCACTTTGTTGGCAATCCTACTAATAG 
TGACTTTGAGTTGAAGACAGCTGAGCTCCCACCCTTAAAAAATAGAGAGGTCCTGCTT 
GAAGCTTTGTTCCTCACTGGGGACCCTTACATGAGAGTGGCAGCCAAAAGACTGAAGG 
AAGGTGATACAATGATGGGGCAGCAAGTGGCCAGATTTGTGGAAAGTAAAAACGCAGC 




CATGGAAAAGATCTGGAAAACCTGCCAACAGAATGGCCTGACACAGTACCACTGTCTC 
TGGCTCTGGGGACAGTTAGCATGACAGGCCTCACTGCCTACTTTGGCCTACTTGACAT 
CTGTGGTGTGAAGGGTGGAGAAACAGTGATGGTTAATGCAGCAGCTGGAGCTGTGGGC 
TCTGTTTTGGGGCAGATTACAAAGCTCAAGGGCTGCAAAGTGGTTGGAGCAGTAGGGT 
CTGACGAAAAGACTGCCTGCCTTAAAAAGCTTGGATCTGATGTCTTTAACTACAAGAC 
AGTAGAGTCTTTGGAAGAAACTTTGAAGAAAGCCTCTCCTGATGGTTACAATTGTTAT 
TTTGGTAATGTAAGTGGATGGTTTTCAAACACTGTTATCTCCCAGATGAAGAAATTTG 
GAAGAATTGCCACATGTGGAGCTATCTCTATATATAACACCAGCCCACTTCCCCCAGG 
CCCAATCCCAGAGATTGTTATGTATCAGGAGCTTCACATGGAAGGGTTCATTGTGACC 
TGCTGGCCAGGAGATGCCCACCAAAAAGCTCTAAAGGACTTGCTGAAATGGGTCTCAG 
AGGGTCAAATCCAGTACCAGGAATATATCATTGAAGGATATGAAAACATGCCAGCTGC 
ATTTATGGGAATGCTGAAAGGAGATAATTTGGAAAAACAATAGTGAAAGC 




ORF Start: ATG at 9 


ORF Stop: TAG at 969 




SEQ ID NO: 12 | 320 aa | MW at 34932.1kD
NOV5a, CG100222-01 Protein Sequence


MLRWTLKFMFVGNPTHSDFELKTAELPPLKNREVLLEALFLTGDPYMRVAAKRLKEGD
TMMGQQVARFVESKNAALPNRTIVLAPSGWTAHSIAHGKDLENLPTEWPDTVPLSLAL
GTVSMTGLTAYFGLLDICGVKGGETVMVNAAAGAVGSVLGQITKLKGCKVVGAVGSDE
KTACLKKLGSDVFNYKTVESLEETLKKASPDGYNCYFGNVSGWFSNTVISQMKKFGRI
ATCGAISIYNTSPLPPGPIPEIVMYQELHMEGFIVTCWPGDAHQKALKDLLKWVSEGQ
IQYQEYIIEGYENMPAAFMGMLKGDNLEKQ



Further analysis of the NOV5a protein yielded the following properties shown in 
Table 5B. 



Table 5B. Protein Sequence Properties NOV5a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.1644 probability located in microbody 
(peroxisome); 0.1620 probability located in lysosome (lumen); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV5a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 5C.
Table 5C. Geneseq Results for NOV5a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV5a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAM39169 | Human polypeptide SEQ ID NO 2314 - Homo sapiens, 329 aa. [WO200153312-A1, 26-JUL-2001] | 4..319 / 7..324 | 269/318 (84%); 281/318 (87%) | e-151
AAM40955 | Human polypeptide SEQ ID NO 5886 - Homo sapiens, 417 aa. [WO200153312-A1, 26-JUL-2001] | 4..319 / 60..412 | 267/353 (75%); 280/353 (78%) | e-145
AAU37324 | Staphylococcus aureus cellular proliferation protein #1494 - Staphylococcus aureus, 334 aa. [WO200170955-A2, 27-SEP-2001] | 12..320 / 15..328 | 120/319 (37%); 175/319 (54%) | 5e-49
AAU37324 | Staphylococcus aureus cellular proliferation protein #1494 - Staphylococcus aureus, 334 aa. [WO200170955-A2, 27-SEP-2001] | 12..320 / 15..328 | 120/319 (37%); 175/319 (54%) | 5e-49
AAU36657 | Staphylococcus aureus cellular proliferation protein #827 - Staphylococcus aureus, 335 aa. [WO200170955-A2, 27-SEP-2001] | 12..320 / 15..328 | 120/319 (37%); 174/319 (53%) | 6e-49



In a BLAST search of public sequence databases, the NOV5a protein was found to
have homology to the proteins shown in the BLASTP data in Table 5D.
Table 5D. Public BLASTP Results for NOV5a

Protein Accession Number | Protein/Organism/Length | NOV5a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q9H1X6 | BA16L21.1.1 (NADP-DEPENDENT LEUKOTRIENE B4 12-HYDROXYDEHYDROGENASE (ISOFORM 1)) - Homo sapiens (Human), 329 aa. | 7..324 | (84%); 281/318 (87%) |
Q14914 | NADP-dependent leukotriene B4 12-hydroxydehydrogenase (EC 1.1.1.-) - Homo sapiens (Human), 311 aa (fragment). | 4..305 / 7..310 | 256/304 (84%); 268/304 (87%) | e-143
O62642 | 15-OXOPROSTAGLANDIN 13-REDUCTASE - Sus scrofa (Pig), 329 aa. | 4..319 / 7..324 | 248/318 (77%); 272/318 (84%) | e-141
Q29073 | NADP-dependent leukotriene B4 12-hydroxydehydrogenase (EC 1.1.1.-) - Sus scrofa (Pig), 329 aa. | 4..319 / 7..324 | 247/318 (77%); 271/318 (84%) | e-141
P97584 | NADP-DEPENDENT LEUKOTRIENE B4 12-HYDROXYDEHYDROGENASE (EC 1.1.1.-) (DITHIOLETHIONE-INDUCIBLE GENE-1) - Rattus norvegicus (Rat), 329 aa. | 4..319 / 7..324 | 239/318 (75%); 266/318 (83%) | e-136



PFam analysis predicts that the NOV5a protein contains the domains shown in the 
Table 5E. 



Table 5E. Domain Analysis of NOV5a

Pfam Domain | NOV5a Match Region | Identities/Similarities for the Matched Region | Expect Value
adh_zinc | 18..319 | 63/471 (13%); 206/471 (44%) | 1.2e-12



Example 6. 



The NOV6 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 6A. 
Table 6A. NOV6 Sequence Analysis

SEQ ID NO: 13 | 696 bp
NOV6a, CG100266-01 DNA Sequence

GTACTACATTTAAATCAATGTGACAAAGTCATTTTGTCTTTATGCAAACAATCAACTT
GGTATTTTCGTATAGTAACAGAAAAAATGGGCTTTGCAGACGCCGCTGCCACCGGGAG


CCCTCTACTATCAGCCATGGTCAACCCCACCGTGTTCCTCGACATCGCTGTCCACGGT 
GAGCCCTTGGGTGGTATCCCCTTCGAGCTGTTTGCAGACAAGATCCCAAAGACACCAG 




TCACAGAATTATTCCCGGGGTTATGTGTCAGCGTGGTGACTTCACAAGTCATAATGGC 
ACTGGTGGCAAGTCCATCCAAGGGGAGAAATTTGATGACAAGAACTTCATCCTAAAGC 
ACGCAGGTCCTGGCATCTTGTCCATGGCAAATACTGGACCCAACACAAACAGTTTCCA 
GTGTTTCATCTGCACTGCCAAGGCTGAGTGGTTGGATGGCAAGCATGTGGTCTTTGGC 
AAGGTGAAAGAAGGCATAAATATTGTCGAGGCCATGGAGCGCTTTGGGTCCAGGAATG 
GCAAGACCAGCAAGAAGATCACCATTTCTGACTGTGGAAAACTCTAATACGTTTGACT 
TGTGTTTGATATTTTATCTTAACCACCAGACTATTCCTTCTGTAGCTCAGGAGAGCAC 




ORF Start: ATG at 85 


ORF Stop: TAA at 625 




SEQ ID NO: 14 | 180 aa | MW at 19224.9kD
NOV6a, CG100266-01 Protein Sequence


MGFADAAATGSPLLSAMVNPTVFLDIAVHGEPLGGIPFELFADKIPKTPENFHALSTG 
EKGFGYKGSCFHRIIPGVMCQRGDFTSHNGTGGKSIQGEKFDDKNFILKHAGPGILSM 
ANTGPNTNSFQCFICTAKAEWLDGKHVVFGKVKEGINIVEAMERFGSRNGKTSKKITI
SDCGKL




SEQ ID NO: 15 | 545 bp
NOV6b, CG100266-02 DNA Sequence


AAATGGGCTTTGCAGACGCCGCTGCCACCGGGAGCCCTCTACTATCAGTCATGGTCAA 

GAGCTGTTTGCAGACAAGATCCCAAAGACACCAGAAAACTTTCATGCTCTAAGCACTG 
GAGAAAAAGGATTTGGTTATAAGGGTTCCTGCTTTCACAGAATTATTCCCGGGGTTAT 
GTCTCAGCGTGGTGACTTCACAAGTCATAATGGCACTGGTGGCAAGTCCATCCAAGGG 
GAGAAATTTGATGACAAGAACTTCATCCTAAAGCACGCAGGTCCTGGCATCTTGTCCA 
TGGCAAATACTGGACCCAACACAAACAGTTTCCAGTGTTTCATCTGCACTGCCAAGGC 
TGAGTGGTTGGATGGCAAGCATGTGGTCTTTGGCAAGGTGAAAGAAGGCATAAATATT 
GTCGAGGCCATGGAGCGCTTTGGGTCCAGGAATGGCAAGACCAGCAAGAAGATCACCA




ORF Start: ATG at 3 


ORF Stop: TAA at 543 




SEQ ID NO: 16 | 180 aa | MW at 19252.9kD
NOV6b, CG100266-02 Protein Sequence


MGFADAAATGSPLLSVMVNPTVFLDIAVHGEPLGGIPFELFADKIPKTPENFHALSTG 
EKGFGYKGSCFHRIIPGVMCQRGDFTSHNGTGGKSIQGEKFDDKNFILKHAGPGILSM 
ANTGPNTNSFQCFICTAKAEWLDGKHVVFGKVKEGINIVEAMERFGSRNGKTSKKITI
SDCGKL 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 6B. 



Table 6B. Comparison of NOV6a against NOV6b.

Protein Sequence | NOV6a Residues/Match Residues | Identities/Similarities for the Matched Region
NOV6b | 1..180 / 1..180 | 179/180 (99%); 179/180 (99%)
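
Table 6B reports 179 identities out of 180 residues, so NOV6a and NOV6b differ at a single position. As a minimal sketch, the example below translates the corresponding stretch of each DNA sequence with the standard genetic code and reports the differing residue. The two fragments are copied from the NOV6a and NOV6b DNA sequences in Table 6A and are taken here to cover codons 12 through 18 of each ORF; `translate`, `differences` and the fragment constants are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def differences(a: str, b: str, first_residue: int = 1):
    """List (position, residue_in_a, residue_in_b) where two sequences differ."""
    return [
        (first_residue + i, x, y)
        for i, (x, y) in enumerate(zip(a, b))
        if x != y
    ]


# Assumed to be codons 12-18 of each ORF, copied from Table 6A.
NOV6A_FRAGMENT = "CCTCTACTATCAGCCATGGTC"
NOV6B_FRAGMENT = "CCTCTACTATCAGTCATGGTC"

if __name__ == "__main__":
    pep_a = translate(NOV6A_FRAGMENT)
    pep_b = translate(NOV6B_FRAGMENT)
    print(pep_a, pep_b, differences(pep_a, pep_b, first_residue=12))
```

In this fragment, a single C-to-T change in one codon (GCC versus GTC) accounts for the alanine-to-valine difference at residue 16 of the two protein sequences.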



Further analysis of the NOV6a protein yielded the following properties shown in 
Table 6C. 
Table 6C. Protein Sequence Properties NOV6a


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV6a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 6D.



Table 6D. Geneseq Results for NOV6a 


Geneseq 
Identifier 

I 


Protein/Organism/Length [Patent 
#, Date] 


NOV6a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU01195 


Human cyclophilin A protein - 

Homo sapiens, 165 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


17..180 
1-164 


141/164 (85%) 
148/164 (89%) 


2e-8l 


AAW56028 


Calcineurin protein - Mammalia, 
165 aa. [WO9808956-A2, 05- 
MAR-1998] 


17..180 
1..164 


141/164 (85%) 
148/164 (89%) 


2e-81 


AAR13726 


Bovine cyclophilin - Bos taurus, 
163 aa. [US5047512-A, 10-SEP- 
1991] 


18..180 
1.-163 


140/163 (85%) 
147/163 (89%) 


4e-81 


AAG65275 


Haematopoietic stem cell 
proliferation agent related human 
protein #2 - Homo sapiens, 164 aa. 
[JP2001163798-A, 19-JUN-2001] 


18..180 
1 ..163 


140/163 (85%) 
147/163 (89%) 


7e-81 


AAP90431 


Cyclophilin - Homo sapiens 
(human), 164aa. [EP326067-A, 
02-AUG-1989] 


18..180 
1..163 


140/163 (85%) 
147/163 (89%) 


7e-81 
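
The residue ranges in these result tables use a `start..end` notation with inclusive endpoints, so the span of the query range and of the match range can be compared directly. The sketch below is a small helper for that check; the function name is hypothetical and the inclusive-endpoint reading is an assumption based on the reported alignment lengths.

```python
def range_length(residue_range: str) -> int:
    """Length of an inclusive residue range written as 'start..end'."""
    start_text, end_text = residue_range.split("..")
    start, end = int(start_text), int(end_text)
    if end < start:
        raise ValueError("range end precedes range start")
    return end - start + 1


# First Geneseq hit for NOV6a: query residues 17..180, match residues 1..164
print(range_length("17..180"), range_length("1..164"))  # 164 164
```

Both spans equal the denominator of the reported identity (141/164), which suggests an alignment without gaps for that hit.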



In a BLAST search of public sequence databases, the NOV6a protein was found to
have homology to the proteins shown in the BLASTP data in Table 6E.

Table 6E. Public BLASTP Results for NOV6a

Protein Accession Number | Protein/Organism/Length | NOV6a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
CAC39529 | SEQUENCE 26 FROM PATENT WO0132876 - Homo sapiens (Human), 165 aa. | 1..164 | 141/164 (85%); 148/164 (89%) | 5e-81
P04374 | Peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8) (PPIase) (Rotamase) (Cyclophilin A) (Cyclosporin A-binding protein) - Bos taurus (Bovine), and, 163 aa. | 18..180; 1..163 | 140/163 (85%); 147/163 (89%) | 1e-80
Q9BRU4 | PEPTIDYLPROLYL ISOMERASE A (CYCLOPHILIN A) - Homo sapiens (Human), 165 aa. | 17..180; 1..164 | 140/164 (85%); 147/164 (89%) | 2e-80
P05092 | Peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8) (PPIase) (Rotamase) (Cyclophilin A) (Cyclosporin A-binding protein) - Homo sapiens (Human), 164 aa. | 18..180; 1..163 | 140/163 (85%); 147/163 (89%) | 2e-80
Q96IX3 | PEPTIDYLPROLYL ISOMERASE A (CYCLOPHILIN A) - Homo sapiens (Human), 165 aa. | 17..180; 1..164 | 140/164 (85%); 147/164 (89%) | 5e-80



Pfam analysis predicts that the NOV6a protein contains the domains shown in
Table 6F.

Table 6F. Domain Analysis of NOV6a

Pfam Domain | NOV6a Match Region | Identities/Similarities for the Matched Region | Expect Value
pro_isomerase | 21..180 | 104/179 (58%); 140/179 (78%) | 3e-85



Example 7. 



The NOV7 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 7A. 
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Table 7A. NOV7 Sequence Analysis 




SEQ ID NO: 17 | 665 bp

NOV7a, CG100427-01 DNA Sequence


GTAAiiATAAftATAAAGTTAAATTAATTAAAATCAAATAAAATTAAAACTTTAGGCTTT 


AGAGAAGTTTCCAAAGACAGAAGAAAACTTTCATCTTCTGAGCACTGGAGAGAAAGGA 
TTTGGTTATAAAAGTTCCTGCTTTCACAGAATTATTCCAGGGTTTATGTGTCGGGGTG 
ATGACTTCACATGCCATAACAGCACTGGTGGCAAGTCCGTCTATGGGGAGAAGTTTGT 
TGACGAGAACTTCGTCCTGAAGCATACAGATCCTGGCATCTTGTCCAGGGCAAATGCT 
GGACCCAGTACAAATGGTTCCCAGTTTTTTACCTGCACTGCTAAGACCGAGTGGTTCG 
GCAAGGTGAAAGAACGCATGATTATTGTGGAGGCCATGGGGCGCTTTGCGGGGAATGG 
GAAGACCAGCAAGAAGATCGCTGTTGCTGACTGTGGGCTACTCTAATATGTTTGACTT
GGGTTTATCTTAACCAGCAGACCATTCTTTCTGTAGCTCCAGAGAGCACCCCTCTACC 
CCATTTGCTCGCAGTATCCTAGAATCT 




ORF Start: ATG at 98 | ORF Stop: TAA at 566

SEQ ID NO: 18 | 156 aa | MW at 17112.3kD

NOV7a, CG100427-01 Protein Sequence


MVWPTVFFDITVDGEPLGSVSFELFAEKFPKTEENFHLLSTGEKGFGYKSSCFHRIIP 
GFMCRGDDFTCHNSTGGKSVYGEKFVDENFVLKHTDPGILSRANAGPSTNGSQFFTCT 
AKTEWFGKVKERMIIVEAMGRFAGNGKTSKKIAVADCGLL 



Further analysis of the NOV7a protein yielded the following properties shown in 
Table 7B. 



Table 7B. Protein Sequence Properties NOV7a

PSort analysis: 0.6500 probability located in plasma membrane; 0.6400 probability located in microbody (peroxisome); 0.4500 probability located in cytoplasm; 0.1000 probability located in mitochondrial matrix space

SignalP analysis: No Known Signal Sequence Predicted
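
A PSort entry of this kind lists several candidate compartments, each with its own score, and the values need not sum to 1 (the four values for NOV7a total 1.84). The sketch below parses such an entry and ranks the compartments by score. The regular expression and function name are illustrative assumptions, not part of any PSort tool.

```python
import re
from typing import List, Tuple


def rank_locations(psort_text: str) -> List[Tuple[str, float]]:
    """Return (location, score) pairs from a PSort summary, highest score first."""
    pattern = re.compile(r"(\d\.\d+) probability located in ([^;]+)")
    pairs = [(location.strip(), float(score))
             for score, location in pattern.findall(psort_text)]
    # sorted() is stable, so tied scores keep their original order
    return sorted(pairs, key=lambda pair: pair[1], reverse=True)


nov7a_psort = (
    "0.6500 probability located in plasma membrane; "
    "0.6400 probability located in microbody (peroxisome); "
    "0.4500 probability located in cytoplasm; "
    "0.1000 probability located in mitochondrial matrix space"
)
print(rank_locations(nov7a_psort)[0])  # ('plasma membrane', 0.65)
```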



A search of the NOV7a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 7C.

Table 7C. Geneseq Results for NOV7a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV7a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAU01195 | Human cyclophilin A protein - Homo sapiens, 165 aa. [WO200132876-A2, 10-MAY-2001] | 1..156; 1..164 | 129/164 (78%); 135/164 (81%) | 5e-70
AAW56028 | Calcineurin protein - Mammalia, 165 aa. [WO9808956-A2, 05-MAR-1998] | 1..156; 1..164 | 129/164 (78%); 135/164 (81%) | 5e-70
AAG65275 | Haematopoietic stem cell proliferation agent related human protein #2 - Homo sapiens, 164 aa. [JP2001163798-A, 19-JUN-2001] | 2..156; 1..163 | 128/163 (78%); 134/163 (81%) | 2e-69
AAP90431 | Cyclophilin - Homo sapiens (human), 164 aa. [EP326067-A, 02-AUG-1989] | 2..156; 1..163 | 128/163 (78%); 134/163 (81%) | 2e-69
AAG03831 | Human secreted protein, SEQ ID NO: 7912 - Homo sapiens, 165 aa. [EP1033401-A2, 06-SEP-2000] | 1..156; 1..164 | 128/164 (78%); 134/164 (81%) | 3e-69



In a BLAST search of public sequence databases, the NOV7a protein was found to
have homology to the proteins shown in the BLASTP data in Table 7D.

Table 7D. Public BLASTP Results for NOV7a

Protein Accession Number | Protein/Organism/Length | NOV7a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
CAC39529 | SEQUENCE 26 FROM PATENT WO0132876 - Homo sapiens (Human), 165 aa. | 1..156; 1..164 | 129/164 (78%); 135/164 (81%) | 1e-69
Q9BRU4 | PEPTIDYLPROLYL ISOMERASE A (CYCLOPHILIN A) - Homo sapiens (Human), 165 aa. | 1..156; 1..164 | 129/164 (78%); 134/164 (81%) | 3e-69
 | PEPTIDYLPROLYL ISOMERASE A (CYCLOPHILIN A) - Homo sapiens (Human), 165 aa. | 1..164 | 129/164 (78%); 134/164 (81%) | 
P05092 | Peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8) (PPIase) (Rotamase) (Cyclophilin A) (Cyclosporin A-binding protein) - Homo sapiens (Human), 164 aa. | 2..156; 1..163 | 128/163 (78%); 134/163 (81%) | 4e-69
P04374 | Peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8) (PPIase) (Rotamase) (Cyclophilin A) (Cyclosporin A-binding protein) - Bos taurus (Bovine), and, 163 aa. | 2..154; 1..161 | 127/161 (78%); 133/161 (81%) | 1e-68



Pfam analysis predicts that the NOV7a protein contains the domains shown in
Table 7E.

Table 7E. Domain Analysis of NOV7a

Pfam Domain | NOV7a Match Region | Identities/Similarities for the Matched Region | Expect Value
pro_isomerase | 5..154 | 94/177 (53%); 126/177 (71%) | 7.9e-61



Example 8. 



The NOV8 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 8A. 
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Table 8A. NOV8 Sequence Analysis 



SEQ ID NO: 19

NOV8a, CG100456-01 DNA Sequence



GCCGCCGAGCCGTCTGCCTGCCTGGAGGCGGCCACCCGCGCCTGGCGCGGCCTGC 
AGCGCGGTGAGGTGGTACCGCTGGGTCCTGGAGTGCCGGCCCTGGTGGCCAACGGCTT 
CCTGGCCCTGGACGTGGCTGCCAATCGGCTGTGGGTGACTCCCGGGGAGCGGGAGCCC 



AAGCTGGAGAGGCGGTGCTGCTGCTGCGGGAGGGGCTTCTGCGCCGCGTGCGTTGCCT 
GCAGCTGGGGTCCCCAGGTCCTGGCCCCGTGGCCGCCGGCCCCGGGCCCGCCTCCGTC 
TCTGGCCTTGCCGCGGGGTCCGGCCGCGACTGCGTGCTGCTGCAAGAGGACTTTCTGG 
CGCACAGGGGCCGACCCCACGTCTACCTGCAGCGCATCCAGCTCAACAACCCCACGGA 
GCGCGTGGCCGCGCTGCAGACTGTGGGGCCCACTGCCGGCCCAGCCCCCAATGCCTTC 



CGCCTACGCCCACTGGGTTGGTGCACCTGGTGGTGGTGGCCGCCAAGAAGCTGGTGAA 
CCGCCTCCAAGTGGCTCCCAAGACGCAGCTGGATGAGACGGTGCTGTGGGTGGTGCAC 
GTCTCTGGCCCCATTAACCCCCAGGTGCTCAAAAGCAAAGCAGCCAAGGAGCTCAAGG 



AGACCAGATGGAGTCGACGCTCAACTATGAAGATCACTGCTTCAGCGGGCACGCCACC 
ATGCACGCCGAGAACCTGTGGCCGGGGCGGCTGTCCTCCGTCCAGCAGATCCTGCAGC 
TCTCTGACCTGTGGAGGCTGACCCTCCAGAAGCGTGGCTGCAAGGGGCTGGTGAAGGT 
GGGTGCCCCAGGCATCCTGCAGGGGATGGTGCTCAGCTTTGGGGGGCTGCAGTTCACA 
GAGAACCACCTCCAGTTCCAGGCCGACCCCGACGTGCTGCACAACAGCTATGCATTGC 
ATGGCATCCGCTACAAGAACGACCATATCAACCTGGCCGTGCTGGCGGATGCCGAGGG 
CAAGCCCTACCTACACGTGTCCGTGGAGTCCCGTGGCCAGCCTGTCAAGATCTATGCC 
TGCAAGGCAGGCTGCCTGGACGAGCCAGTGGAGCTGACCTCGGCGCCCACGGGCCACA 

CACACACCTGCAGGACCTGCGGCACACGCTGCACCTCAAGGCCATCCTGGCCCATGAT 



CAGTCCTGCTTTCAGCCACCATTTGCACAAGACACCCAGCACTGAAAGTCCCGCTGCC 
AGGAGCAAGGGATCCTTTGGAAGCACCCGCCCTTTGTGCCTTGTTGGGGGAAACCGGT 
GACGCAGAAGTGAGTGTGGATACACCAGAGTTTGCATTGGAAGGAATGAGTGTCACGT 
GGGGAGGGAAGGGGCCAGTGGACCTTTTGTAAGCTTTCCACTCAATAAAATGAACCTG 
TATGGCAAATACTTGAAATGGAACTCACTCCTTCCACTTTCCCCCTTTCTTCTGTCCC 
AGGAAATAGATCATCTTTTGAAAAGACTCTTGTCTAGGAAAAGTTGTGTCCTTTTCCT 
AATTTAACGTGTTCTTTCTTAATGAAGTTTTAATTTATTTTTGTTGAGATTTTGCTAG 
ATGGCTTTTGCATCCCCTGTAGATGGTGAGTGTTGGCGGTGA1GTCCGTCTCGGCGTT 
CGGAGGCCCCACGGTCCCGAGGCTGGGCCGGGGCCCCCCAGGGTGGCTGTGCTGCTGC 
CTGTAGGAGGGTGCGGGTTGTGCTGTCATCCTCGGGTTTGCACGCCCTTTTTTAGGAG 
CCTGTGGACATCTGTGGTTTTGTACTTTGGGGCTTCAGGGGAGGTGTTTAACTTTCTA 
GTGATTGATGATTGTCAGGTTTTGAAATACCAAAGCTTTTTTGTTCTGTTTTTAAATA 
AATATCTTTCAAACTTTCAAAAAAAAAAAAAAAAAA 





ORF Start: ATG at 1 | ORF Stop: TGA at 1903

SEQ ID NO: 20 | 634 aa | MW at 69142.3kD

NOV8a, CG100456-01 Protein Sequence


AAEPSACLEAATRAWRGLRERGEWPLGPGVPALVANGFLALDVAANRLWVTPGEREP 
AVAPDFVPFVQLRPLSALAEAGEAVLLLREGLLRRVRCLQLGSPGPGPVAAGPGPASV 
SGIAAGSGRDCVLLQEDFLAHRGRPHVYLQRIQLNNPTERVAALQTVGPTAGPAPNAF 
TSTLEKVGDHQ FLLYS GRSPPTPTGLVHLWVAAKKLVNRLQVAPKTQLDETVLWWH 
VSGPINPQVLKSKAAKELKALQDLARKEMLELLDMPAAELLQDHQLLWAQLFSPGVEM 
KKITDTHTPSGLTWLTLYYMLSCSPAPLLSPSLSHRERDQMESTLNYEDHCFSGHAT 
MHAENLWPGRLSSVQQILQLSDLWR1.TLQKRGCKGLVKVGAPGILOGMVLSFGGLQFT 
ENHLQFQADPDVLHNSYALHGIRYKNDHINIiAVLADAEGKPYLHVSVESRGQPVKIYA 
CKAGCLDEPVELTSAPTGHTFSVMVTQPITPLLYISTDLTHLQDLRHTLHLKAILAHD 
EHMAQQDPGLPFLFWFSVASLITLFHLFLFKLIYNEYCGPGAKPLFRSKEDPSV 



Further analysis of the NOV8a protein yielded the following properties shown in
Table 8B.

Table 8B. Protein Sequence Properties NOV8a

PSort analysis: 0.4600 probability located in plasma membrane; 0.3000 probability located in lysosome (membrane); 0.2800 probability located in endoplasmic reticulum (membrane); 0.1000 probability located in endoplasmic reticulum (lumen)

SignalP analysis: Cleavage site between residues 41 and 42



A search of the NOV8a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 8C.

Table 8C. Geneseq Results for NOV8a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV8a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAB43028 | Human ORFX ORF2792 polypeptide sequence SEQ ID NO:5584 - Homo sapiens, 454 aa. [WO200058473-A2, 05-OCT-2000] | 182..634; 2..454 | 433/453 (95%); 433/453 (95%) | 0.0
ABB63158 | Drosophila melanogaster polypeptide SEQ ID NO 16266 - Drosophila melanogaster, 606 aa. [WO200171042-A2, 27-SEP-2001] | 111..627; 96..602 | 155/557 (27%); 259/557 (45%) | 5e-55
AAB27712 | Human secreted protein #31 - Homo sapiens, 108 aa. [WO200055201-A1, 21-SEP-2000] | 456..524; 30..98 | 69/69 (100%); 69/69 (100%) | 2e-34
AAB27756 | Protein fragment encoded by gene 31 - Homo sapiens, 65 aa. [WO200055201-A1, 21-SEP-2000] | 524..585; 4..65 | 61/62 (98%); 62/62 (99%) | 5e-29
AAB27755 | Sequence homologous to protein fragment encoded by gene 31 - Homo sapiens, 65 aa. [WO200055201-A1, 21-SEP-2000] | 523..585; 2..65 | 27/66 (40%); 37/66 (55%) | 3e-05



In a BLAST search of public sequence databases, the NOV8a protein was found to
have homology to the proteins shown in the BLASTP data in Table 8D.

Table 8D. Public BLASTP Results for NOV8a

Protein Accession Number | Protein/Organism/Length | NOV8a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q91X21 | RIKEN CDNA 2510039O18 GENE - Mus musculus (Mouse), 634 aa. | 1..634; 1..634 | 582/634 (91%); 607/634 (94%) | 0.0
Q9CY11 | 2510039O18RIK PROTEIN - Mus musculus (Mouse), 634 aa. | 1..634; 1..634 | 581/634 (91%); 607/634 (95%) | 0.0
Q9BSY1 | SIMILAR TO RIKEN CDNA 2510039O18 GENE - Homo sapiens (Human), 238 aa (fragment). | 400..634; 4..238 | 235/235 (100%); 235/235 (100%) | e-138
Q9VQ60 | CG7289 PROTEIN - Drosophila melanogaster (Fruit fly), 606 aa. | 111..627; 96..602 | 155/557 (27%); 259/557 (45%) | 1e-54
Q95RT9 | LD12115P - Drosophila melanogaster (Fruit fly), 637 aa. | 111..627; 96..633 | 155/566 (27%); 269/566 (47%) | 2e-54



Pfam analysis predicts that the NOV8a protein contains the domains shown in
Table 8E.

Table 8E. Domain Analysis of NOV8a

Pfam Domain | NOV8a Match Region | Identities/Similarities for the Matched Region | Expect Value





Example 9. 



The NOV9 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 9A. 
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Table 9A. NOV9 Sequence Analysis 




SEQ ID NO: 21 | 905 bp

NOV9a, CG100466-01 DNA Sequence


CCTGCCTCCTCTTTCCTTTCAACATGACRGATBCmrTRTRTrrTTrrarr-aa^ArTT 
CCTGGCAGGTGGAGTGGCCGAAGCCATCTCCAAGACAGCGGTAGCGCCCATCGAGCAG 
GTCAAGCTGCTGCTGCAGGTGCAGCATGCCAGCAAGCAGATCACCTCAGCTAAGCAAT 
ACAAGGGCATTATAGACTGCGTGGTCCGTATTCCCAAGAAGCAGGGAGTCCTGTCCTG 
GCGCGGTAACCTGGCCAATGCCATCAGATACTTCCCCACCCAGGCTTTTAACTTCGCC 
TTCAAAGATAAATACAAGCAGATCTTCCTGGGTGGTGTGGACAAAAGAACTCAGTTTG 
GGCGCTACTTTGCAGGGAATCTGGCATCAGGAGGTGCCGCTGGGGCCACATACTTGTG 
TTTTGTGTACCCTCTTGATTTTGCCCGTACCTGTCTAGCAGCTAATGTGGGTAAAGCT 
GAAGCTGAAAGGGAATTCCGAGGCCTCTGTGACTGCCTGGTTAAGATCTACAAATCTG 
ATGGGATTAAGGGCCTGTACCAAGGCTTTAACGTGTCTATGCAGGGTATTATCCGAGC 
TGCCTACTTCGGTATCTATGACACCGCAAAGGGAATGCTTCCGGATCCCAAGGACACT 
CACATCGTCATCAGCTGGATGACCACACAGACTGTCACTGCCTTTTCTGGGTTGACTT 
CCTATTCATTTGACATCGTTCGCGTGATGATTCAGTCAGGGCGCAAAGTAACTGACAT 
CATGTACACAGGCACACTTGACTGCTGGAGGAAGATTGCTGGTGATGAAGGAGGCAAA 
GCTTTTTTCAAGGGTTCATGGTCCAGTGTTCTCAGAGGCATGGGTGGTGCTTTTGTGC 
TTGTCTTGTTTGATGAAATCAAGAAGTACAGGTAA 




ORF Start: ATG at 24 | ORF Stop: TAA at 903

SEQ ID NO: 22 | 293 aa | MW at 32215.1kD

NOV9a, CG100466-01 Protein Sequence


MTDAAVSFAKDFLAGGVAEAISKTAVAPIEQVKLLLQVQHASKQITSAKQYKGIIDCV 
VRIPKKQGVLSWRGNLANAIRYPPTQAFNFAPKDKYKQIFLGGVDKRTQFGRYFAGNL 
ASGGAAGATYLCFVYPLDFARTCIjAANVGKAEAEREFRGLCDCLVKIYKSDGIKGL-0 
GFNVSMQGIIRAAYFGIYDTAKGMLPDPKDTHIVISWMTTQTVTAFSGLTSYSFDIVR 
VMIQSGRKVTDIMYTGTLDCWRKIAGDEGGKAFFKGSWSSVLRGMGGAFVLVLFDE I K 
KYR 



Further analysis of the NOV9a protein yielded the following properties shown in
Table 9B.

Table 9B. Protein Sequence Properties NOV9a

PSort analysis: 0.6400 probability located in microbody (peroxisome); 0.4500 probability located in cytoplasm; 0.2400 probability located in lysosome (lumen); 0.1000 probability located in mitochondrial matrix space

SignalP analysis: No Known Signal Sequence Predicted



A search of the NOV9a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 9C. 






WO 03/010327 



PCT/US02/14199 



Table 9C. Geneseq Results for NOV9a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV9a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAU10379 | Human adenine nucleotide translocator 2 (ANT2) - Homo sapiens, 298 aa. [WO200185944-A2, 15-NOV-2001] | 1..292; 1..297 | 262/297 (88%); 275/297 (92%) | e-145
AAU01199 | Human adenine nucleotide translocator-2 (ANT-2) protein - Homo sapiens, 298 aa. [WO200132876-A2, 10-MAY-2001] | 1..292; 1..297 | 262/297 (88%); 275/297 (92%) | e-145
AAY71032 | Human adenine nucleotide translocator ANT2 - Homo sapiens, 298 aa. [WO200026370-A2, 11-MAY-2000] | 1..292; 1..297 | 262/297 (88%); 275/297 (92%) | e-145
AAU10380 | Human adenine nucleotide translocator 3 (ANT3) - Homo sapiens, 298 aa. [WO200185944-A2, 15-NOV-2001] | 1..291; 1..296 | 242/296 (81%); 267/296 (89%) | e-137
AAU01200 | Human adenine nucleotide translocator-3 (ANT-3) protein - Homo sapiens, 298 aa. [WO200132876-A2, 10-MAY-2001] | 1..291; 1..296 | 242/296 (81%); 267/296 (89%) | e-137



In a BLAST search of public sequence databases, the NOV9a protein was found to
have homology to the proteins shown in the BLASTP data in Table 9D.

Table 9D. Public BLASTP Results for NOV9a

Protein Accession Number | Protein/Organism/Length | NOV9a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
P05141 | ADP,ATP carrier protein, fibroblast isoform (ADP/ATP translocase 2) (Adenine nucleotide translocator 2) (ANT 2) - Homo sapiens (Human), 298 aa. | 1..292; 1..297 | 264/297 (88%); 276/297 (92%) | e-146
A29132 | ADP,ATP carrier protein T2 - human, 298 aa. | 1..292; 1..297 | 262/297 (88%); 275/297 (92%) | e-145
Q09073 | ADP,ATP carrier protein, fibroblast isoform (ADP/ATP translocase 2) (Adenine nucleotide translocator 2) (ANT 2) - Rattus norvegicus (Rat), 298 aa. | 1..292; 1..297 | 261/297 (87%); 275/297 (91%) | 
P51881 | ADP,ATP carrier protein, fibroblast isoform (ADP/ATP translocase 2) (Adenine nucleotide translocator 2) (ANT 2) - Mus musculus (Mouse), 298 aa. | 1..292; 1..297 | 260/297 (87%); 274/297 (91%) | e-144
BAB84673 | ADENINE NUCLEOTIDE TRANSLOCATOR 2 - Bos taurus (Bovine), 298 aa. | 1..292; 1..297 | 259/297 (87%); 274/297 (92%) | e-144
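
Expect values in these tables appear in three textual forms: a full exponent such as `2e-81`, a bare exponent such as `e-145`, and `0.0`. To compare hits numerically, the forms can be normalised as sketched below. The sketch assumes that a bare `e-145` is shorthand for 1e-145, which is how BLAST reports commonly abbreviate it; the function name is hypothetical.

```python
def parse_expect(value: str) -> float:
    """Convert an Expect value string from a BLAST-style table to a float.

    A bare exponent such as 'e-145' is read as 1e-145 (assumed shorthand).
    """
    text = value.strip().lower()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


hits = {"P05141": "e-146", "A29132": "e-145", "BAB84673": "e-144"}
# Smaller Expect values indicate more significant matches
best = min(hits, key=lambda accession: parse_expect(hits[accession]))
print(best)  # P05141
```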



Pfam analysis predicts that the NOV9a protein contains the domains shown in
Table 9E.

Table 9E. Domain Analysis of NOV9a

Pfam Domain | NOV9a Match Region | Identities/Similarities for the Matched Region | Expect Value
mito_carr | 7..104 | 35/125 (28%); 86/125 (69%) | 6.8e-28
mito_carr | 111..205 | 37/125 (30%); 80/125 (64%) | 1.2e-18
mito_carr | 206..293 | 22/125 (18%); 61/125 (49%) | 7.6e-07



Example 10. 



The NOV10 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 10A. 






Table 10A. NOV10 Sequence Analysis 




SEQ ID NO: 23 | 858 bp

NOV10a, CG100609-01 DNA Sequence


ACATAGGCCCAGGTCTGTGGCTGGCCAGCTGCAGCTGTGGGGCCTGGCATGTGTCTCA 
ACATGGCCCTGGAGCTCTACATGGACCTGCTGTCAGCACCCTGCCGTGCCGTCTACAT 
CTTCTCGAAGAAGCATGACATCCAGTTCAACTTTCAGTTTGTGGATCTGCTGAAAGGT 
CACCACCACAGCAAAGAATACATTGACATCAACCCCCTCAGGAAGCTTCCCAGCCTCA 
AAGATGGGAAATTTATCTTAAGTGAAAGCGCGGCCATCCTTTACTACCTGTGCCGCAA 
GTACAGCGCACCATCGCACTGGTGCCCGCCAGACCTGCACGCACGTGCCCGTGTGGAT 
GAGTTCGTGGCTTGGCAACACACGGCCTTTCAGCTGCCCATGAAGAAGATAGTCTGGC 
TCAAGTTGCTGATCCCAAAGATAACAGGGGAGGAAGTTTCAGCTGAGAAGATGGAGCA 
TGCAGTGGAAGAGGTGAAGAACAGCCTGCAGCTCTTTGAGGAGTATTTTCTGCAGGAT 
AAGATGTTCATCACCGGGAACCAAATCTCACTGGCTGACCTGGTGGCCGTGGTGGAGA 
TGATGCAGCCCATGGCAGCCAACTATAATGTCTTCCTCAACAGCTCCAAGCTAGCTGA 
GTGGCGTATGCAGGTGGAGCTGAATATTGGCTCTGGCCTCTTTAGGGAGGCCCATGAT 
CGACTAATGCAGTTGGCCGACTGGGACTTTTCAACATTGGATTCAATGGTCAAGGAGA 
BTBTTTrTGAGTTGCTGAAGAAGAGCAGGTGACCCTAAGCGCAGCCTGTCCCGCAGGG 




ORF Start: ATG at 49 | ORF Stop: TGA at 784

SEQ ID NO: 24 | 245 aa | MW at 28490.9kD

NOV10a, CG100609-01 Protein Sequence


MC1.NMA1IELVMDLLSAPCRAVYIFSKKHDIQFNFQFVDLLKGHHHSKEYIDINPLRKL 
PSLKDGKFILSESAAILYY1.CRKYSAPSHWCPPDLHARARVDEFVAWQHTAFQLPMKK 
IVWLKIiLIPKITGEEVSAEKMEHAVEEVKNSLQLFEEYFLQDKMFITGNQISLADLVA 
WEMMQPMAANYNVFLNSSKLAEWRMQVELNIGSGLFREAHDRI,MQLADWDFSTLDSM 
VKENISELLKKSR 
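
A molecular weight of the kind reported alongside each protein sequence can be estimated by summing average residue masses and adding one water molecule for the free termini. The sketch below is one way to do this under stated assumptions: the masses are commonly tabulated average values and should be checked against a reference before use, the result is in daltons, and the method used to produce the figures in these tables is not described in the text. The function deliberately raises an error on any character that is not one of the 20 standard residues, which is useful when a sequence may contain transcription damage.

```python
# Average residue masses in daltons (commonly tabulated values; verify before use).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water per chain for the free N- and C-termini


def molecular_weight(protein: str) -> float:
    """Estimated average molecular weight in daltons of an unmodified peptide."""
    if not protein:
        raise ValueError("empty sequence")
    # A KeyError here signals a character outside the 20 standard residues
    return sum(RESIDUE_MASS[residue] for residue in protein) + WATER_MASS


# Hypothetical usage on the final line of the NOV10a protein sequence
print(round(molecular_weight("VKENISELLKKSR"), 1))
```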



Further analysis of the NOV10a protein yielded the following properties shown in
Table 10B.

Table 10B. Protein Sequence Properties NOV10a

PSort analysis: 0.4826 probability located in microbody (peroxisome); 0.4708 probability located in mitochondrial matrix space; 0.1732 probability located in mitochondrial inner membrane; 0.1732 probability located in mitochondrial intermembrane space

SignalP analysis: No Known Signal Sequence Predicted



A search of the NOV10a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 10C.

Table 10C. Geneseq Results for NOV10a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV10a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAB85772 | Human drug metabolizing enzyme (ID No. 7472055CD1) - Homo sapiens, 241 aa. [WO200159127-A2, 16-AUG-2001] | 5..245; 1..241 | 237/241 (98%); 238/241 (98%) | e-136
AAY07034 | Breast cancer associated antigen precursor sequence - Homo sapiens, 240 aa. [WO9904265-A2, 28-JAN-1999] | 5..237; 1..231 | 116/233 (49%); 156/233 (66%) | 9e-63
AAB54309 | Human pancreatic cancer antigen protein sequence SEQ ID NO:761 - Homo sapiens, 255 aa. [WO200055320-A1, 21-SEP-2000] | 5..237; 15..246 | 111/233 (47%); 152/233 (64%) | 5e-59
AAU30356 | Novel human secreted protein #847 - Homo sapiens, 420 aa. [WO200179449-A2, 25-OCT-2001] | 5..223; 107..331 | 93/226 (41%); 132/226 (58%) | 1e-38
AAM03015 | Peptide #1697 encoded by probe for measuring breast gene expression - Homo sapiens, 64 aa. [WO200157270-A2, 09-AUG-2001] | 181..244; 1..64 | 62/64 (96%); 63/64 (97%) | 5e-29



In a BLAST search of public sequence databases, the NOV10a protein was found to
have homology to the proteins shown in the BLASTP data in Table 10D.

Table 10D. Public BLASTP Results for NOV10a

Protein Accession Number | Protein/Organism/Length | NOV10a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q9D4P7 | 4930583C14RIK PROTEIN - Mus musculus (Mouse), 240 aa | 5..243 | 181/239 (75%); 210/239 (87%) | e-103
Q969K8 | GLUTATHIONE S-TRANSFERASE TT1 (GLUTATHIONE S-TRANSFERASE THETA 1) (GLUTATHIONE TRANSFERASE T1-1) (EC 2.5.1.18) - Homo sapiens (Human), 240 aa. | 5..237; 1..231 | 117/233 (50%); 157/233 (67%) | 3e-63
Q96IY3 | GLUTATHIONE S-TRANSFERASE THETA 1 - Homo sapiens (Human), 240 aa. | 5..237; 1..231 | 117/233 (50%); 156/233 (66%) | 1e-62
S44358 | glutathione transferase (EC 2.5.1.18) theta-1 [validated] - human, 240 aa. | 5..237; 1..231 | 116/233 (49%); 156/233 (66%) | 2e-62
P30711 | Glutathione S-transferase theta 1 (EC 2.5.1.18) (GST class-theta) - Homo sapiens (Human), 239 aa. | 7..237; 2..230 | 115/231 (49%); 155/231 (66%) | 9e-62



Pfam analysis predicts that the NOV10a protein contains the domains shown in
Table 10E.

Table 10E. Domain Analysis of NOV10a

Pfam Domain | NOV10a Match Region | Identities/Similarities for the Matched Region | Expect Value
GST_N | 4..80 | 26/90 (29%); 57/90 (63%) | 2.7e-12
GST_C | 90..199 | 23/115 (20%); 75/115 (65%) | 5.2e-09



Example 11. 



The NOV11 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 11A.

MISSING AT THE TIME OF PUBLICATION

ORF Start: ATG at 127 | ORF Stop: TAG at 721

SEQ ID NO: 28 | 198 aa | MW at 21674.4kD

NOV11b, CG100631-02 Protein Sequence


MAELDPFGAPAGAPGGPALGNGVAGAGEEDPAAAFLAQQESEIAGIENDEAFAILDGG 
APGPQPHGEPPGGPDAVDGVMNGEYYQESNGPTDSYAAISQVDRLQSEPESIRKWREE
QMERLEALDANSRKQEAEWKEKAIKELEEWYARQDEQLQKTKANNSTNINHPCYSLEQ 
AAEEALHSHWINYVEFFWTKPFCL 




SEQ ID NO: 29 | 735 bp

NOV11c, CG100631-04 DNA Sequence


CCCAGTCGGCACCACAGCGGTGGCTGCCGGGCGTGGTGTCGGTGGGTTGGTTGGTTTT 
TGTCTCACCGTTGGTGTCCGTGCCGTTCAGTTGCCCGCCATGGCTGAGCTGGATCCGT 
TCGGCGCCCCTGCCGGCGCCCCTGGCGGTCCCGCGCTGGGGAACGGAGTGGCCGGCGC 
CGGCGAAGAAGACCCGGCCGCGGCCTTCTTGGCGCAGCAAGAGAGCGAGATTGCGGGC 
ATCGAGAACGACGAGGCCTTCGCCAATTCTCGGAAGCAAGAAGCAGAGTGGAAAGAAA 
AGGCAATAAAGGAGCTAGAAGAATGGTATGCAAGACAGGACGAGCAGCTACAGAAAAC 
AAAAGCAAACAACAGGGCAGCAGAAGAAGCCTTTGTAAATGACATTGACGAGTCGTCC
CCAGGCACTGAGTGGGAACGGGTGGCCCGGCTGTGTGACTTTAACCCCAAGTCTAGCA 
AGCAGGCCAAAGATGTCTCCCGCATGCGCTCAGTCCTCATCTCCCTCAAGCAGGCCCC 
GCTGGTGCACTGAAGAGCCACCCTGTGGAAACACTACATCTGCAATATCTTAATCCTA 
CTCAGTGAAGCTCTTCACAGTCATTGGATTAATTATGTTGAGTTCTTTTGGACCAAAC 
CTTTTTGTCTTTAGAGTTGTTCATTGTTTGTGATTGCATGTTTCCTTCCTTCAACTGT 
GTTCTGCCTGGCATTCAGAGAGGAGGGAGAGGAGGAAGA 




ORF Start: ATG at 98 | ORF Stop: TGA at 533

SEQ ID NO: 30 | 145 aa | MW at 15769.2kD

NOV11c, CG100631-04 Protein Sequence


MAELDPFGAPAGAPGGPALGNGVAGAGEEDPAAAFLAQQESEIAGIENDEAFANSRKQ 
EAEWKEKAIKELEEWYARQDEQLQKTKANNRAAEEAFVNDIDESSPGTEWERVARi CD 
FNPKSSKQAKDVSRMRSVLISLKQAPLVH 
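
The relationship between a DNA sequence, its reported ORF start, and the encoded protein can be illustrated by translating the first few codons. The sketch below uses the standard genetic code and the first three lines of the NOV11c DNA sequence as printed above, with the ORF start taken as the 1-based position 98. The helper names are illustrative, and only the first twelve codons are translated.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame from its first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three printed lines of the NOV11c DNA sequence (SEQ ID NO: 29)
nov11c_prefix = (
    "CCCAGTCGGCACCACAGCGGTGGCTGCCGGGCGTGGTGTCGGTGGGTTGGTTGGTTTT"
    "TGTCTCACCGTTGGTGTCCGTGCCGTTCAGTTGCCCGCCATGGCTGAGCTGGATCCGT"
    "TCGGCGCCCCTGCCGGCGCCCCTGGCGGTCCCGCGCTGGGGAACGGAGTGGCCGGCGC"
)
orf_start = 98                    # 1-based position of the ATG, as reported
first_base = orf_start - 1        # convert to a 0-based index
print(translate(nov11c_prefix[first_base:first_base + 36]))  # MAELDPFGAPAG
```

The twelve residues obtained match the start of the NOV11c protein sequence listed above.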



Sequence comparison of the above protein sequences yields the following
sequence relationships shown in Table 11B.

Table 11B. Comparison of NOV11a against NOV11b and NOV11c.

Protein Sequence | NOV11a Residues/Match Residues | Identities/Similarities for the Matched Region
NOV11b | 1..161; 1..161 | 125/161 (77%); 125/161 (77%)
NOV11c | 126..236; 53..145 | 93/111 (83%); 93/111 (83%)



Further analysis of the NOV11a protein yielded the following properties shown in
Table 11C.

Table 11C. Protein Sequence Properties NOV11a

PSort analysis: 0.4500 probability located in cytoplasm; 0.3000 probability located in microbody (peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 probability located in lysosome (lumen)

SignalP analysis: No Known Signal Sequence Predicted

A search of the NOV11a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 11D.



Table 11D. Geneseq Results for NOV11a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV11a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
ABB71527 | Drosophila melanogaster polypeptide SEQ ID NO 41373 - Drosophila melanogaster, 219 aa. [WO200171042-A2, 27-SEP-2001] | 20..234; 4..213 | 77/231 (33%); 118/231 (50%) | 2e-27
AAB57165 | Human prostate cancer antigen protein sequence SEQ ID NO:1743 - Homo sapiens, 116 aa. [WO200055174-A1, 21-SEP-2000] | 10..79 | 39/71 (54%); 42/71 (58%) | 9e-11
AAY87249 | Human signal peptide containing protein HSPP-26 SEQ ID NO:26 - Homo sapiens, 82 aa. [WO200000610-A2, 06-JAN-2000] | 161..180; 18..37 | 19/20 (95%); 20/20 (100%) | 0.001
ABB62631 | Drosophila melanogaster polypeptide SEQ ID NO 14685 - Drosophila melanogaster, 305 aa. [WO200171042-A2, 27-SEP-2001] | 89..171; 199..282 | 26/86 (30%); 47/86 (54%) | 0.007
AAY71045 | Streptococcus pyogenes strain AP49 partial GRAB protein - Streptococcus pyogenes, 271 aa. [WO200026240-A2, 11-MAY-2000] | 73..219; 98..253 | 46/160 (28%); 69/160 (42%) | 0.025



In a BLAST search of public sequence databases, the NOV11a protein was found to
have homology to the proteins shown in the BLASTP data in Table 11E.

Table 11E. Public BLASTP Results for NOV11a

Protein Accession Number | Protein/Organism/Length | NOV11a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
P09496 | Clathrin light chain A (Lca) - Homo sapiens (Human), 248 aa. | 1..236; 1..248 | 236/248 (95%); 236/248 (95%) | e-133
LRRTA1 | clathrin light chain A 1 - rat, 286 aa (fragment). | 1..236; 39..286 | 231/248 (93%); 234/248 (94%) | e-130
P08081 | Clathrin light chain A (Lca) - Rattus norvegicus (Rat), 248 aa. | 1..236; 1..248 | 231/248 (93%); 234/248 (94%) | e-130
O08585 | Clathrin light chain A (Lca) - Mus musculus (Mouse), 235 aa. | 1..236; 1..235 | 220/236 (93%); 223/236 (94%) | e-123
P04973 | Clathrin light chain A (Lca) - Bos taurus (Bovine), 243 aa. | 1..236; 1..243 | 223/248 (89%); 226/248 (90%) | e-121



Pfam analysis predicts that the NOV11a protein contains the domains shown in
Table 11F.

Table 11F. Domain Analysis of NOV11a

Pfam Domain | NOV11a Match Region | Identities/Similarities for the Matched Region | Expect Value
Clathrin_lg_ch | 3..235 | 154/257 (60%); 226/257 (88%) | 5.2e-142



Example 12. 



The NOV12 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 12A.

Table 12A. NOV12 Sequence Analysis

SEQ ID NO: 31 | 1849 bp

NOV12a, CG100710-01 DNA Sequence


CATGTCGTGGCTCTTCGGCGTTAACAAGGGCCCCAAGGGTGAAGGCGCGGGGCCGCCG 
CCGCCTTTGCCGCCCGCGCAGCCCGGGGCCGAGGGCGGCGGGGACCGCGGTTTGGGAG 
ACCGGCCGGCGCCCAAGGACAAATGGAGCAACTTCGACCCCACCGGCCTGGAGCGCGC 
CGCCAAGGCGGCGCGCGAGCTGGAGCACTCGCGTCAGACACAGGAGCGGCTGTCAGGC 
AGTGCCAGCCCTGAGCAAGTGCCAGCTGGTGAGTGCTGTGCTCTGCAGGAGTATGAGG 
CCGCCGTGGAGCAGCTCAAGAGCGAGCAGATCCGGGCGCAGGCTGAGGAGAGGAGGAA 
GACCCTGAGCGAGGAGACCCGGCAGCACCAGGCCAGGGCCCAGTATCAAGACAAGCTG 
GCCCGGCAGCGCTACGAGGACCAACTGAAGCAGCAGCAACTTCTCAATGAGGAGAATT 
TACGGAAGCAGGAGGAGTCCGTGCAGAAGCAGGAAGCCATGCGGCGAGCCACCGTGGA 
GCGGGAGATGGAGCTGCGGCACAAGAATGAGATGCTGCGAGTGGAGACCGAGGCCCGG 
GCGCGCGCCAAGGCCGAGCGGGAGAATGCAGACATCATCCGCGAGCAGATCCGCCTGA 
AGGCGTCCGAGCACCGTCAGACCGTCTTGGAGTCCATCAGGACGGCTGGCACCTTGTT 
TGGGGAAGGATTCCGTGCCTTTGTGACAGACCGGGACAAAGTGACAGCCACGGTAGCT 
GGGCTGACGCTGCTGGCTGTCGGGGTCTACTCAGCCAAGAATGCGACAGCCGTCACTG 
GCCGCTTCATCGAGGCTCGGCTGGGGAAGCCGTCCCTAGTGAGGGAGACGTCCCGCAT 
CACGGTGCTGGAGGCGCTGCGGCACCCCATCCAGGTAAGCCGGCGGCTCCTCAGTCGA 
CCCCAGGACGTGCTGGAGGGTGTTGTGCTTAGTCCCAGCCTGGAAGCACGGGTGCGCG 
ACATCGCCATAGCAACCAGGAACACCAAGAAGAACCGGGGCCTGTACAGGCACATCCT 
GCTGTATGGGCCACCAGGCACCGGGAAGACGCTGTTTGCCAAGAAACTCGCCCTGCAC 
TCAGGCATGGACTACGCCATCATGACAGGCGGGGACGTGGCCCCCATGGGGCGGGAAG 
GCGTGACCGCCATGCACAAGCTCTTTGACTGGGCCAATACCAGCCGGCGCGGGCTCCT 
GCTCTTCATGGATGAAGCAGACGCCTTCCTTCGGAAGCGAGCCACTGAGGAGATAAGC 
AAGGACCTCAGAGCCACACTGAACGCCTTCCTGTACCACATGGGCCAACACAGCAACA 
AGTTCATGCTGGTCCTGGCCAGCAATCTGCCTGAGCAGTTCGACTGTGCCATCAACAG 
CCGCATTGACGTGATGGTCCACTTCGACCTGCCGCAGCAGGAGGAGCGGGAGCGCCTG 
GTGAGACTGCATTTTGACAACTGTGTTCTTAAGCCGGCCACAGAAGGAAAACGGCGCC 
TGAAGCTGGCCCAGTTTGACTACGGGAGGAAGTGCTCGGAGGTCGCTCGGCTGACGGA 
GGGCATGTCGGGCCGGGAGATCGCTCAGCTGGCCGTGTCCTGGCAGGCCACGGCGTAT
GCCTCCGAGGACGGGGTCCTGACCGAGGCCATGATGGACACCCGCGTGCAAGATGCTG 
TCCAGCAGCACCAGCAGATGATGCGCTGGCTGAAGAGGGGGAGGCCTGGGCCCGAGGA 
CGAGCAACCCTCATCCTGAGTCCATGGGGAGACCACACCTrpCG, \ i i rGG i 3 GC 
AC CCCTC C C ACCC CTGC CTTTGCGGCCCCG CACATTTAGGAAATACTCCCC 




ORF Start: ATG at 2 | ORF Stop: TGA at 1757

SEQ ID NO: 32 | 585 aa | MW at 65861.1kD

NOV12a, CG100710-01 Protein Sequence


MSWLFGVNKGPKGEGAGPPPPLPPAQPGAEGGGDRGLGDRPAPKDKWSNFDPTGLERA 
AKAARELEHSRQTQERLSGSASPEQVPAGECCALQEYEAAVEQLKSEQIRAQAEERRK 
TLSEETRQHQARAQYQDKLARQRYEDQLKQQQLLNEENLRKQEESVQKQEAMRRATVE 
REMELRHKNEMLRVETEARARAKAERENADIIREQIRLKASEHRQTVLESIRTAGTLF 
GEGFRAPVTDRDKVTATVAGLTLLAVGVYSAKNATAVTGRFIEARLGKPSLVRETSRI 
TVLEALRHPIQVSRRLLSRPQDV1EGWLSPSLEARVRDIAIATRNTKKKRGLYRHIL 
LYGPPGTGKTLFAKKLALHSGMDYAIMTGGDVAPMGREGVTAMHKLFDWANTSRRGLL 
LFMDEADAFLRKRATEEISKDLRATLNAFLYHMGQHSNKFMLVLASNLPEQFDCAINS 
RIDVMVHFDLPQQEERERLVRLHFDNCVLKPATEGKRRLKLAQFDYGRKCSEVARLTE 
GMSGREIAQIAVSWQATAYASEDGVLTEAMMDTRVQDAVQQHQQMMRWLKRGRPGPED 
EQPSS 



Further analysis of the NOV12a protein yielded the following properties shown it 
Table 12B. 



Table 12B. Protein Sequence Properties NOV12a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1524 probability located in lysosome (lumen); 0.1 000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV 12a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 12C. 



Table 12C. Geneseq Results for NOV12a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV12a Residues; Match Residues | Identities; Similarities for the Matched Region | Expect Value
AAB93086 | Human protein sequence SEQ ID NO: 11926 - Homo sapiens, 586 aa. [EP1074617-A2, 07-FEB-2001] | 1..585; 1..586 | 529/586 (90%); 547/586 (93%) | 0.0
AAM39035 | Human polypeptide SEQ ID NO 2180 - Homo sapiens, 602 aa. [WO200153312-A1, 26-JUL-2001] | 70..585; 25..540 | 502/516 (97%); 505/516 (97%) | 0.0
AAB94853 | Human protein sequence SEQ ID NO: 16038 - Homo sapiens, 480 aa. [EP1074617-A2, 07-FEB-2001] | 168..585; 1..418 | 404/418 (96%); 407/418 (96%) | 0.0
ABB71478 | Drosophila melanogaster polypeptide SEQ ID NO 41226 - Drosophila melanogaster, 599 aa. [WO200171042-A2, 27-SEP-2001] | 1..570; 1..576 | 342/579 (59%); 423/579 (72%) | 0.0
AAB63264 | Human breast cancer associated antigen protein sequence SEQ ID NO:626 - Homo sapiens, 208 aa. [WO200073801-A2, 07-DEC-2000] | 279..474; 3..198 | 181/196 (92%); 187/196 (95%) | 8e-99
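
The identity percentages listed in Table 12C appear to be the ratio of matched to aligned positions with the fractional part dropped (for example, 502/516 is about 97.29%). The sketch below reproduces the identity column under that assumption; whether the search software truncates or rounds is not stated in the text, and the function name `percent_truncated` is illustrative only. The check is applied to the identity counts only.

```python
def percent_truncated(matches: int, aligned: int) -> int:
    """Percentage of matching positions with the fraction dropped."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return 100 * matches // aligned


# (identifier, identities, aligned length, reported percent) from Table 12C
TABLE_12C_IDENTITIES = [
    ("AAB93086", 529, 586, 90),
    ("AAM39035", 502, 516, 97),
    ("AAB94853", 404, 418, 96),
    ("ABB71478", 342, 579, 59),
    ("AAB63264", 181, 196, 92),
]

for identifier, matches, aligned, reported in TABLE_12C_IDENTITIES:
    computed = percent_truncated(matches, aligned)
    print(identifier, computed, reported, computed == reported)
```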



In a BLAST search of public sequence databases, the NOV12a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 12D. 
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Table 12D. Public BLASTP Results for NOV12a 


Protein Accession Number | Protein/Organism/Length | NOV12a Residues; Match Residues | Identities; Similarities for the Matched Portion | Expect Value
Q9BUK4 | SIMILAR TO HYPOTHETICAL PROTEIN FLJ10709 (HYPOTHETICAL 72.6 KDA PROTEIN) - Homo sapiens (Human), 648 aa. | 1..586 | 551/586 (94%); 561/586 (95%) | 0.0
Q9NVI7 | CDNA FLJ10709 FIS, CLONE NT2RP3000869 - Homo sapiens (Human), 586 aa. | 1..585; 1..586 | 529/586 (90%); 547/586 (93%) | 0.0
Q96A50 | HYPOTHETICAL 66.3 KDA PROTEIN (UNKNOWN) (PROTEIN FOR MGC:14291) (PROTEIN FOR MGC:20264) - Homo sapiens (Human), 586 aa. | 1..585; 1..586 | 528/586 (90%); 546/586 (93%) | 0.0
Q96T67 | TOB3 - Homo sapiens (Human), 578 aa. | 1..572; 1..572 | 527/573 (91%); 541/573 (93%) | 0.0
Q925I1 | TOB3 - Mus musculus (Mouse), 591 aa. | 1..583; 1..583 | 500/584 (85%); 540/584 (91%) | 0.0



PFam analysis predicts that the NOV 12a protein contains the domains shown in 
the Table 12E. 



Table 12E. Domain Analysis of NOV12a 


Pfam Domain | NOV12a Match Region | Identities; Similarities for the Matched Region | Expect Value
AAA | 346..542 | 60/221 (27%); 140/221 (63%) | 2.6e-32



Example 13. 



The NOV 13 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 13A. 
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Table 13A. NOV13 Sequence Analysis 




SEQ ID NO: 33 


1705 bp 


NOV13a, 
CG100730-01 DNA Sequence 


GATAAATGGAGGAAGCT 4AGAC :TCACTGAGCCTGTGGCTCCGGCCGCTGAGTGATGG 


TTGAGTAGCACAGGGTCTCAAGCTGTGGCCATGGACTTGGGCAACAGGAAATTAGAAA 
TATCCTCTGGAAAGCTGGCCAGATTTGCAAATGGCTCTGCTGTAATACAGTCAGGTGA 
CACTGCGGTAATGGCCACAGCCGTCAGTAAAACAAAACCTTCCCCTTCCCAGTTTATG 
CCTTTGGTGGTTGACTACGGACAAAAGGCTGCTGCAGTAGGTAGAATTCCCACAAACC 

CTCTGAGAAGAGAGATTAGTACTTCTGATAAAGAAATTCTTACACGTGGAATAGTAGA 
TTGTTCAGTTAGACCTATCTTTCCAGCTGGCTACTTCTGTGATATACAGGTTCTGTGT 
AATCTGTTAGCAGTAGATGGTATTAATGAACTTGATGTCCTAGCAATTAATGGCARCT 
CTGTAGCCCTCTCATTATCAGATATTCTTTGGAATGGACCTGTTGGGACAGTAAGAAT 
AGGAATGACTGATGGRGARTGTGTTGTTAACCCAACAAGAAAAGAAATGTCTTCTAGT 
ACTTTAAATGTAGTGGTTGCTGGAGCACCTAAAAGTCAGACTGTCATGTTGGAAGCCT 
CTGCAGAGAACATTTTACAGCAGGACTTTTGCCACGCTATCAAAGTGGGAGTGAAATA 
TACCCAGCAAATAATTCAGGGCATCCAGCAGCTGGTAAAAGAAATTGGTGTTACCARG 
AGGACACCTGAGAAGTTATTTACCCCTTCACAAGAGATTGTGAAACATGCTCATAAAC 
TCACTATGGAGAGACTCTATGCAGTTTTTACAGATTATGAACATGATAAAATTTCCAG 
AGATGAAGCTGTTAACAAAATAAGATTAGATACAGAGGAACAACTAAAAGAAATATTT 
CCAGAGGTCGATCTATATGAGATAATAGAATCCTTCAATACTGTTGCAAAGAAAGTTT 
TTAGAAGTATTATTTTGAATGAATACAAAAGATGCGATGGTCGGGATTTGACTTTACT 
TCGGAATATAAGTTGTGAGGTAGATATGTTTAAAACCCTTCATGGATCAGAATTATTT 
CAAAGAGGACAAACACAGCTGCTTTGTGCTGTTACRTTTGATTCATTAGAATCCAGTA 
TTAAGTTGGATCGAGTTATAACAACTATAAATGGGATAAAAGATAAAAATTTCATGCT 
GCACTATGAGTTTCCTCCTTATGCAACTAATGAAATTGTCAAAGTCACCGGTATAAAT 




AAGATTTTCCTTTTGCCATASGAGTTACATCTGAAGTCCTAGAGTCAAATGGGTCATC 
TTCTATGGCATCTGCATGTGGTGGAAGTTTAGCATTAATGGATTCAGGGCTTCCAATT 
AAATCTGCTGTCACAGGTGTAGCAATGGGATTGGCCACCAAAACCGATCTTGAGAAGG 
GTGAAATAGAAGATTATCATTTGCTAACAGATATTCTGGGAATTGARGCTTACAATGG 
TTGACATGGACTTCAAAATAGCTGGCACTAATAAAGGAATAACTGCATTACAGGCAGA 
TATTAAATTACCTGGAATAACAA 




ORF Start: ATG at 6 


ORF Stop: TGA at 1626 




SEQ ID NO: 34 


540 aa | MW at 59197.4kD 


NOV13a, 
CG100730-01 Protein Sequence 


MEEAKTSLSLWLRPLSDGPFCLPGWDRALIQLQVRALLSSTGSQAVAMDLGNRKLEIS 
SGKLARFANGSAVIQSGDTAVMATAVSKTKPSPSQFMPLVVDYGQKAAAVGRIPTNPL 
RREISTSDKEILTRGIVDCSVRPIFPAGYFCDIQVLCNLLAVDGINELDVLAINGNSV 
ALSLSDILWNGPVGTVRIGMTDGECVVNPTRKEMSSSTLNVVVAGAPKSQTVMLEASA 
ENILQQDFCHAIKVGVKYTQQIIQGIQQLVKEIGVTKRTPQKLFTPSQEIVKHAHKLT 
MERLYAVFTDYEHDKISRDEAVNKIRLDTEEQLKEIFPEVDLYEIIESFNTVAKKVFR 
SIILNEYKRCDGRDLTLLRNISCEVDMFKTLHGSELFQRGQTQLLCAVTFDSLESSIK 
LDRVITTINGIKDKNFMLHYEFPPYATNEIVKVTGINRRELGPDALAEKALYPVIPKD 
FPFAIRVTSEVLESNGSSEMASACGGSLALMDSGLPIKSAVTGVAMGLATKTDLEKGE 
IEDYHLLTDILGIEAYNG 



Further analysis of the NOV13a protein yielded the following properties shown in 
Table 13B. 



Table 13B. Protein Sequence Properties NOV13a 


PSort analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No known Signal Sequence Predicted 
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A search of the NOV13a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 13C. 




Table 13C. Geneseq Results for NOV13a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV13a Residues; Match Residues | Identities; Similarities for the Matched Region | Expect Value
AAB08732 | Amino acid sequence of a human OLD-35 polypeptide - Homo sapiens, 705 aa. [WO200046391-A2, 10-AUG-2000] | 17..540; 1..524 | 455/524 (86%); 477/524 (90%) | 0.0
AAB92684 | Human protein sequence SEQ ID NO: 11065 - Homo sapiens, 504 aa. [EP1074617-A2, 07-FEB-2001] | 10..466; 11..467 | 396/457 (86%); 416/457 (90%) | 0.0
ABG17275 | Novel human diagnostic protein #17266 - Homo sapiens, 899 aa. [WO200175067-A2, 11-OCT-2001] | 192..540; 522..852 | 328/349 (93%); 330/349 (93%) | 0.0
ABG08546 | Novel human diagnostic protein #8537 - Homo sapiens, 899 aa. [WO200175067-A2, 11-OCT-2001] | 192..540; 522..852 | 328/349 (93%); 330/349 (93%) | 0.0
ABG17275 | Novel human diagnostic protein #17266 - Homo sapiens, 899 aa. [WO200175067-A2, 11-OCT-2001] | 192..540; 522..852 | 328/349 (93%); 330/349 (93%) | 0.0



In a BLAST search of public sequence databases, the NOV13a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 13D. 
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Table 13D. Public BLASTP Results for NOV13a 


Protein Accession Number | Protein/Organism/Length | NOV13a Residues; Match Residues | Identities; Similarities for the Matched Portion | Expect Value
Q9DC52 | 1200003F12RIK PROTEIN - Mus musculus (Mouse), 540 aa. | 12..533; 13..534 | 434/522 (83%); 467/522 (89%) | 0.0
Q96T05 | CDNA FLJ14531 FIS, CLONE NT2RM2000371, WEAKLY SIMILAR TO POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE (EC 2.7.7.8) - Homo sapiens (Human), 504 aa. | 11..467 | 396/457 (86%); 416/457 (90%) | 0.0
Q9V9X7 | CG11337 PROTEIN - Drosophila melanogaster (Fruit fly), 748 aa. | 35..540; 29..540 | 262/512 (51%); 354/512 (68%) | e-143
Q95RX7 | LD03255P - Drosophila melanogaster (Fruit fly), 720 aa. | 55..540; 1..489 | 257/489 (52%); 342/489 (69%) | e-142
Q9S7G6 | POLYNUCLEOTIDE PHOSPHORYLASE - Arabidopsis thaliana (Mouse-ear cress), 991 aa. | 49..540; 58..545 | 201/494 (40%); 307/494 (61%) | e-104



PFam analysis predicts that the NOV 13a protein contains the domains shown in 
the Table 13E. 



Table 13E. Domain Analysis of NOV13a 


Pfam Domain | NOV13a Match Region | Identities; Similarities for the Matched Region | Expect Value
RNase_PH | 48..250 | 58/255 (23%); 137/255 (54%) | 4.5e-20
RNase_PH | 357..537 | 58/260 (22%); 136/260 (52%) | 3.2e-15



Example 14. 



The NOV 14 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 14A. 
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Table 14A. NOV14 Sequence Analysis 




SEQ ID NO: 35 | 2626 bp 


NOV 14a, 
CG100819-01 DNA Sequence 



CCTGCGTTGACCGCGTGCCGGGTGTCATGGCGGCCTGCAGGTACTGCTGCTCGTGCCT 
CCGGCTCCGGCCCCTGAGCGATGGTCCTTTCCTTCTGCCACGGCGGGATCGGGCACTC 
ACCCAGTTGCAAGTGCGAGCACTATGGAGTAGCGCAGGGTCTCGAGCTGTGGCCGTGG 
ACTTAGGCAACAGGAAATTAGAAATATCTTCTGGAAAGCTGGCCAGATTTGCAGATGG 
CTCTGCTGTAGTACAGTCAGGTGACACTGCAGTAATGGTCACAGCGGTCAGTAAAACA 
AAACCTTCCCCTTCCCAGTTTATGCCTTTGGTGGTAGACTACAGACAAAAAGCTGCTG 
CAGCAGGTAGAATTCCCACAAACTATCTGAGAAGAGAGATTGGTACTTCTGATAAAGA 
AATTCTAACAAGTCGAATAATAGATCGTTCAATTAGACCGCTCTTTCCAGCTGGCTAC 
TTCTATGATACACAGGTACTGTGTAATCTGTTAGCAGTAGATGGTGTAAATGAGCCTG 
ATGTCCTAGCAATTAATGGCGCTTCCGTAGCCCTCTCATTATCAGATATTCCTTGGAA 
TGGACCTGTTGGTGCAGTACGAATAGGAATAATTGATGGAGAATATGTTGTTAACCCA 
ACAAGAAAAGAAATGTCTTCTAGTACTTTAAATTTAGTGGTTGCTGGAGCACCTAAAA 
GTCAGATTGTCATGTTGGAAGCCTCTGCAGAGAACATTTTACAGCAGGACTTTTGCCA 
TGCTATCAAAGTGGGAGTGAAATATACCCAACAAATAATTCAGGGCATTCAGCAGTTG 
GTAAAAGAAACTGGTGTTACCAAGAGGACACCTCAGAAGTTATTTACCCCTTCGCCAG 
AGATTGTGAAATATACTCATAAACTTGCTATGGAGAGACTCTATGCAGTTTTTACAGA 
TTACGAGCATGACAAAGTTTCCAGAGATGAAGCTGTTAACAAAATAAGATTAGATACG 
GAGGAACAACTAAAAGAAAAATTTCCAGAAGCCGATCCATATGAAATAATAGAATCCT 
TCAATGTTGTTGCAAAGGAAGTTTTTAGAAGTATTGTTTTGAATGAATACAAAAGGTG 
CGATGGTCGGGATTTGACTTCACTTAGGAATGTAAGTTGTGAGGTAGATATGTTTAAA 
ACCCTTCATGGATCAGCATTATTTCAAAGAGGACAAACACAGGTGCTTTGTACCGTTA 
CATTTGATTCATTAGAATCTGGTATTAAGTCAGATCAAGTTATAACAGCTATAAATGG 
GATAAAAGATAAAAATTTCATGCTGCACTACGAGTTTCCTCCTTATGCAACTAATGAA 
ATTGGCAAAGTCACTGGTTTAAATAGAAGAGAACTTGGGCATGGTGCTCTTGCTGAGA 
AAGCTTTGTATCCTGTTATTCCCAGAGATTTTCCTTTCACCATAAGAGTTACATCTGA 
AGTCCTAGAGTCAAATGGGTCATCTTCTATGGCATCTGCATGTGGCGGAAGTTTAGCA 
TTAATGGATTCAGGGGTTCCAATTTCATCTGCTGTTGCAGGCGTAGCAATAGGATTGG 
TCACCAAAACCGATCCTGAGAAGGGTGAAATAGAAGATTATCGTTTGCTGACAGATAT 
TTTGGGAATTGAAGATTACAATGGTGACATGGACTTCAAAATAGCTGGCACTAATAAA 
GGAATAACTGCATTACAGGCTGATATTAAATTACCTGGAATACCAATAAAAATTGTGA 
TGGAGGCTATTCAACAAGCTTCAGTGGCAAAAAAGGAGATATTACAGATCATGAACAA 
AACTATTTCAAAACCTCGAGCATCTAGAAAAGAAAATGGACCTGTTGTAGAAACTGTT 
CAGGTTCCATTATCAAAACGAGCAAAATTTGTTGGACCTGGTGGCTATAACTTAAAAA 
AACTTCAGGCTGAAACAGGTGTAACTATTAGTCAGGTGGATGAAGAAACGTTTTCTGT 
ATTTGCACCAACACCCAGTGTTATGCATGAGGCAAGAGACTTCATTACTGAAATCTGC 
AAGGATGATCAGGAGCAGCAATTAGAATTTGGAGCAGTATATACCGCCACAATAACTG 
AAATCAGAGATACTGGTGTAATGGTAAAATTATATCCAAATATGACTGCGGTACTGCT 
TCATAACACACAACTTGATAACGAAAGATTAAACATCCTACTGCCCTAGGATTAGAAG 
TTGGCCAAGAAATTCAGGTGAAATACTTTGGACGTGACCCAGCCGATGGAAGAATGAG 
GCTTTCTCGAAAAGTGCTTCAGTCGCCAGCTACAACCGTGGTCAGAACTTTGAATGAC 
AGAAGTAGTATTGTAATGGGAGAACCTATTTCACAGTCATCATCTAATTCTCAGTGAT 
TTTTTTTTTTTAAAGAGAATTCTAGAATTCTATTTTGTCTAGGGTGATGTGCTGTAGA 
GCAACATTTTAGTAGATCTTCCATTGTGTAGATTTCTATATAATATAAATACATTTTA 
ATTATTTGTACTAAAATGCTCATTTACATGTGCCATTTTTTTAATTCGAGTAACCCAT 
ATTTGTTTAATTGTATTTACATTATAAATCAAGAAATATTTATTATTAAAAGTAAGTC 
ATTTATACATCTTAGA 




ORF Start: ATG at 27 


ORF Stop: TAG at 2193 




SEQ ID NO: 36 


722 aa | MW at 79307.2kD 


NOV 14a, 
CG100819-01 Protein Sequence 


MAACRYCCSCLRLRPLSDGPFLLPRRDRALTQLQVRALWSSAGSRAVAVDLGNRKLEI 
SSGKLARFADGSAVVQSGDTAVMVTAVSKTKPSPSQFMPLVVDYRQKAAAAGRIPTNY 
LRREIGTSDKEILTSRIIDRSIRPLFPAGYFYDTQVLCNLLAVDGVNEPDVLAINGAS 
VALSLSDIPWNGPVGAVRIGIIDGEYVVNPTRKEMSSSTLNLVVAGAPKSQIVMLEAS 
AENILQQDFCHAIKVGVKYTQQIIQGIQQLVKETGVTKRTPQKLFTPSPEIVKYTHKL 
AMERLYAVFTDYEHDKVSRDEAVNKIRLDTEEQLKEKFPEADPYEIIESFNVVAKEVF 
RSIVLNEYKRCDGRDLTSLRNVSCEVDMFKTLHGSALFQRGQTQVLCTVTFDSLESGI 
KSDQVITAINGIKDKNFMLHYEFPPYATNEIGKVTGLNRRELGHGALAEKALYPVIPR 
DFPFTIRVTSEVLESNGSSSMASACGGSLALMDSGVPISSAVAGVAIGLVTKTDPEKG 
EIEDYRLLTDILGIEDYNGDMDFKIAGTNKGITALQADIKLPGIPIKIVMEAIQQASV 
AKKEILQIMNKTISKPRASRKENGPVVETVQVPLSKRAKFVGPGGYNLKKLQAETGVT 
ISQVDEETFSVFAPTPSVMHEARDFITEICKDDQEQQLEFGAVYTATITEIRDTGVMV 
KLYPNMTAVLLHNTQLDNERLNILLP 
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Further analysis of the NOV 14a protein yielded the following properties shown in 
Table 14B. 



Table 14B. Protein Sequence Properties NOV14a 


PSort analysis: 


0.5016 probability located in mitochondrial matrix space; 0.4500 probability 
located in cytoplasm; 0.2212 probability located in mitochondrial inner membrane; 
0.2212 probability located in mitochondrial intermembrane space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 14a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 14C. 



Table 14C. Geneseq Results for NOV14a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV14a Residues; Match Residues | Identities; Similarities for the Matched Region | Expect Value
AAB08732 | Amino acid sequence of a human OLD-35 polypeptide - Homo sapiens, 705 aa. [WO200046391-A2, 10-AUG-2000] | 18..722; 1..705 | 701/705 (99%); 705/705 (99%) | 0.0
AAB92684 | Human protein sequence SEQ ID NO: 11065 - Homo sapiens, 504 aa. [EP1074617-A2, 07-FEB-2001] | 1..467; 1..467 | 466/467 (99%); 467/467 (99%) | 0.0
ABB58546 | Drosophila melanogaster polypeptide SEQ ID NO 2430 - Drosophila melanogaster, 748 aa. [WO200171042-A2, 27-SEP-2001] | 36..717; 29..693 | 363/688 (52%); 485/688 (69%) | 0.0
ABG17275 | Novel human diagnostic protein #17266 - Homo sapiens, 899 aa. [WO200175067-A2, 11-OCT-2001] | 190..583; 519..894 | 335/394 (85%); 349/394 (88%) | 0.0
ABG08546 | Novel human diagnostic protein #8537 - Homo sapiens, 899 aa. [WO200175067-A2, 11-OCT-2001] | 190..583; 519..894 | 335/394 (85%); 349/394 (88%) | 0.0



In a BLAST search of public sequence databases, the NOV14a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 14D. 
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Table 14D. Public BLASTP Results for NOV14a 


Protein Accession Number | Protein/Organism/Length | NOV14a Residues; Match Residues | Identities; Similarities for the Matched Portion | Expect Value
Q9DC52 | 1200003F12RIK PROTEIN - Mus musculus (Mouse), 540 aa. | 1..534; 1..534 | 478/534 (89%); 506/534 (94%) | 0.0
Q96T05 | CDNA FLJ14531 FIS, CLONE NT2RM2000371, WEAKLY SIMILAR TO POLYRIBONUCLEOTIDE NUCLEOTIDYLTRANSFERASE (EC 2.7.7.8) - Homo sapiens (Human), 504 aa. | 1..467 | 467/467 (99%) | 
Q95RX7 | LD03255P - Drosophila melanogaster (Fruit fly), 720 aa. | 56..717; 1..665 | 373/665 (56%); 494/665 (74%) | 0.0
Q9V9X7 | CG11337 PROTEIN - Drosophila melanogaster (Fruit fly), 748 aa. | 36..717; 29..693 | 363/688 (52%); 485/688 (69%) | 0.0
Q9S7G6 | POLYNUCLEOTIDE PHOSPHORYLASE - Arabidopsis thaliana (Mouse-ear cress), 991 aa. | 50..715; 58..713 | 278/668 (41%); 417/668 (61%) | e-148



PFam analysis predicts that the NOV14a protein contains the domains shown in 
the Table 14E. 



Table 14E. Domain Analysis of NOV14a 


Pfam Domain | NOV14a Match Region | Identities; Similarities for the Matched Region | Expect Value
RNase_PH | 48..251 | 68/255 (27%); 147/255 (58%) | 1.1e-37
RNase_PH | 358..581 | 75/261 (29%); 175/261 (67%) | 4e-55
KH-domain | 609..651 | 14/49 (29%); 31/49 (63%) | 0.35
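
The match regions in Table 14E are written as inclusive `start..end` residue ranges. As a minimal sketch, assuming 1-based inclusive coordinates, the ranges can be parsed to confirm that the three predicted domains fall within the 722 aa NOV14a sequence and do not overlap. The helper names below are hypothetical and are not taken from the Pfam software.

```python
from typing import List, Tuple


def parse_range(text: str) -> Tuple[int, int]:
    """Parse a 'start..end' residue range (assumed 1-based, inclusive)."""
    start, end = text.split("..")
    return int(start), int(end)


def span_length(text: str) -> int:
    """Number of residues covered by a 'start..end' range."""
    start, end = parse_range(text)
    return end - start + 1


def overlaps(ranges: List[Tuple[int, int]]) -> bool:
    """True if any two ranges share at least one residue."""
    ordered = sorted(ranges)
    return any(prev[1] >= nxt[0] for prev, nxt in zip(ordered, ordered[1:]))


PROTEIN_LENGTH = 722  # NOV14a, SEQ ID NO: 36
DOMAINS = [
    ("RNase_PH", "48..251"),
    ("RNase_PH", "358..581"),
    ("KH-domain", "609..651"),
]

for name, region in DOMAINS:
    print(name, region, span_length(region))

parsed = [parse_range(region) for _, region in DOMAINS]
print("overlap:", overlaps(parsed))
print("within protein:", all(end <= PROTEIN_LENGTH for _, end in parsed))
```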



Example 15. 



The NOV15 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 15A. 
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Table 15A. NOV15 Sequence Analysis 




SEQ ID NO: 37 


2030 bp 


NOV 15a, 
CG100872-01 DNA Sequence 


TGTGTTGGGCACAGAAATCTGCTTGGATCTCACCAGGTGTGCCCCCCAGAAGTGCCAG 
TGCTTCACCATCCATGGCTCTGGGAGGGTCTTGATCGATGTGGCCAACACGGTGATTT 
CTGAGAAGGAGGACGCCACCATCTGGTGGCCCCTGTCTGATCCCACGTACGCCACAGT 
GAAGATGACATCGCCCAGCCCTTCCGTGGATGCGGATAAGGTAAGCCTCACATACTAT 
GGGCCCAACGAGGATGCCCCCGTGGGCACAGCTGTGCTGTACCTCACTGGCATTGGTG 
AGTGTTGCTCCAACTGGGAGCCTCCCAACCGCTGGGACCCCAGATCCATTTTTTCTCA 
GAAAAAATGGATCTGGGGTCCCAGCGGTTGGGGTGCCATCCTGCTTGTGAATTGCAAC 
CCTGCTGATGTGGGCCAGCAACTTGAGGACAAGAAAACCAAGAAAGTGATCTTTTCAG 
AGGAAATAACGAATCTGTCCCAGATGACTCTGAATGTCCAAGGCCCCAGCTGTATCTT 
AAAGAAATATCGGCTAGTCCTCCATACCTCCAAGGAAGAGTCGAAGAAGGCGAGAGTC 
TACTGGCCCCAAAGTGAAGACAACTCCAGTACCTTTGAGTTGGTGCTGGGGCCCGACC 
AGCACGCCTATACCTTGGCCCTCCTCGGGAACCACTTGAAGGAGACTTTCTACGTTGA 
AGCTATAGCATTCCCATCTGCCGAATTCTCAGGCCTCATCTCCTACTCTGTGTCCCTG 
GTGGAGGAGTCTTTGCAGTCAATTCCAGAGACTGTGCTGTACAAAGACACGGTGGTGT 




CCTTCTGCTACACCCAGGCTCCCCACAAGACAACGTCCTTGATCCTCGACACACCTCA 

GGCCGCCGATCTCGATGAGTTCCCCATGAAGTACTCACTGGTGCCTGGTATTGGCTAC 

ATGATCCAGGACACTGAGGACCATAAAGTGGCCAGCATGGATTCCATTGGGAACCTGA 

TGGTGTCCCCACCTGTCAAGGTCCAAGGGAAAGAGTACCCGCTGGGCAGAGTCCTCAT 

TGGCAGCAGCTTTTACCCCTCTAGCGCAGAGGGCCGGGCCATGAGTAAGACCCTCCGA 

GACTTCCTCTATGCCCAGCAGGTCCAAGCGCCGGTGGAGCTCTACTCAGATTr L , ; TPJ 

TGACTGGCCACGTGGATGAC-' ; -TCATGTGCT'rCATCCCCACAGATGACAAGCAGGGCTT 

CCTGCTGCTCCTGGCCAGCCCCAGTGCCTGCTATAAACTGTTCCGAGAGAAACAGAAG 

GAAGGCTATGGCGACGCTCTTCTGTTTGATGAGCTTAGAGCAGATCAGCTCCTGTCTA 

ATGGTAAGGGAACTCCCTTTCGCCCCCCCCCCCACCCACCCACCCACCCACAGAAGTG 

CATTCACCTGAACCGTGACATCCTGAAGACGGAGCTGGGCCTGGTGGAArAG - 

ATCGAGATTCCC " - n -"T'-Ti_-CITGGAGAAGCTGACTAACATCCCCTCTGACCAGC 

AGCCCAAGTTGCGGATGATTGTGATGGGCAAGAACCTGGGGATCCCCAAGCCTTTTGG 

GCCCCAAATCAAGGGGACCTGCTGCCTGGAAGAAAAGATTTGCTGCTTGCTGGAGCCC 

CTGGGCTTCAAGTGCACCTTCATCAATGACTTTGACTGTTACCTGACAGAGGTCGGAG 

ACATCTGTGCCTGTGCCAACATCCGCCGGGTGCCCTTTGCCTTCAAATGGTGGAAGAT 

GGTACCTTAGACCCAGGCCCTGGAGCTGCCAGCTCTGCCCCAGCGTGGATGGCCCACT 




ORF Start: ATG at 3 


ORF Stop: TAG at 1980 




SEQ ID NO: 38 


659 aa | MW at 73697.2kD 


NOV 15a, 
CG100872-01 Protein Sequence 


MSFQSIIHLSLDSPVHAVCVLGTEICLDLTRCAPQKCQCFTIHGSGRVLIDVANTVIS 
EKEDATIWWPLSDPTYATVKMTSPSPSVDADKVSLTYYGPNEDAPVGTAVLYLTGIGE 
CCSNWEPPNRWDPRSIFSQKKWIWGPSGWGAILLVNCNPADVGQQLEDKKTKKVIFSE 
EITNLSQMTLNVQGPSCILKKYRLVLHTSKEESKKARVYWPQSEDNSSTFELVLGPDQ 
HAYTLALLGNHLKETFYVEAIAFPSAEFSGLISYSVSLVEESLQSIPETVLYKDTVVF 
RVAPCVFIPCTQVPLETVYLCYLSWVLRVEQEIiAGRKGVLTVPFPANNLSSSPFQDEMA 
FCYTQAPHKTTSLILDTPQAADLDEFPMKYSLVPGIGYMIQDTEDHKVASMDSIGNLM 
VSPPVKVQGKEYPLGRVLIGSSFYPSSAEGRAMSKTLRDFLYAQQVQAPVELYSDWLM 
TGHVDEFMCFIPTDDKQGFLLLLASPSACYKLFREKQKEGYGDALLFDELRAEQLLSN 
GKGTPFRPPPHPPTHPQKCIHLKRDILKTELGLVEQDIIEIPQLFCLEKLTNIPSDQQ 
PKLRMIVMGKNLGIPKPFGPQIKGTCCLEEKICCLLEPLGFKCTFINDFDCYLTEVGD 
ICACANIRRVPFAFKWWKMVP 



Further analysis of the NOV15a protein yielded the following properties shown in 
Table 15B. 
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Table 15B. Protein Sequence Properties NOV15a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3600 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0598 probability located in microbody (peroxisome) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV15a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 15C. 



Table 15C. Geneseq Results for NOV15a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV15a Residues; Match Residues | Identities; Similarities for the Matched Region | Expect Value
AAE06034 | Mouse oocyte protein 5 (MOP5) - Mus musculus, 664 aa. [WO200153339-A2, 26-JUL-2001] | 20..659; 1..664 | 386/672 (57%); 492/672 (72%) | 0.0
AAB41584 | Human ORFX ORF1348 polypeptide sequence SEQ ID NO:2696 - Homo sapiens, 663 aa. [WO200058473-A2, 05-OCT-2000] | 1..659; 1..663 | 289/666 (43%); 411/666 (61%) | e-155
ABB57299 | Mouse ischaemic condition related protein sequence SEQ ID NO:839 - Mus musculus, 673 aa. [WO200188188-A2, 22-NOV-2001] | 7..659; 15..673 | 268/667 (40%); 379/667 (56%) | e-132
ABG26518 | Novel human diagnostic protein #26509 - Homo sapiens, 761 aa. [WO200175067-A2, 11-OCT-2001] | 31..632; 36..642 | 263/620 (42%); 361/620 (57%) | e-122
ABG26518 | Novel human diagnostic protein #26509 - Homo sapiens, 761 aa. [WO200175067-A2, 11-OCT-2001] | 31..632; 36..642 | 263/620 (42%); 361/620 (57%) | e-122



In a BLAST search of public sequence databases, the NOV15a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 15D. 
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Table 15D. Public BLASTP Results forNOV15a 


Protein Accession Number | Protein/Organism/Length | NOV15a Residues; Match Residues | Identities; Similarities for the Matched Portion | Expect Value
Q9UM07 | Protein-arginine deiminase type V (EC 3.5.3.15) (Peptidylarginine deiminase V) (HL-60 PAD) - Homo sapiens (Human), 663 aa. | 1..659; 1..663 | 289/666 (43%); 411/666 (61%) | e-154
O88807 | Protein-arginine deiminase type IV (EC 3.5.3.15) (Peptidylarginine deiminase IV) (PAD-R4) (Peptidylarginine deiminase type alpha) - Rattus norvegicus (Rat), 666 aa. | 1..659; 1..666 | 297/675 (44%); 404/675 (59%) | e-148
P70708 | Protein-arginine deiminase type III (EC 3.5.3.15) (Peptidylarginine deiminase III) - Rattus norvegicus (Rat), 664 aa. | 1..659; 1..664 | 290/677 (42%); 408/677 (59%) | e-145
Q9Z184 | Protein-arginine deiminase type III (EC 3.5.3.15) (Peptidylarginine deiminase III) - Mus musculus (Mouse), 664 aa. | 1..659; 1..664 | 288/677 (42%); 409/677 (59%) | e-144
O57400 | PEPTIDYLARGININE DEIMINASE - Gallus gallus (Chicken), 672 aa. | 1..659; 1..672 | 286/682 (41%); 399/682 (57%) | e-143



PFam analysis predicts that the NOV15a protein contains the domains shown in 
the Table 15E. 



Table 15E. Domain Analysis of NOV15a 


Pfam Domain | NOV15a Match Region | Identities; Similarities for the Matched Region | Expect Value
PAD | 1..659 | 311/702 (44%); 446/702 (64%) | 4.7e-222



Example 16. 



The NOV 16 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 16A. 
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Table 16A. NOV16 Sequence Analysis 




SEQ ID NO: 39 | 2018 bp 


NOV 16a, 
CG100980-01 DNA Sequence 


GATGGCCCCAAAGAGAGTTGTGCAGCTGTCCCTGAAGATGCCTACCCATGCCGTGTGT 
GTGGTGGGAGTCGAGGCACATGTGGACATTCACAGTGATGTGCCCAAGGGTGCCAACA 




GTATCTGTGGGCACAGCCAGTAAGGAATTAAAGGACTTCAAGGTAAGAGTCTCCTACT 
TTGGGGAGCAGGAAGACCAAGCTCTGGGCCGCAGCGTGCTTTACCTCACTGGCGTCCT 




CAGAAAACCTGGCGCTGGGGCCCTGAGGGCTATGGGGCTATCTTGCTGGTGAACTGTG 
ACCGGGACAATCACAGGTCCGCAGAGCCTGACCTCACCCACAGCTGGCTGATGTCGCT 
GGCTGACCTGCAGGACATGTCCCCAATGCTGCTGAGCTGCAATGGCCCCGACAAGCTC 
TTCGACAGCCACAAGCTTGTCTTGAACGTGCCCTTTTCTGATTCCAAAAGAGTGAGGG 
TCTTCTGTGCCAGGGGTCCTGAGGATGTGTGTGAGGCCTATAGGCATGTGCTGGGCCA 
AAATAAGGTGTCCTATGAGGTACCCCGCTTGCATGGGGATGAGGAGCGCTTCTTCGTG 
GAAGGCCTGTCCTTCCCTGATGCCGGCTTCACAGGACTCATCTCCTTCCATGTCACTC 
TGCTGGACGACTCCAACGAGGATTTCTCGGCATCCCCTATCTTCACTGACACTGTGGT 
GTTCCGAGTGGCACCCTGGATCATGACGCCCAGCACTCTGCCACCCCTAGAGGTGTAT 
GTGTGCCGGGTGAGGAACAACACGTGTTTTGTGGATGCGGTGGCAGAGCTGGCCAGGA 
AGGCCGGCTGCAAGCTGACCATCTGCCCACAGGCCGAGAACCGCAACGACCGCTGGAT 
CCAGGATGAGATGGAGCTGGGCTACGTTCAGGCGCCGCACAAGACCCTCCCGGTGGTC 
TTTGACTCCCCAAGGAATGGGGAACTGCAGGATTTCCCTTACAAAAGAATCCTGGGTC 
CAGATTTTGGTTACGTGACTCGGGAACCACGCGACAGGTCTGTGAGTGGCCTGGACTC 
CTTTGGGAACCTGGAGGTCAGCCCTCCAGTGGTGGCCAATGGGAAAGAGTACCCCCTG 
GGGAGGATCCTCATTGGGGGCAACCTGCCTGGGTCAAGTGGCCGCAGGGTCACCCAGG 
TGGTGCGGGACTTCCTCCATGCCCAGAAGGTGCAGCCCCCCGTGGAGCTCTTTGTGGA 
CTGGTTGGCCGTGGGCCATGTGGATGAGTTTCTGAGCTTTGTCCCTGCCCCCGATGGG 
AAGGGCTTCCGGATGCTCCTGGCCAGCCCTGGGGCCTGCTTCAAGCTCTTCCAGGAAA 
AGCAGAAGTGTGGCCACGGGAGGGCCCTCCTGTTCCAGGGGGTTGTTGGTGATGAGCA 
GGTCAAGACCATCTCCATCAACCAGGTGCTCTCCAATAAAGACCTCATCAACTACAAT 
AAGTTTGTGCAGAGCTGCATTGACTGGAACCGTGAGGTGCTGAAGCGGGAGCTGGGCC 
TGGCAGAGTGTGACATCATTGACATCCCACAGCTCTTCAAGACCGAGAGGAAAAAAGC 
AACGGCCTTCTTCCCTGACTTGGTGAACATGCTGGTGCTGGGGAAGCACCTGGGCATC 
CCCAAGCCCTTTGGGCCCATCATCAATGGCTGCTGCTGCCTGGAGGAGAAGGTGCGGT 
CCCTGCTGGAGCCGCTGGGCCTCCACTGCACCTTCATTGATGACTTCACTCCATACCA 
CATGCTGCATGGGGAGGTGCACTGTGGCACCAATGTGTGCAGAAAGCCCTTCTCTTTC 
AAGTGGTGGAACATGGTGCCCTGAGACAGCTCCCACCCACCATCCT 




ORF Start: ATG at 2 


ORF Stop: TGA at 1994 




SEQ ID NO: 40 


664 aa | MW at 74356.9kD 


NOV 16a, 
CG100980-01 Protein Sequence 


MAPKRVVQLSLKMPTHAVCVVGVEAHVDIHSDVPKGANSFRVSGSSGVEVFMVYNRTR 
VKEPIGKARWPLDTDADMWSVGTASKELKDFKVRVSYFGEQEDQALGRSVLYLTGVI, 
EPLPGASWVPLIPRHLILPQKTWRWGPEGYGAILLVNCDRDNHRSAEPDLTHSWLMSL 
ADLQDMSPMLLSCNGPDKLFDSHKLVLNVPFSDSKRVRVFCARGPEDVCEAYRHVLGQ 
NKVSYEVPRLHGDEERFFVEGLSFPDAGFTGLISFHVTLLDDSNEDFSASPIFTDTVV 
FRVAPWIMTPSTLPPLEVYVCRVRNNTCFVDAVAELARKAGCKLTICPQAENRNDRWI 
QDEMELGYVQAPHKTLPVVFDSPRNGELQDFPYKRILGPDFGYVTREPRDRSVSGLDS 
FGNLEVSPPVVANGKEYPLGRILIGGNLPGSSGRRVTQVVRDFLHAQKVQPPVELFVD 
WLAVGHVDEFLSFVPAPDGKGFRMLLASPGACFKLFQEKQKCGHGRALLFQGVVGDEQ 
VKTISINQVLSNKDLINYNKFVQSCIDWNREVLKRELGLAECDIIDIPQLFKTERKKA 
TAFFPDLVNMLVLGKHLGIPKPFGPIINGCCCLEEKVRSLLEPLGLHCTFIDDFTPYH 
MLHGEVHCGTNVCRKPFSFKWWNMVP 
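
The relationship between the DNA sequence and the encoded polypeptide in Table 16A can be illustrated by translating the first codons of the ORF. The sketch below assumes the standard genetic code and a 1-based ORF start position; the function name `translate` and the handling of unrecognized codons (reported as `X`) are illustrative choices, not part of the original disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, start: int = 1) -> str:
    """Translate `dna` from the 1-based position `start` up to the first stop.

    Codons containing ambiguity codes or OCR damage are reported as 'X'.
    """
    seq = dna.upper()[start - 1:]
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 28 bases of the NOV16a DNA sequence (SEQ ID NO: 39); ORF start: ATG at 2
fragment = "GATGGCCCCAAAGAGAGTTGTGCAGCTG"
print(translate(fragment, start=2))  # prints MAPKRVVQL
```

The same routine can be applied to any other fragment of the listed DNA sequences, provided the fragment starts in frame.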



Further analysis of the NOV 16a protein yielded the following properties shown in 
Table 16B. 
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Table 16B. Protein Sequence Properties NOV16a 


PSort analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.2424 
probability located in lysosome (lumen); 0.1410 probability located in microbody 
(peroxisome); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 16a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 16C. 



Table 16C. Geneseq Results for NOV16a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV16a Residues; Match Residues | Identities; Similarities for the Matched Region | Expect Value
ABG26518 | Novel human diagnostic protein #26509 - Homo sapiens, 761 aa. [WO200175067-A2, 11-OCT-2001] | 32..645; 37..650 | 591/614 (96%); 595/614 (96%) | 0.0
ABG26518 | Novel human diagnostic protein #26509 - Homo sapiens, 761 aa. [WO200175067-A2, 11-OCT-2001] | 32..645; 37..650 | 591/614 (96%); 595/614 (96%) | 0.0
AAB41584 | Human ORFX ORF1348 polypeptide sequence SEQ ID NO:2696 - Homo sapiens, 663 aa. [WO200058473-A2, 05-OCT-2000] | 1..664; 1..663 | 377/671 (56%); 468/671 (69%) | 0.0
ABB57299 | Mouse ischaemic condition related protein sequence SEQ ID NO:839 - Mus musculus, 673 aa. [WO200188188-A2, 22-NOV-2001] | 1..664; 9..673 | 352/670 (52%); 457/670 (67%) | 0.0
AAE06034 | Mouse oocyte protein 5 (MOP5) - Mus musculus, 664 aa. [WO200153339-A2, 26-JUL-2001] | 20..664; 1..664 | 280/675 (41%); 402/675 (59%) | e-136



In a BLAST search of public sequence databases, the NOV16a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 16D. 
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Table 16D. Public BLASTP Results for NOV16a 


Protein Accession Number | Protein/Organism/Length | NOV16a Residues; Match Residues | Identities; Similarities for the Matched Portion | Expect Value
Q9ULW8 | Protein-arginine deiminase type III (EC 3.5.3.15) (Peptidylarginine deiminase III) - Homo sapiens (Human), 664 aa. | 1..664; 1..664 | 531/664 (79%); 565/664 (84%) | 0.0
Q9Z184 | Protein-arginine deiminase type III (EC 3.5.3.15) (Peptidylarginine deiminase III) - Mus musculus (Mouse), 664 aa. | 1..664; 1..664 | 484/664 (72%); 552/664 (82%) | 0.0
P70708 | Protein-arginine deiminase type III (EC 3.5.3.15) (Peptidylarginine deiminase III) - Rattus norvegicus (Rat), 664 aa. | 1..664; 1..664 | 481/664 (72%); 546/664 (81%) | 0.0
O02849 | Protein-arginine deiminase type III (EC 3.5.3.15) (Peptidylarginine deiminase III) - Ovis aries (Sheep), 664 aa. | 1..664; 1..664 | 485/664 (73%); 539/664 (81%) | 0.0
Q9ULC6 | Protein-arginine deiminase type I (EC 3.5.3.15) (Peptidylarginine deiminase I) - Homo sapiens (Human), 663 aa. | 1..664; 1..663 | 487/669 (72%); 544/669 (80%) | 0.0



PFam analysis predicts that the NOV 16a protein contains the domains shown in 
the Table 16E. 



Table 16E. Domain Analysis of NOV16a 


Pfam Domain | NOV16a Match Region | Identities; Similarities for the Matched Region | Expect Value
PAD | 1..664 | 465/691 (67%); 623/691 (90%) | 0



Example 17. 



The NOV 17 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 17A. 
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Table 17A. NOV17 Sequence Analysis 




SEQ ID NO: 41 | 1027 bp 


NOV17a, 
CG172805-01 DNA Sequence 



GTCGACGCGGCCGCGGGCGGCGGAGATGGCGGCGGGGATAGTGGCTTCTCGCAGACTO 
CGCGACCTACTGACCCGGCGACTGACAGGCTCCAACTACCCGGGACTCAGTATTAGCC 
TTCGCCTCACTGGCTCCTCTGCACAAGAGGCGGCTTCCGGAGTAGCCCTCGGTGAAGC 
CCCAGACCACAGCTATGAGTCCCTTCGTGTGACGTCTGCGCAGAAACATGTTCTGCAT 
GTCCAGCTCAACCGGCCCAACAAGAGGAATGCCATGAACAAGGTCTTCTGGAGAGAGA 
TGGTAGAGTGCTTCAACAAGATTTCGAGAGACGCTGACTGTCGGGCGGTGGTGATCTC 
TGGTGCAGGAAAAATGTTCACTGCAGGTATTGACCTGATGGACATGGCTTCGGACATC 
CTGCAGCCCAAAGGAGATGATGTGGCCCGGATCAGCTGGTACCTCCGTGACATCATCA 
CTCGATACCAGGAGACCTTCAACGTCATCGAGAGGTGCCCCAAGCCCGTGATTGCTGC 
CGTCCATGGGGGCTGCATTGGCGGAGGTGTGGACCTTGTCACCGCCTGTGACATCCGG 
TACTGTGCCCAGGATGCTTTCTTCCAGGTGAAGGAGGTGGACGTGGGTTTGGCTGCCG 
ATGTAGGAACACTGCAGCGCCTGCCCAAGGTCATCGGGAACCAGAGCCTGGTCAACGA 
GCTGGCCTTCACCGCCCGCAAGATGATGGCTGACGAGGCCCTGGGCAGTGGGCTGGTC 
AGCCGGGTGTTCCCAGACAAAGAGGTCATGCTGGATGCTGCCTTAGCGCTGGCGGCCG 




CCGCGACCATTCGGTGGCCGAGAGCCTCAACTACGTGGCGTCCTGGAACATGAGCATG 
CTGCAGACCCAAGACCTCGTGAAGTCGGTCCAGGCCACGACTGAGAACAAGGAACTGA 
AAACCGTCACCTTCTCCAAGCTCTGAGAGCCCTCGCGTCCC 




ORF Start: ATG at 26 


ORF Stop: TGA at 1010 




SEQ ID NO: 42 


328 aa | MW at 35757.7kD 


NOV17a, 
CG172805-01 Protein Sequence 


MAAGIVASRRLRDLLTRRLTGSNYPGLSISLRLTGSSAQEAASGVALGEAPDHSYESL 
RVTSAQKHVLHVQLNRPNKRNAMNKVFWREMVECFNKISRDADCRAVVISGAGKMFTA 
GIDLMDMASDILQPKGDDVARISWYLRDIITRYQETFNVIERCPKPVIAAVHGGCIGG 
GVDLVTACDIRYCAQDAFFQVKEVDVGLAADVGTLQRLPKVIGNQSLVNELAFTARKM 
MADEALGSGLVSRVFPDKEVMLDAALALAAEISSKSPVAVQSTK\T-ILLYSRDHSVAES 
LNYVASWNMSMLQTQDLVKSVQATTENKELKTVTFSKL 
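
A rough plausibility check on the lengths and molecular weights reported in Tables 12A through 17A is the mean mass per residue, which for typical proteins is on the order of 110 Da. The sketch below computes that ratio from the reported figures; the 110 Da rule of thumb is an outside assumption, and the dictionary and function names are illustrative.

```python
# (length in aa, reported molecular weight) as listed in Tables 12A-17A
REPORTED = {
    "NOV12a": (585, 65861.1),
    "NOV13a": (540, 59197.4),
    "NOV14a": (722, 79307.2),
    "NOV15a": (659, 73697.2),
    "NOV16a": (664, 74356.9),
    "NOV17a": (328, 35757.7),
}


def mean_residue_mass(length: int, mass: float) -> float:
    """Reported mass divided by the number of residues."""
    if length <= 0:
        raise ValueError("length must be positive")
    return mass / length


for name, (length, mass) in REPORTED.items():
    print(f"{name}: {mean_residue_mass(length, mass):.1f} per residue")
```

All six ratios fall between about 109 and 113, which is what would be expected if the tabulated molecular weights are expressed in daltons.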



Further analysis of the NOV 17a protein yielded the following properties shown in 
Table 17B. 



Table 17B. Protein Sequence Properties NOV17a 


PSort analysis: 


0.9200 probability located in mitochondrial matrix space; 0.8000 probability 
located in microbody (peroxisome); 0.6000 probability located in mitochondrial 
inner membrane; 0.6000 probability located in mitochondrial intermembrane space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 17a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 17C. 
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Table 17C. Geneseq Results for NOV17a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV17a Residues; Match Residues | Identities; Similarities for the Matched Region | Expect Value
AAB43664 | Human cancer associated protein sequence SEQ ID NO: 1109 - Homo sapiens, 300 aa. [WO200055350-A1, 21-SEP-2000] | 2..276; 5..278 | 265/275 (96%); 268/275 (97%) | e-147
AAM24501 | Colon tumour related predicted amino acid sequence for CT632 - Homo sapiens, 167 aa. [WO200149716-A2, 12-JUL-2001] | 46..212; 1..167 | 166/167 (99%); 166/167 (99%) | 1e-93
AAB11904 | Human colon tumour protein CT632, SEQ ID NO:204 - Homo sapiens, 167 aa. [WO200037643-A2, 29-JUN-2000] | 46..212; 1..167 | 166/167 (99%); 166/167 (99%) | 1e-93
ABB61607 | Drosophila melanogaster polypeptide SEQ ID NO 11613 - Drosophila melanogaster, 289 aa. [WO200171042-A2, 27-SEP-2001] | 51..328; 13..289 | 145/281 (51%); 199/281 (70%) | 1e-73
AAG00231 | Human secreted protein, SEQ ID NO: 4312 - Homo sapiens, 118 aa. [EP1033401-A2, 06-SEP-2000] | 1..118; 1..118 | 118/118 (100%); 118/118 (100%) | 2e-61



In a BLAST search of public sequence databases, the NOV17a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 17D. 

Table 17D. Public BLASTP Results for NOV17a 

Protein Accession Number | Protein/Organism/Length | NOV17a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value 
Q8WVX0 | HYPOTHETICAL 35.8 KDA PROTEIN - Homo sapiens (Human), 328 aa. | 1..328 / 1..328 | 328/328 (100%) / 328/328 (100%) | 0.0 
Q96EZ9 | UNKNOWN (PROTEIN FOR MGC:19561) - Homo sapiens (Human), 328 aa. | 1..328 / 1..328 | 327/328 (99%) / 327/328 (99%) | 0.0 
Q13011 | Delta3,5-delta2,4-dienoyl-CoA isomerase, mitochondrial precursor (EC 5.3.3.-) - Homo sapiens (Human), 328 aa. | 1..328 / 1..328 | 318/328 (96%) / 320/328 (96%) | e-178 
Q62651 | Delta3,5-delta2,4-dienoyl-CoA isomerase, mitochondrial precursor (EC 5.3.3.-) - Rattus norvegicus (Rat), 327 aa. | 1..328 / 1..327 | 248/328 (75%) / 289/328 (87%) | e-143 
A57626 | peroxisomal enoyl hydratase-like protein - rat, 327 aa. | 1..328 / 1..327 | 247/328 (75%) / 288/328 (87%) | e-142 



PFam analysis predicts that the NOV17a protein contains the domains shown in 
Table 17E. 



Table 17E. Domain Analysis of NOV17a 

Pfam Domain | NOV17a Match Region | Identities/Similarities for the Matched Region | Expect Value 
ECH | 68..248 | 66/185 (36%) / 132/185 (71%) | 1e-41 



Example 18. 



The NOV18 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 18A. 

Table 18A. NOV18 Sequence Analysis 




SEQ ID NO: 43 


1067 bp 


NOV18a, 
CG56763-01 DNA Sequence 


CCCTTTCCTCTTGCTCTTTGATGTTTTGTAGGCCTGCAGCTCCCAAGCACAGAGGCAT 


CCAGGGCTGCAGTACCTGCTCTTCCTCCTCTTCCTGCTCACCTACCTCTTTGTCCTGG 
TGGAGAACCTCGCCATCATCCTCACCGTCTGGAGCAGCACCTCCCTCCACAGGCCCAT 
GTACTACTTTCTGAGCTCCATGTCTTTCCTAGAGATCTGGTACGTGTCTGACATCACC 




ACCCCGGGGCTGTGCCTCCAGCTGGTGGGCTTCTCCTTTGTGAGTGGCTTCACCATCT 
CCATGATCAAGGTCTGTTTTATCTCCAGCGTCACGTTCTGTGGCTCCAACGTCTTGAA 
CCACTTCTTCTGTGACATTTCCCCCATCCTCAAGCTGGCCTGCACGGACTTCTCCACT 
GCAGAGCTGGTGGATTTCATTCTGGCCTTCATCATCCTGGTGTTTCCACTCCTGGCCA 
CCATGCTGTCATATGCGCACATCACCCTGGCTGTCCTGCGCATCCCCTCGGCCACCGG 
CTGCTGGAGAGCCTTCTTCACCTGCGCCTCTCACCTCACCGTGGTCACCGTCTTCTAT 
ACAGCCTTGCTTTTCATGTATGTCCGGCCCCAGGCCATTGATTCCCGGAGCTCCAACA 
AGCTCATCTCTGTTTTGTACACAGTTATCACCCCCATCTTGAACCCCTTGATATACTG 
CCTGAGGAATAAGGAATTTAAGAATGCCTTGAAAAAAGCCTTCGGCTTGACGAGCTGC 
GCCGTAGAGGGGAGGCTTTCTAGTCTTCTGGAACTTCATCTCCAAATACACAGCCAGC 
CTCTCTGAGGAGGCCATTTGACT 




ORF Start: ATG at 21 


ORF Stop: TGA at 1050 




SEQ ID NO: 44 


343 aa | MW at 38584.5kD 


NOV18a, 
CG56763-01 Protein Sequence 


MFCRPAAPKHRGMSGENVTRVGTFILVGFPTAPGLQYLLFLLFLLTYLFVLVENLAII 
LTVWSSTSLHRPMYYFLSSMSFLEIWYVSDITPKMLEGFLLQQKRISFVGCMTQLYFF 
SSLVCTECVLLASMAYDRYVAICHPLRYHVLVTPGLCLQLVGFSFVSGFTISMIKVCF 
ISSVTFCGSNVLNHFFCDISPILKLACTDFSTAELVDFILAFIILVFPLLATMLSYAH 
ITLAVLRIPSATGCWRAFFTCASHLTVVTVFYTALLFMYVRPQAIDSRSSNKLISVLY 
TVITPILNPLIYCLRNKEFKNALKKAFGLTSCAVEGRLSSLLELHLQIHSQPL 
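
The "MW" figures in the sequence tables have the magnitude of an average molecular mass expressed in daltons (a 343-residue protein of roughly 38.6 kDa), although the tables label the unit "kD". A minimal sketch of how such a value may be estimated from a protein sequence follows. It assumes standard average residue masses plus one water molecule per chain; the source does not say which tool or mass table produced its figures, so small differences from the printed values should be expected. The names `AVG_RESIDUE_MASS`, `WATER_MASS` and `average_mass` are hypothetical.

```python
# Average residue masses in daltons (standard textbook values; an assumption,
# since the source does not name the mass table it used).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added for the free N- and C-termini


def average_mass(protein: str) -> float:
    """Estimate the average molecular mass (Da) of an unmodified peptide chain."""
    seq = "".join(protein.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = set(seq) - set(AVG_RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(AVG_RESIDUE_MASS[aa] for aa in seq) + WATER_MASS


# First ten residues of the NOV18a protein sequence, as a usage example.
print(round(average_mass("MFCRPAAPKH"), 1))
```

The function rejects any character outside the twenty standard one-letter codes, which also makes it a quick way to spot residual extraction errors in a listed sequence.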



Further analysis of the NOV18a protein yielded the following properties shown in 
Table 18B. 



Table 18B. Protein Sequence Properties NOV18a 


PSort analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 64 and 65 



A search of the NOV18a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 18C. 

Table 18C. Geneseq Results for NOV18a 

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV18a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value 
AAG71967 | Human olfactory receptor polypeptide, SEQ ID NO: 1648 - Homo sapiens, 319 aa. [WO200127158-A2, 19-APR-2001] | 13..332 / 1..319 | 304/320 (95%) / 308/320 (96%) | e-173 
AAG71962 | Human olfactory receptor polypeptide, SEQ ID NO: 1643 - Homo sapiens, 314 aa. [WO200127158-A2, 19-APR-2001] | 13..319 / 1..307 | 289/307 (94%) / 299/307 (97%) | e-167 
ABG16907 | Novel human diagnostic protein #16898 - Homo sapiens, 291 aa. [WO200175067-A2, 11-OCT-2001] | 5..295 / 20..289 | 267/291 (91%) / 267/291 (91%) | e-150 
ABG16907 | Novel human diagnostic protein #16898 - Homo sapiens, 291 aa. [WO200175067-A2, 11-OCT-2001] | 5..295 / 20..289 | 267/291 (91%) / 267/291 (91%) | e-150 
AAG71675 | Human olfactory receptor polypeptide, SEQ ID NO: 1356 - Homo sapiens, 327 aa. [WO200127158-A2, 19-APR-2001] | 15..313 / 6..308 | 177/303 (58%) / 224/303 (73%) | e-100 



In a BLAST search of public sequence databases, the NOV18a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 18D. 

Table 18D. Public BLASTP Results for NOV18a 

Protein Accession Number | Protein/Organism/Length | NOV18a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value 
Q8VGU5 | OLFACTORY RECEPTOR MOR103-2 - Mus musculus (Mouse), 312 aa. | 13..319 / 1..307 | 270/307 (87%) / 287/307 (92%) | e-156 
Q8VGU4 | OLFACTORY RECEPTOR MOR103-3 - Mus musculus (Mouse), 312 aa. | 13..319 / 1..307 | 261/307 (85%) / 284/307 (92%) | e-153 
Q9EPG2 | M51 OLFACTORY RECEPTOR - Mus musculus (Mouse), 314 aa. | 17..322 / 5..310 | 211/306 (68%) / 250/306 (80%) | e-124 
Q8VGU1 | OLFACTORY RECEPTOR MOR103-4 - Mus musculus (Mouse), 314 aa. | 17..322 / 5..310 | 210/306 (68%) / 249/306 (80%) | e-124 
Q9EPV0 | M50 OLFACTORY RECEPTOR (OLFACTORY RECEPTOR M50) (OLFACTORY RECEPTOR MOR103-16) - Mus musculus (Mouse), 316 aa. | 16..315 / 2..301 | 209/300 (69%) / 246/300 (81%) | e-121 



PFam analysis predicts that the NOV18a protein contains the domains shown in 
Table 18E. 



Table 18E. Domain Analysis of NOV18a 

Pfam Domain | NOV18a Match Region | Identities/Similarities for the Matched Region | Expect Value 
7tm_1 | 53..302 | 50/269 (19%) / 181/269 (67%) | 6.5e-27 
ColicinV | 147..310 | 29/197 (15%) / 100/197 (51%) | 0.43 



Example 19. 



The NOV19 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 19A. 

Table 19A. NOV19 Sequence Analysis 



SEQ ID NO: 45 | 1071 bp 


NOV19a, 
CG56777-01 DNA Sequence 


GAAAAGGAGGAAAGACAGGTTAGCTGGGGATGATGACGGATCTGAAGCAAAGCCATTC 


AGTGAGGCTGAATGATGGACCCTTCATGCCAGTGCTGGGATTTGGCACTTATGCTCCT 
GATCATACTCCCAAAAGCCAGGCTGCCGAGGCCACCAAAGTGGCTATTGACGTAGGCT 
TCCGCCATATTGATTCAGCATACTTATACCAAAATGAGGAGGAGGTTGGACAGGCCAT 
TTGGGAGAAGATCGCTGATGGTACCGTCAAGAGAGAGGAAATATTCTACACCATCAAG 
CTTTGGGCTACTTTCTTTCGGGCAGAATTGGTTCACCCGGCCCTAGAAAGGTCACTGA 
AGAAACTTGGACCGGACTATGTAGATCTCTTCATTATTCATGTACCATTTGCTATGAA 
GCCTGGGAAAGAATTACTGCCAAAGGATGCCAGTGGAGAGATTATTTTAGAAACTGTG 
GAGCTTTGTGACACTTGGGAGGCCCTGGAGAAGTGCAAAGAAGCAGGTTTAACCAGGT 




GCTCAAGTACAAGCCCACCTGCAACCAGGTGGAATGTCACCCTTACCTCAACCAGAGC 
AAACTCCTGGAGTTCTGCAAGTCCAAGGACATTGTTCTAGTTGCCTACAGTGCCCTGG 
GATCCCAAAGAGACCCACAGTGGGTGGATCCCGACTGCCCACATCTCTTGGAGGAGCC 
GATCTTGAAATCCATTGCCAAGAAACACAGTGGAAGCCCAGGCCAGGTCGCCCTGCGC 




AAGAGAACTTCCAGATTTTTGACTTTGAGTTGACTCCAGAGGACATGAAAGCCATTGA 
TGGCCTCAACAGAAATCTCCGATATGACAAGTTACAATTTGCTGCTAATCACCCTTAT 
TTTCCATTTTCTGAAGAATATTGACCATGAGCTATTGAACATTACCACCACAATTTCC 




ORF Start: ATG at 30 


ORF Stop: TGA at 1008 




SEQ ID NO: 46 


326 aa | MW at 37132.2kD 


NOV19a, 
CG56777-01 Protein Sequence 


MMTDLKQSHSVRLNDGPFMPVLGFGTYAPDHTPKSQAAEATKVAIDVGFRHIDSAYLY 
QNEEEVGQAIWEKIADGTVKREEIFYTIKLWATFFRAELVHPALERSLKKLGPDYVDL 
FIIHVPFAMKPGKELLPKDASGEIILETVELCDTWEALEKCKEAGLTRSIGVSNFNHK 
LLELILNKPGLKYKPTCNQVECHPYLNQSKLLEFCKSKDIVLVAYSALGSQRDPQWVD 
PDCPHLLEEPILKSIAKKHSGSPGQVALRYQLQRGVVVLAKSFSQERIKENFQIFDFE 
LTPEDMKAIDGLNRNLRYDKLQFAANHPYFPFSEEY 




SEQ ID NO: 47 | 1108 bp 


NOV19b, 
CG56777-02 DNA Sequence 


GAAAAGGAGGAAAGACAGGTTAGCTGGGGATGATGACGGATCTGAAGCAAAGCCATTC 


AGTGAGGCTGAATGATGGACCCTTCATGCCAGTGCTGGGATTTGGCACTTATGCTCCT 
GATCATACTCCCAAAAGCCAGGCTGCCGAGGCCACCAAAGTGGCTATTGACGTAGGCT 
TCCACCATATTGATTCAGCATACTTATACCAAAATGAGGAGGAGGTTGGACAGGCCAT 
TTGGGAGAAGATCGCTGATGGTACCGTCAAGAGAGAGGAAATATTCTACACCATCAAG 
CTTTGGGCTACTTTCTTTCGGGCAGAATTGGTTCACCCGGCCCTAGAAAGGTCACTGA 
AGAAACTTGGACCAGACTATGTAGATCTCTTCATTATTCATGTACCATTTGCTATGAA 
GGTCAGGTTCTTCCTGAGAGAGGCAGATTTGAGTGAGGTGCATAGGATTGCCAAAACT 
TTCCCTGGGAAAGAATTACTGCCAAAGGATGCCAGTGGAGAGATTATTTTAGAAACTG 
TGGAGCTTTGTGACACTTGGGAGGCCCTGGAGAAGTGCAAAGAAGCAGGTTTAACCAG 
GTCCATTGGGGTGTCCAATTTCAATCACAAGCTGCTGGAACTCATCCTCAACAAGCCA 
GGGCTCAAGTACAAGCCCACCTGCAACCAGGTGGAATGTCACCCTTACCTCAACCAGA 
GCAAACTCCTGGAGTTCTTCAAGTCCAAGGACATTGTTCTAGTTGCCTACAGTGCCCT 
GGGATCCCAAAGAGACCCACAGTGGGTGGATCCCGACTGCCCACATCTCTTGGAGGAG 
CCGATCTTGAAATCCATTGCCAAGAAACACAGTCGAAGCCCAGGCCAGGTCGCCCTGC 




CAAAGAGAACTTCCAGATTTTTGACTTTGAGTTGACTCCAGAGGACATGAAAGCCATT 
GATGGCCTCAACAGAAATCTCCGATATGATAAGTTACAATTTGCTGCTAATCACCCTT 
ATTTTCCATTTTCTGAAGAATATTGATCACCCTTATTTTCCATTTTCTGAAGAATATT 
GATCAC 




ORF Start: ATG at 30 


ORF Stop: TGA at 1068 




SEQ ID NO: 48 


346 aa | MW at 39673.1kD 


NOV19b, 
CG56777-02 Protein Sequence 


MMTDLKQSHSVRLNDGPFMPVLGFGTYAPDHTPKSQAAEATKVAIDVGFHHIDSAYLY 
QNEEEVGQAIWEKIADGTVKREEIFYTIKLWATFFRAELVHPALERSLKKLGPDYVDL 
FIIHVPFAMKVRFFLREADLSEVHRIAKTFPGKELLPKDASGEIILETVELCDTWEAL 
EKCKEAGLTRSIGVSNFNHKLLELILNKPGLKYKPTCNQVECHPYLNQSKLLEFFKSK 
DIVLVAYSALGSQRDPQWVDPDCPHLLEEPILKSIAKKHSRSPGQVALRYQLQRGVVV 
LAKSFSQERIKENFQIFDFELTPEDMKAIDGLNRNLRYDKLQFAANHPYFPFSEEY 
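
The relationship between a listed DNA sequence, its "ORF Start" position, and the listed protein sequence can be illustrated by translating codons from the start position onward. The sketch below uses the standard genetic code and the first line of the NOV19b DNA sequence as input. The function name and the handling of unreadable codons (reported as "X") are assumptions made for illustration; the source does not describe the software used to derive its protein sequences.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(dna: str, orf_start: int) -> str:
    """Translate from a 1-based start position until a stop codon or the
    last complete codon. Codons with non-ACGT characters become 'X'."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(orf_start - 1, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(aa)
    return "".join(residues)


# First 58 nucleotides of the NOV19b DNA sequence; the ORF starts at position 30.
fragment = "GAAAAGGAGGAAAGACAGGTTAGCTGGGGATGATGACGGATCTGAAGCAAAGCCATTC"
print(translate_orf(fragment, 30))  # MMTDLKQSH
```

The output matches the first nine residues of the NOV19b protein sequence. Running the same translation over a full listed sequence is one way to locate positions where the text of a DNA or protein listing has been damaged.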



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 19B. 

Table 19B. Comparison of NOV19a against NOV19b. 

Protein Sequence | NOV19a Residues/Match Residues | Identities/Similarities for the Matched Region 
NOV19b | 1..326 / 1..346 | 323/346 (93%) / 323/346 (93%) 
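
Each Identities/Similarities entry pairs a residue count over the aligned length with a percentage. A minimal sketch for recomputing that percentage is given below. The use of truncation rather than rounding is an assumption inferred from identity entries such as 145/281 (51%) in the Geneseq tables; the source does not state the rule, and not every printed percentage necessarily follows it. The function name is hypothetical.

```python
def percent_match(matches: int, aligned_length: int) -> int:
    """Whole-number percentage of matching positions, truncated (assumed rule)."""
    if aligned_length <= 0 or not 0 <= matches <= aligned_length:
        raise ValueError("expected 0 <= matches <= aligned_length and aligned_length > 0")
    return (100 * matches) // aligned_length


# Table 19B: NOV19a against NOV19b, 323 identical positions over 346 aligned
print(percent_match(323, 346))  # 93
```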



Further analysis of the NOV19a protein yielded the following properties shown in 
Table 19C. 



Table 19C. Protein Sequence Properties NOV19a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.4275 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV19a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 19D. 

Table 19D. Geneseq Results for NOV19a 

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV19a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value 
AAM79455 | Human protein SEQ ID NO 3101 - Homo sapiens, 325 aa. [WO200157190-A2, 09-AUG-2001] | 4..326 / 4..325 | 220/323 (68%) / 269/323 (83%) | e-131 
AAM78471 | Human protein SEQ ID NO 1133 - Homo sapiens, 323 aa. [WO200157190-A2, 09-AUG-2001] | 4..326 / 2..323 | 220/323 (68%) / 269/323 (83%) | e-131 
AAW14799 | Type 5 17-beta-hydroxysteroid dehydrogenase - Homo sapiens, 323 aa. [WO9711162-A1, 27-MAR-1997] | 2..323 | 220/323 (68%) / 269/323 (83%) | e-131 
AAB43444 | Human cancer associated protein sequence SEQ ID NO:889 - Homo sapiens, 336 aa. [WO200055350-A1, 21-SEP-2000] | 1..326 / 10..336 | 216/327 (66%) / 268/327 (81%) | e-128 
AAB76865 | Human lung tumour protein related protein sequence SEQ ID NO:783 - Homo sapiens, 364 aa. [WO200100828-A2, 04-JAN-2001] | 33..326 / 71..364 | 200/294 (68%) / 248/294 (84%) | e-120 



In a BLAST search of public sequence databases, the NOV19a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 19E. 

Table 19E. Public BLASTP Results for NOV19a 

Protein Accession Number | Protein/Organism/Length | NOV19a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value 
P05980 | Prostaglandin-F synthase 1 (EC 1.1.1.188) (PGF synthase 1) (PGF 1) (Prostaglandin-D2 11 reductase 1) (PGFSI) - Bos taurus (Bovine), 323 aa. | 7..326 / 4..323 | 229/320 (71%) / 270/320 (83%) | e-134 
P52897 | Prostaglandin-F synthase 2 (EC 1.1.1.188) (PGF synthase 2) (PGF 2) (Prostaglandin-D2 11 reductase 2) (PGFSII) - Bos taurus (Bovine), 323 aa. | 7..326 / 4..323 | 228/320 (71%) / 268/320 (83%) | e-134 
P52898 | Dihydrodiol dehydrogenase 3 (EC 1.-.-.-) (Prostaglandin F synthase) - Bos taurus (Bovine), 323 aa. | 11..326 / 8..323 | 226/316 (71%) / 264/316 (83%) | e-133 
Q9NS54 | 3ALPHA-HYDROXYSTEROID DEHYDROGENASE VARIANT (EC 1.1.1.213) - Homo sapiens (Human), 323 aa. | 11..326 / 8..323 | 216/316 (68%) / 265/316 (83%) | e-131 
P80508 | Prostaglandin-E2 9-reductase (EC 1.1.1.189) (20-alpha-hydroxysteroid dehydrogenase) (EC 1.1.1.149) (20-alpha-HSD) - Oryctolagus cuniculus (Rabbit), 323 aa. | 11..326 / 8..323 | 219/316 (69%) / 259/316 (81%) | e-131 



PFam analysis predicts that the NOV19a protein contains the domains shown in 
Table 19F. 



Table 19F. Domain Analysis of NOV19a 

Pfam Domain | NOV19a Match Region | Identities/Similarities for the Matched Region | Expect Value 
aldo_ket_red | 13..306 | 164/368 (45%) / 261/368 (71%) | 1.5e-144 



Example 20. 



The NOV20 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 20A. 

Table 20A. NOV20 Sequence Analysis 




SEQ ID NO: 49 


979 bp 


NOV20a, 
CG56941-01 DNA Sequence 


TGGAAAACTTGTCACAGTTTAGGACAACTCTCTTTGTAAAATTAGTGGTTCTTGGGAG 
TCAAAATGCTTCCTGGTGCCGGGAGTGAGCGATGAGCCGGCTTCTGTCCCTGGCGGCC 
AGAGTCGCCTTGGCCGCCTTGCCCTGCCGCCGCTCTCGCGGGTTCGGGATGTTCTATG 






CCGTGAGGAGGGGCGGCAAGTCCGCGGTCTTTCTGACCGGGAATGAGTGCAAAGCACA 
GGTGGACCGGTTTCCTGCTGCCAGATTTAAGAAGTTTGCCACAGAGGATGAGGCCTGG 
GACTTTGTCAGGAAATCTGCAAGCCCGGAAGTTTCAGAAGGGCAGGAAAATCAACATG 
GACAAGAATCGGAGACGAAAGCCAGCAAGCGACTCCGTGAGCCACTGGATGGAGATGG 
AGATGAAAGCGCAGAGCCATATGCAAGCACAATGAAGCCGAGCGTGAAGCCGGCGCCT 

GCTGCTGCTCCAGTAATGGGCGTAGAAAGCCACGAGCAGGAATCCGCGTTTACTGGGG 
GCCGGGCTATCCTTTAACTGTAGGCATTAGACTTCCTGGGTGGCAGACAAACCAAAGA 
GCGGAAATTCATGCAGCCTGCAAGGCCATTGAACAAGCACAGACTCAGAAAATCAATA 
AACTGGTTCTGTATACAGACAGTATGTTTACGATAAATGGTATAACTAACTGGGTTCA 
AGGTTGGAAGAAAAATGGGTGGAAGACAAGTGCAGGGAAAGAGGTGATCAACAAAGAG 
GACTTCGTGGCACTGGAGAGGCTGACCCAGGGGATGGACATTCAGTGGGCGAGTATCT 










TGGTTTATGTAAGTGGTTTAATGACATAACAATTTCCATGAAGTTAATATC 




ORF Start: ATG at 90 


ORF Stop: TGA at 897 




SEQ ID NO: 50 


269 aa | MW at 29980.9kD 


NOV20a, 
CG56941-01 Protein Sequence 


MSRLLSLAARVALAALPCRRSRGFGMFYAVRRGGKSAVFLTGNECKAQVDRFPAARFK 
KFATEDEAWDFVRKSASPEVSEGQENQHGQESETKASKRLREPLDGDGDESAEPYAST 






LPGWQTNQRAEIHAACKAIEQAQTQKINKLVLYTDSMFTINGITNWVQGWKKNGWKTS 
AGKEVINKEDFVALERLTQGMDIQWASILNVHVPWLL 



Further analysis of the NOV20a protein yielded the following properties shown in 
Table 20B. 



Table 20B. Protein Sequence Properties NOV20a 


PSort analysis: 


0.3700 probability located in outside; 0.2339 probability located in microbody 
(peroxisome); 0.1080 probability located in nucleus; 0.1000 probability located in 
endoplasmic reticulum (membrane) 


SignalP analysis: 


Cleavage site between residues 24 and 25 



A search of the NOV20a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 20C. 

Table 20C. Geneseq Results for NOV20a 

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV20a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value 
AAY70235 | Human RNA-associated protein-16 (RNAAP-16) - Homo sapiens, 286 aa. [WO200011171-A2, 02-MAR-2000] | 1..266 / 1..262 | 236/267 (88%) / 244/267 (90%) | e-134 
AAB97508 | Human type II RNase H protein - Homo sapiens, 286 aa. [WO200123613-A1, 05-APR-2001] | 1..266 / 1..262 | 235/267 (88%) / 242/267 (90%) | e-134 
AAY25094 | Human type 2 RNase H protein - Homo sapiens, 286 aa. [WO9928447-A1, 10-JUN-1999] | 1..266 / 1..262 | 235/267 (88%) / 242/267 (90%) | e-134 
AAB97511 | E coli type II RNase H protein - Escherichia coli, 216 aa. [WO200123613-A1, 05-APR-2001] | 23..266 / 4..206 | 157/244 (64%) / 178/244 (72%) | 2e-85 
AAY25098 | Mouse RNase H homologue protein fragment - Mus sp, 216 aa. [WO9928447-A1, 10-JUN-1999] | 23..266 / 4..206 | 157/244 (64%) / 178/244 (72%) | 2e-85 



In a BLAST search of public sequence databases, the NOV20a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 20D. 



Table 20D. Public BLASTP Results for NOV20a 

Protein Accession Number | Protein/Organism/Length | NOV20a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value 
O60930 | Ribonuclease HI (EC 3.1.26.4) (RNase HI) (Ribonuclease H type II) - Homo sapiens (Human), 286 aa. | 1..266 / 1..262 | 237/267 (88%) / 244/267 (90%) | e-134 
Q8VCR6 | RIBONUCLEASE HI - Mus musculus (Mouse), 285 aa. | 1..266 / 1..261 | 183/267 (68%) / 213/267 (79%) | e-101 
O70338 | Ribonuclease HI (EC 3.1.26.4) (RNase HI) - Mus musculus (Mouse), 285 aa. | 1..266 / 1..261 | 183/267 (68%) / 213/267 (79%) | e-101 
Q91953 | MRNA, COMPLETE CDS, CLONE CLFEST65 - Gallus gallus (Chicken), 293 aa. | 11..266 / 6..269 | 141/270 (52%) / 178/270 (65%) | 2e-71 
Q21024 | F59A6.6 PROTEIN - Caenorhabditis elegans, 369 aa. | 27..260 / 67..342 | 76/276 (27%) / 122/276 (43%) | 2e-25 

PFam analysis predicts that the NOV20a protein contains the domains shown in 
Table 20E. 



Table 20E. Domain Analysis of NOV20a 

Pfam Domain | NOV20a Match Region | Identities/Similarities for the Matched Region | Expect Value 
RnaseH | 1 S6..257 | 50/147 (34%) / 93/147 (63%) | 1.1e-27 



Example 21. 



The NOV21 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 21A. 

Table 21A. NOV21 Sequence Analysis 



SEQ ID NO: 51 


2536 bp 


NOV21a, 
CG57109-01 DNA Sequence 

GGGACACTGACATGGACTGAAGGAGTAGAAAAGAAGCCTTGGGCTCTCCCAGATGGAA 
GAATGACCGTGTGAGGAAACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCT 
GATTTCTTCAGGGAAGGGGATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGA 
AGAGCATTCAGGTGGCTGTAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACT 
GGCCCAGCACAGCCGTGCCCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCT 
CTGAAAGGAGACCACCGCTGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTG 
CAGGATGCAAGGCAGCCATGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGA 
TGACAGAGCGAGGACCCAGAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGT 
AGCAAGCCCCCCAGGGAAGCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATC 
TTGGGGTGGAGATTGAAAAGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAG 
AGAGAGGGAGCTTCAGCAGAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAG 
CTGGATATGGGGAAGGGCCCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCT 
GCAGGAGGTCTCCCGAGGCAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAG 
CCACAGGAGCAGCCCCAGGAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATG 
GACAAGAAAGAGGACAGAGGCCCAGAGGATCAAGAAAGCCATGCTCAGGGAGCAGCCA 
AGGCCAAGAAGGACCTTGTGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGGGAGGT 
GAAGAAGGACACCAGGCCCATGAGCAGGAGCAAACATGGTGGCTGGCTCCTGAGAGAG 
CACCAGGCGGGCTTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGA 
AGGAGAAAAAGCCATGTATGTCTGGAGGCAGAAGGATGACTCTCAGAGATGACCAACC 
TGCAAAGCTAGAAAAGGAGCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCC 
AGCGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATG 
AGACTGGCCGGGTCATTGGGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCG 
CGAGACCAGGCAGGCCTATGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAG 
GAGGACATGGTGGACAGTGAGATCTTGATCATCCAGAGCCTCTCTCACCCCAACATCG 

GCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTGAAGTTCCCGGAGCCCGAT 
GCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGCA 
TTGTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCAGCGAAATGAGGACAAATC 
TACTACCTTGAAATTGGCTGATTTTGGACTTGCAAAGCATGTGGTGAGACCTATATTT 
ACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTCTTTCTGAGAAAGGTTATG 
GACTGGAGGTGGACATGTGGGCGGCTGGCGTGATCCTCTATATCCTGCTGTGTGGCTT 
TCCCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCTTTAACATCATCCAGCTG 
GGCCACTTTGAGTTCCTCCCCCCTTACTGGGACAATATCTCTGATGCTGCTAAAGATC 
TGGTGAGCCGGTTGCTGGTGGTAGACCCCAAAAAGCGCTACACAGCTCATCAGGTTCT 
TCAGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGTGAAACGACAGAAGCAG 
GTGTCCCCCAGCAGCGAGGGTCACTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAGG 
TATCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTCAAGGACAGAGA 
AAAGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCTTCACATAATTGGTGAA 
TCAGAGGGAGAGACACTGAGTATATTTTAAAGCATATTAAAAAAATTAAGTCAATGTT 
AAATGTCACAACATATTTTTAGATTTGTATATTTAAAGCCTTTAATACATTTTTGGGG 
GGTAAGCATTGTCATCAGTGAGGAATTTTGGTAATAATGATGTGTTTTGCTTCCCCTT 
TGTAACCAAGTTTATTCTGTACTACAGGAGTGGTGCTTACCAGGGTCTAAACTCCCCC 
TGTGAGATTAATAAGGTGCATTGTGGTCTTTCTGTGTTAATAAAATGTGCTCTGAATA 
ACAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 




ORF Start: ATG at 150 


ORF Stop: TAG at 2094 




SEQ ID NO: 52 


648 aa | MW at 73813.6kD 


NOV21a, 
CG57109-01 Protein Sequence 


MGKEPLTLKSIQVAVEELYPNKARALTLAQHSRAPSPRLRSRLFSKALKGDHRCGETE 
TPKSCSEVAGCKAAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEE 
RHARGEKHLGVEIEKTSGEIIRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDV 
EKLVRTRSCRRSPEANPASGEEGWKGDSHRSSPRNPTQELRRPSKSMDKKEDRGPEDQ 
ESHAQGAAKAKKDLVEVLPVTEEGLREVKKDTRPMSRSKHGGWLLREHQAGFEKLRRT 
RGEEKEAEKEKKPCMSGGRRMTLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGII 
AANVEKHYETGRVIGDGNFAVVKECRHRETRQAYAMKIIDKSRLKGKEDMVDSEILII 
QSLSHPNIVKLHEVYETDMEIYLILEYVQGGDLFDAIIESVKFPEPDAALMIMDLCKA 
LVHMHDKSIVHRDLKPENLLVQRNEDKSTTLKLADFGLAKHVVRPIFTVCGTPTYVAP 
EILSEKGYGLEVDMWAAGVILYILLCGFPPFRSPERDQDELFNIIQLGHFEFLPPYWD 
NISDAAKDLVSRLLVVDPKKRYTAHQVLQHPWIETAGKTNTVKRQKQVSPSSEGHFRS 
QHKRVVEQVS 




SEQ ID NO: 53 


2808 bp 


NOV21b, 
CG57109-02 DNA Sequence 


TTTACAGAGTCAGGCCTCACCGTGAGAGGGCTCCTGTATTAGTCCCTTTTCATGCTGC 
TTATAGAGACATACCTGAGACTGGGCAATTTGCAGAGAAAGGTTTTCTTGGACTTACA 
GTTAGTTCCACGTGGCTGGGGAAGCCTCACAATCATGGCGGAAGGCAAGGAAGGGCAA 
GTCCCATCTTACATGGATGGCAGCAGGCAAAGAGAGAATGAGGAAGATGCAAAAGCGG 
AAACCCCTGATGTAACCATCAGATCTTATGAGATTTATTCACTACCATGGAACAGACA 
GCAAGGCGTATGTGACCATTCTCTAGAATATTTAAGCTCGAGAATCTCAGAGCGGAAG 
CTGCAAGGCTCCTGGCTGCCTGCCAGCCGAGGGAATCTGGAGAAACCATTCCTGGGGC 
CGCGTGGCCCCGTCGTGCCCTTGTTCTGCCCTCGGAATGGCCTTCACTCAGCACATCC 
TGAGAACAGCCCTCTGAAGCCCAGGGTCGTGACCGTAGTGAAGCTGGGTGGGCAGCGC 
CCCCGAAAGATCACTCTGCTCCTCAACAGGCGATCAGTGCAGACGTTCGAGCAGCTCT 

ACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTGATTTCTTCAGGGAAGGG 
GATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGAAGAGCATTCAGGTGGCTG 
TAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTGGCCCAGCACAGCCGTGC 
CCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCTCTGAAAGGAGACCACCGC 
TGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGCAGGATGCAAGGCAGCCA 
TGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGATGACAGAGCGAGGACCCA 
GAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTAGCAAGCCCCCCAGGGAA 
GCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCTTGGGGTGGAGATTGAAA 
AGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAGAGAGGGAGCTCCAGCA 

CCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTGCAGGAGGTCTCCCGAGG 
CAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGCCACAGGAGCAGCCCCAG 
GAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGGACAAGAAAGAGGACAGA 
GGCCCAGAGGATCAAGAAAGCCATGCTCAGGGAGCAGCCAAGGCCAAGAAGGACCTTG 
TGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGGGAGGTGAAGAAGGACACCAGGCC 
CATGAGCAGGAGCAAACATGGTGGCTGGCTCCTGAGAGAGCACCAGGCGGGCTTTGAG 
AAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAGGAGAAAAAGCCATGTA 
TGTCTGGAGGCAGAAGGATGACTCTCAGAGATGACCAACCTGCAAAGCTAGAAAAGGA 
GCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCCAGCGGTCGGAAGCCACGG 



GGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCGAGACCAGGCAGGCCTA 
TGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGGAGGACATGGTGGACAGT 
GAGATCTTGATCATCCAGAGCCTCTCTCACCCCAACATCGTGAAATTGCATGAAGTCT 
ACGAAACAGACATGGAAATCTACCTGATCCTGGAGTACGTGCAGGGAGGAGACCTTTT 
TGACGCCATCATAGAAAGTGTGAAGTTCCCGGAGCCCGATGCTGCCCTCATGATCATG 
GACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGCATTGTCCACCGGGACCTCA 
AGCCGGAAAACCTTTTGGTTCAGCGAAATGAGGACAAATCTACTACCTTGAAATTGGC 
TGATTTTGGACTTGCAAAGCATGTGGTGAGACCTATATTTACTGTGTGTGGGACCCCA 
ACTTACGTAGCTCCCGAAATTCTTTCTGAGAAAGGTTATGGACTGGAGGTGGACATGT 
GGCTTTCCCCCATTCCGCAGCCC 
TTAACATCATCCAGCTGGGCCACTTTGAGTTCCTC 
CCCCCTTACTGGGACAATATCTCTGATACAGCTGCTAAAGATCTGGTGAGCCGGTTGC 
TGGTGGTAGACCCCAAAAAGCGCTACACAGCTCATCAGGTTCTTCAGCACCCCTGGAT 
CGAAACAGCTGGCAAGACCAATACAGTGAAACGACAGAAGCAGGTGTCCCCCAGCAGC 
GAGGGTCACTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAGGTATCATAGTCACCAC 
CTTGGGAATCTGTCCAGCCCCCAGTTCTGCTCAAGGACAGAGAAAAGGATAGAAGTTT 



CTGAGTATATTTTAAAGCATATTA 

ORF Start: ATG at 151 



ORF Stop: TAG at 2659 



SEQ ID NO: 54 


836 aa | MW at 95152.6kD 


NOV21b, 
CG57109-02 Protein Sequence 



MAEGKEGQVPSYMDGSRQRENEEDAKAETPDVTIRSYEIYSLPWNRQQGVCDHSLEYL 
SSRISERKLQGSWLPASRGNLEKPFLGPRGPVVPLFCPRNGLHSAHPENSPLKPRVVT 
VVKLGGQRPRKITLLLNRRSVQTFEQLLADISEALGSPRWKNDRVRKLFNLKGREIRS 
VSDFFREGDAFIAMGKEPLTLKSIQVAVEELYPNKARALTLAQHSRAPSPRLRSRLFS 
KALKGDHRCGETETPKSCSEVAGCKAAMRHQGKIPEELSLDDRARTQKKWGRGKWEPE 
PSSKPPREATLEERHARGEKHLGVEIEKTSGEIIRCEKCKRERELQQSLERERLSLGT 
SELDMGKGPMYDVEKLVRTRSCRRSPEANPASGEEGWKGDSHRSSPRNPTQELRRPSK 
SMDKKEDRGPEDQESHAQGAAKAKKDLVEVLPVTEEGLREVKKDTRPMSRSKHGGWLL 
REHQAGFEKLRRTRGEEKEAEKEKKPCMSGGRRMTLRDDQPAKLEKEPKTRPEENKPE 
RPSGRKPRPMGIIAANVEKHYETGRVIGDGNFAVVKECRHRETRQAYAMKIIDKSRLK 
GKEDMVDSEILIIQSLSHPNIVKLHEVYETDMEIYLILEYVQGGDLFDAIIESVKFPE 
PDAALMIMDLCKALVHMHDKSIVHRDLKPENLLVQRNEDKSTTLKLADFGLAKHVVRP 
IFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVILYILLCGFPPFRSPERDQDELFNII 
QLGHFEFLPPYWDNISDTAAKDLVSRLLVVDPKKRYTAHQVLQHPWIETAGKTNTVKR 
QKQVSPSSEGHFRSQHKRVVEQVS 




SEQ ID NO: 55 


3016 bp 


NOV21c, 
CG57109-03 DNA Sequence 


TTTACAGAGTCAGGCCTCACCGTGAGAGGGCTCCTGTATTAGTCCCTTTTCATGCTGC 
TTATAGAGACATACCTGAGACTGGGCAATTTGCAGAGAAAGGTTTTCTTGGACTTACA 
GTTAGTTCCACGTGGCTGGGGAAGCCTCACAATCATGGCGGAAGGCAAGGAAGGGCAA 
GTCCCATCTTACATGGATGGCAGCAGGCAAAGAGAGAATGAGGAAGATGCAAAAGCGG 
AAACCCCTGATGTAACCATCAGATCTTATGAGATTTATTCACTACCATGGAACAGACA 
GCAAGGCGTATGTGACCATTCTCTAGAATATTTAAGCTCGAGAATCTCAGAGCGGAAG 
CTGCAAGGCTCCTGGCTGCCTGCCAGCCGAGGGAATCTGGAGAAACCATTCCTGGGGC 
CGCGTGGCCCCGTCGTGCCCTTGTTCTGCCCTCGGAATGGCCTTCACTCAGCACATCC 
TGAGAACAGCCCTCTGAAGCCCAGGGTCGTGACCGTAGTGAAGCTGGGTGGGCAGCGC 
CCCCGAAAGATCACTCTGCTCCTCAACAGGCGATCAGTGCAGACGTTCGAGCAGCTCT 
TAGCTGACATCTCAGAAGCCTTGGGCTCTCCCAGATGGAAGAATGACCGTGTGAGGAA 
ACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTGATTTCTTCAGGGAAGGG 
GATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGAAGAGCATTCAGGTGGCTG 
TAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTGGCCCAGCACAGCCGTGC 
CCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCTCTGAAAGGAGACCACCGC 
TGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGCAGGATGCAAGGCAGCTA 
TGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGATGACAGAGCGAGGACCCA 
GAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTAGCAAGCCCCCCAGGGAA 
GCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCTTGGGGTGGAGATTGAAA 
AGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAGAGAGGGAGCTTCAGCA 
GAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAGCTGGATATGGGGAAGGGC 
CCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTGCAGGAGGTCTCCCGAGG 
CAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGCCACAGGAGCAGCCCCAG 
GAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGGACAAGAAAGAGGACAGA 

TGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGGGAGGTGAAGAAGGACACCAGGCC 
CATGAGCAGGAGCAAACATGGTGGCTGGCTCCTGAGAGAGCACCAGGCGGGCTTTGAG 
AAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAGGAGAAAAAGCCATGTA 
TGTCTGGAGGCAGAAGGATGACTCTCAGAGATGACCAACCTGCAAAGCTAGAAAAGGA 
GCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCCAGCGGTCGGAAGCCACGG 
CCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGAGACTGGCCGGGTCATTG 
GGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCGAGACCAGGCAGGCCTA 
TGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGGAGGACATGGTGGACAGT 
GAGATCTTGCATGAAGTCTACGAAACAGACATGGAAATCTACCTGATCCTGGAGTACG 
TGCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTGAAGTTCCCGGAGCCCGA 
TGCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGC 
ATTGTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCAGCGAAATGAGGACAAAT 
CTACTACCTTGAAATTGGCTGATTTTGGACTTGCAAAGCATGTGGTGAGACCTATATT 
TACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTCTTTCTGAGAAAGGTTAT 

TTCCCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCTTTAACATCATCCAGCT 




CTGGTGAGCCGGTTGCTGGTGGTAGACCCCAAAAAGCGCTACACAGCTCATCAGGTTC 
TTCAGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGTGAAACGACAGAAGCA 
GGTGTCCCCCAGCAGCGAGGGTCACTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAG 
GTATCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTCAAGGACAGAG 
AAAAGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCTTCACATAATTGGTGA 
ATCAGAGGGAGAGACACTGAGTATATTTTAAAGCATATTAAAAAAATTAAGTCAATGT 
TAAATGTCACAACATATTTTTAGATTTGTATATTTAAAGCCTTTAATACATTTTTGGG 
GGGTAAGCATTGTCATCAGTGAGGAATTTTGGTAATAATGATGTGTTTTGCTTCCCCT 
TTGTAACCAAGTTTATTCTGTACTACAGGAGTGGTGCTTACCAGGGTCTAAACTCCCC 
CTGTGAGATTAATAAGGTGCACTGTGGTCTTTCTGTGTTAATAAAATGTGCTCTGAAT 




ORF Start: ATG at 52 


ORF Stop: TAG at 2617 




SEQ ID NO: 56 


855 aa | MW at 97447.3kD 


NOV21c, 
CG57109-03 Protein Sequence 


MLLIETYLRLGNLQRKVFLDLQLVPRGWGSLTIMAEGKEGQVPSYMDGSRQRENEEDA 
KAETPDVTIRSYEIYSLPWNRQQGVCDHSLEYLSSRISERKLQGSWLPASRGNLEKPF 
LGPRGPVVPLFCPRNGLHSAHPENSPLKPRVVTVVKLGGQRPRKITLLLNRRSVQTFE 
QLLADISEALGSPRWKNDRVRKLFNLKGREIRSVSDFFREGDAFIAMGKEPLTLKSIQ 
VAVEELYPNKARALTLAQHSRAPSPRLRSRLFSKALKGDHRCGETETPKSCSEVAGCK 
AAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEERHARGEKHLGVE 
IEKTSGEIIRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDVEKLVRTRSCRRS 
PEANPASGEEGWKGDSHRSSPRNPTQELRRPSKSMDKKEDRGPEDQESHAQGAAKAKK 
DLVEVLPVTEEGLREVKKDTRPMSRSKHGGWLLREHQAGFEKLRRTRGEEKEAEKEKK 
PCMSGGRRMTLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGIIAANVEKHYETGR 
VIGDGNFAVVKECRHRETRQAYAMKIIDKSRLKGKEDMVDSEILHEVYETDMEIYLIL 
EYVQGGDLFDAIIESVKFPEPDAALMIMDLCKALVHMHDKSIVHRDLKPENLLVQRNE 
DKSTTLKLADFGLAKHVVRPIFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVILYILL 
CGFPPFRSPERDQDELFNIIQLGHFEFLPPYWDNISDAAKDLVSRLLVVDPKKRYTAH 
QVLQHPWIETAGKTNTVKRQKQVSPSSEGHFRSQHKRVVEQVS 




SEQ ID NO: 57 


2433 bp 


NOV21d, 
CG57109-04 DNA Sequence 


TTTACAGAGTCAGGCCTCACCGTGAGAGGGCTCCTGTATTAGTCCCTTTTCATGCTGC 
TTATAGAGACATACCTGAGACTGGGCAATTTGCAGAGAAAGGTTTTCTTGGACTTACA 
GTTAGTTCCACGTGGCTGGGGAAGCCTCACAATCATGGCGGAAGGCAAGGAAGGGCAA 
GTCCCATCTTACATGGATGGCAGCAGGCAAAGAGAGAATGAGGAAGATGCAAAAGCGG 
AAACCCCTGATGTAACCATCAGATCTTATGAGATTTATTCACTACCATGGAACAGACA 
GCAAGGCGTATGTGACCATTCTCTAGAATATTTAAGCTCGAGAATCTCAGAGCGGAAG 
CTGCAAGGCTCCTGGCTGCCTGCCAGCCGAGGGAATCTGGAGAAACCATTCCTGGGGC 
CGCGTGGCCCCGTCGTGCCCTTGTTCTGCCCTCGGAATGGCCTTCACTCAGCACATCC 
TGAGAACAGCCCTCTGAAGCCCAGGGTCGTGACCGTAGTGAAGCTGGGTGGGCAGCGC 
CCCCGAAAGATCACTCTGCTCCTCAACAGGCGATCAGTGCAGACGTTCGAGCAGCTCT 
TAGCTGACATCTCAGAAGCCTTGGGCTCTCCCAGATGGAAGAATGACCGTGTGAGGAA 
ACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTGATTTCTTCAGGGAAGGG 
GATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGAAGAGCATTCAGGTGGCTG 
TAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTGGCCCAGCACAGCCGTGC 
CCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCTCTGAAAGGAGACCACCGC 
TGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGCAGGATGCAAGGCAGCCA 
TGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGATGACAGAGCGAGGACCCA 
GAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTAGCAAGCCCCCCAGGGAA 
GCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCTTGGGGTGGAGATTGAAA 
AGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAGAGAGGGAGCTCCAGCA 
GAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAGCTGGATATGGGGAAGGGC 
CCAATGTATGATGTGGAGAAAAAGCCATGTATGTCTGGAGGCAGAAGGATGACTCTCA 




GCCAGAGCGGCCCAGCGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTG 
GAAAAGCATTATGAGACTGGCCGGGTCATTGGGGATGGGAACTTTGCTGTCGTGAAGG 
AGTGCAGACACCGCGAGACCAGGCAGGCCTATGCGATGAAGATCATTGACAAGTCCAG 
ACTCAAGGGCAAGGAGGACATGGTGGACAGTGAGATCTTGATCATCCAGAGCCTCTCT 
CACCCCAACATCGTGAAATTGCATGAAGTTTACGAAACAGACATGGAAATCTACCTGA 
TCCTGGAGTACGTGCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTGAAGTT 
CCCGGAGCCCGATGCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATG 
CACGACAAGAGCATTGTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCAGCGAA 
ATGAGGACAAATCTACTACCTTGAAATTGGCTGATTTTGGACTTGCAAAGCATGTGGT 
GAGACCTATATTTACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTCTTTCT 




TGCTGTGTGGCTTTCCCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCTTTAA 
CATCATCCAGCTGGGCCACTTTGAGTTCCTCCCCCCTTACTGGGACAATATCTCTGAT 




CAGCTCATCAGGTTCTTCAGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGT 
GAAACGACAGAAGCAGGTGTCCCCCAGCAGCGAGGGTCACTTCCGGAGCCAGCACAAG 
AGGGTTGTGGAGCAGGTATCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTC 
TGCTCAAGGACAGAGAAAAGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCT 
TCACATAATTGGTGAATCAGAGGGAGAGACACTGAGTATATTTTAAAGCATATTA 




ORF Start: ATG at 52 


ORF Stop: TAG at 2284 




SEQ ID NO: 58 


744 aa | MW at 84729.4kD 


NOV21d, 
CG57109-04 Protein Sequence 


MLLIETYLRLGNLQRKVFLDLQLVPRGWGSLTIMAEGKEGQVPSYMDGSRQRENEEDA 
KAETPDVTIRSYEIYSLPWNRQQGVCDHSLEYLSSRISERKLQGSWLPASRGNLEKPF 
LGPRGPVVPLFCPRNGLHSAHPENSPLKPRVVTVVKLGGQRPRKITLLLNRRSVQTFE 
QLLADISEALGSPRWKNDRVRKLFNLKGREIRSVSDFFREGDAFIAMGKEPLTLKSIQ 
VAVEELYPNKARALTLAQHSRAPSPRLRSRLFSKALKGDHRCGETETPKSCSEVAGCK 
AAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEERHARGEKHLGVE 
IEKTSGEIIRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDVEKKPCMSGGRRM 
TLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGIIAANVEKHYETGRVIGDGNFAV 
VKECRHRETRQAYAMKIIDKSRLKGKEDMVDSEILIIQSLSHPNIVKLHEVYETDMEI 
YLILEYVQGGDLFDAIIESVKFPEPDAALMIMDLCKALVHMHDKSIVHRDLKPENLLV 
QRNEDKSTTLKLADFGLAKHVVRPIFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVIL 
YILLCGFPPFRSPERDQDELFNIIQLGHFEFLPPYWDNISDTAAKDLVSRLLVVDPKK 
RYTAHQVLQHPWIETAGKTNTVKRQKQVSPSSEGHFRSQHKRVVEQVS 
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SEQIDNO: 59 


2133 bp 


NOV21e, 
CG57109-05 DNA Sequence 


TTGACCGTGTGAGGAAACTCTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTGA 
TTTCTTCAGGGAAGGGGATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGAAG 
AGCATTCAGGTGGCTGTAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTGG 
CCCAGCACAGCCGTGCCCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCTCT 
GAAAGGAGACCACCGCTGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGCA 
GGATGCAAGGCAGCCATGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGATG 
ACAGAGCGAGGACCCAGAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTAG 




GGGGTGGAGATTGAAAAGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGAG 
AGAGGGAGCTCCAGCAGAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAGCT 
GGATATGGGGAAGGGCCCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTGC 
AGGAGGTCTCCCGAGGCAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGCC 
ACAGGAGCAGCCCCAGGAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGGA 
CAAGAAAGAGGACAGAGGCCCAGAGGATCAAGAAAGCCATGCTCAGGGAGCGGCCAAG 
GCCAAGAAGGACCTTGTGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGAGAGGTGA 
AGAAGGACACCAGGCCCATGAGCAGGAGCAAACATGGTGGCTGGCTCCTGAGAGAGCA 
CCAGGCGGGCTTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAG 
GAGAAAAAGCCATGTATGTCTGGAGGCAGAAGGATGACTCTCAGAGACGACCAACCTG 
CAAAGCTAGAAAAGGAGCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCCAG 
CGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGAG 
ACTGGCCGGGTCATTGGGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCG 
AGACCAGGCAGGCCTATGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGGA 
GGACATGGTGGACAGTGAGATCTTGATCATCCAGAGCCTCTCTCACCCCAACATCGTG 
AAATTGCATGAAGTCTACGAAACAGACATGGAAATCTACCTGATCCTGGAGTACGTGC 
AGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTGAAGTTCCCGGAGCCCGATGC 
TGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCCACATGCACGACAAGAGCATT 
GTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCAGCGAAATGAGGACAAATCTA 
CTACCTTGAAATTGGCTGATTTTGGACTTGCAAAGCATGTGGTGAGACCTATATTTAC 
TGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTCTTTCTGAGAAAGGTTATGGA 
CTGGAGGTGGACATGTGGGCTGCTGGCGTGATCCTCTATATCCTGCTGTGTGGCTTTC 
CCCCATTCCGCAGCCCTGAGAGGGACCAGGACGAGCTCTTTAACATCATCCAGCTGGG 
CCACTTTGAGTTCCTCCCCCCTTACTGGGACAATATCTCTGATGCTGCTAAAGATCTG 
GTGAGCCGGTTGCTGGTGGTAGACCCCAAAAAGCGCTACACAGCTCATCAGGTTCTTC 
AGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGTGAAACGACAGAAGCAGGT 
GTCCCCCAGCAGCGAGGGTCACTTCCGGAGCCAGCACAAGAGGGTTGTGGAGCAGGTA 
TCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTCAAGGACAGAGAAA 
AGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCTTCACA 




ORF Start: ATG at 90 


ORF Stop: TAG at 2034 




SEQ ID NO: 60 


648 aa | MW at 73813.6kD 


NOV21e, 
CG57109-05 Protein Sequence 


MGKEPLTLKSIQVAVEELYPNKARALTLAQHSRAPSPRLRSRLFSKALKGDHRCGETE 
TPKSCSEVAGCKAAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEE 
RHARGEKHLGVEIEKTSGEIIRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDV 
EKLVRTRSCRRSPEANPASGEEGWKGDSHRSSPRNPTQELRRPSKSMDKKEDRGPEDQ 
ESHAQGAAKAKKDLVEVLPVTEEGLREVKKDTRPMSRSKHGGWLLREHQAGFEKLRRT 
RGEEKEAEKEKKPCMSGGRRMTLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGII 
AANVEKHYETGRVIGDGNFAVVKECRHRETRQAYAMKIIDKSRLKGKEDMVDSEILII 
QSLSHPNIVKLHEVYETDMEIYLILEYVQGGDLFDAIIESVKFPEPDAALMIMDLCKA 
LVHMHDKSIVHRDLKPENLLVQRNEDKSTTLKLADFGLAKHVVRPIFTVCGTPTYVAP 
EILSEKGYGLEVDMWAAGVILYILLCGFPPFRSPERDQDELFNIIQLGHFEFLPPYWD 
NISDAAKDLVSRLLVVDPKKRYTAHQVLQHPWIETAGKTNTVKRQKQVSPSSEGHFRS 
QHKRVVEQVS 
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SEQIDNO:61 


2720 bp 


NOV21f, 
CG57109-06 DNA Sequence 


GGACACTGACATGGACTGAAGGAGTAGAAAAGAAGCCTTGGGCTCTCCCAGATGGAAG 
AATGACCGTGTGAGGAAACTGTTTAACCTCAAGGGCAGGGAAATCAGGAGCGTCTCTG 
ATTTCTTCAGGGAAGGGGATGCTTTCATAGCTATGGGCAAAGAACCACTGACACTGAA 
GAGCATTCAGGTGGCTGTAGAAGAACTGTACCCCAACAAAGCCCGGGCCCTGACACTG 
GCCCAGCACAGCCGTGCCCCTTCTCCAAGGCTGAGGAGCAGGCTGTTTAGCAAGGCTC 
TGAAAGGAGACCACCGCTGTGGGGAGACCGAGACCCCCAAGAGCTGCAGCGAAGTTGC 
AGGATGCAAGGCAGCTATGAGGCACCAGGGGAAGATCCCCGAGGAGCTTTCACTAGAT 
GACAGAGCGAGGACCCAGAAGAAGTGGGGGAGGGGGAAATGGGAGCCAGAACCCAGTA 
GCAAGCCCCCCAGGGAAGCCACTCTGGAAGAGAGGCACGCAAGGGGAGAGAAGCATCT 
TGGGGTGGAGATTGAAAAGACCTCGGGTGAAATTATCAGATGCGAGAAGTGCAAGAGA 
GAGAGGGAGCTTCAGCAGAGCCTGGAGCGTGAGAGGCTTTCTCTGGGGACCAGTGAGC 
TGGATATGGGGAAGGGCCCAATGTATGATGTGGAGAAGCTGGTGAGGACCAGAAGCTG 
CAGGAGGTCTCCCGAGGCAAATCCTGCAAGTGGGGAGGAAGGGTGGAAGGGTGACAGC 
CACAGGAGCAGCCCCAGGAATCCCACTCAAGAGCTGAGGAGACCCAGCAAGAGCATGG 
ACAAGAAAGAGGACAGAGGCCCAGAGGATCAAGAAAGCCATGCTCAGGGAGCAGCCAA 
GGCCAAGAAGGACCTTGTGGAAGTTCTTCCTGTCACAGAGGAGGGGCTGAGGGAGGTG 
AAGAAGGACACCAGGCCCATGAGCAGGAGCAAACATGGTGGCTGGCTCCTGAGAGAGC 
ACCAGGCGGGCTTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAA 
GGAGAAAAAGCCATGTATGTCTGGAGGCAGAAGGATGACTCTCAGAGATGACCAACCT 
GCAAAGCTAGAAAAGGAGCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCCA 
GCGGTCGGAAGCCACGGCCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGA 
GACTGGCCGGGTCATTGGGGATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGC 
GAGACCAGGCAGGCCTATGCGATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGG 
AGGACATGGTGGACAGTGAGATCTTGCATGAAGTCTACGAAACAGACATGGAAATCTA 
CCTGATCCTGGAGTACGTGCAGGGAGGAGACCTTTTTGACGCCATCATAGAAAGTGTG 
AAGTTCCCGGAGCCCGATGCTGCCCTCATGATCATGGACTTATGCAAAGCCCTCGTCC 
ACATGCACGACAAGAGCATTGTCCACCGGGACCTCAAGCCGGAAAACCTTTTGGTTCA 
GCGAAATGAGGACAAATCTACTACCTTGAAATTGGCTGATTTTGGACTTGCAAAGCAT 
GTGGTGAGACCTATATTTACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAAATTC 
TTTCTGAGAAAGGTAAGTGTTACACATCGATCTGTGGGACTCTAGTTCCCTTACTAAC 
AAATGTTTCATTTGTCATATTTACTAGTTTTCAATATGGAATAAATCTCAGAGAACTG 
ACACTTAGGCTTGGATTTGGACTTCAATGAAAATATTTGAAGTAGGCTTGACCAAAGC 
ATGAGAGCTTTCTCCTCATTAGGGCTGCCCTTGTTACAGTCAATGGATCAGTGTGTGT 
GCATGTGTGTGTGTGTGTGTGTGTTTATTGTGTTTAGGCAGGACAGTGAGATGAAAGA 
TGATGGAGAATTGGGTGGGGAACTCAAGCAAGCAAGGCTACTTGACCCAAGGCTATCT 
CTAATAGGAGAGAATTGAAGCAGTCCTTATGGTACTTGGTTTAAAAATTTCTTCACCA 
ACCTTGCATTTAAAGGAAAAGGATCCCATTTTCCTCCATGAACTCTATGAATATTTAT 
TACCTACTTGTATATTATGCAAGGATCCAATGAGCTGTTTTAGTGACAAACTTTCTAA 
AACATTTAAAAAGGAAATAATAGTTATGATATGGCTCTAAATATATGAATGACTATTT 
GACTACTGTGGCACTCCAGGAGAAACCAATTTACCCACCATTGGTAAGATGGGAAGAC 
TCTCATTGGGTTGCAGGGTTGGTGACAGGGAGAAGGGATGGACGGATAGTTTCCCAGC 
AGCAGAAGCATCAGGATAATTAAGATGAGGAGATGGCCAGGGATGGTGGCTCATGCCT 
GTAATCCCAGCTCTTTGGGAGGCTGAGGCAGGTGGATCACCTGAGGTCAGGCGTTTGA 
AACCATCCTGGCCAACATGGTGAAACCCTGTCTGTACTAAAACTACAAAAAAATTAGC 
TGGGCGTGGIGGCACATGCCTGTAATCIGAGCTACTCGGGAGGCTGAGGCAGGAGAAT 
TGCTTGAACCGGGGAGGTGGAGTTTGCAGTGAGCCAAGATCGTGCCATTGCACTCCAG 
CCTGGGCAACAAGAGTGAAACTTCATCTCAAAAAAAAAAAAAAAAAAAAAAC 




ORF Start: ATG at 149 


ORF Stop: TGA at 1826 




SEQ ID NO: 62 


559 aa | MW at 63436.9kD 


NOV21f, 
CG57109-06 Protein Sequence 


MGKEPLTLKSIQVAVEELYPNKARALTLAQHSRAPSPRLRSRLFSKALKGDHRCGETE 
TPKSCSEVAGCKAAMRHQGKIPEELSLDDRARTQKKWGRGKWEPEPSSKPPREATLEE 
RHARGEKHLGVEIEKTSGEIIRCEKCKRERELQQSLERERLSLGTSELDMGKGPMYDV 
EKLVRTRSCRRSPEANPASGEEGWKGDSHRSSPRNPTQELRRPSKSMDKKEDRGPEDQ 
ESHAQGAAKAKKDLVEVLPVTEEGLREVKKDTRPMSRSKHGGWLLREHQAGFEKLRRT 
RGEEKEAEKEKKPCMSGGRRMTLRDDQPAKLEKEPKTRPEENKPERPSGRKPRPMGII 
AANVEKHYETGRVIGDGNFAVVKECRHRETRQAYAMKIIDKSRLKGKEDMVDSEILHE 
VYETDMEIYLILEYVQGGDLFDAIIESVKFPEPDAALMIMDLCKALVHMHDKSIVHRD 
LKPENLLVQRNEDKSTTLKLADFGLAKHVVRPIFTVCGTPTYVAPEILSEKGKCYTSI 
CGTLVPLLTNVSFVIFTSFQYGINLRELTLRLGFGLQ 
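
The ORF coordinates and protein lengths listed in Table 21A can be cross-checked arithmetically. The following is a minimal sketch, assuming (as the listed values suggest) that the ORF start is the 1-based position of the first base of the ATG and the ORF stop is the 1-based position of the first base of the stop codon. The function name `orf_protein_length` is hypothetical and is used only for illustration.

```python
def orf_protein_length(orf_start: int, orf_stop: int) -> int:
    """Number of residues encoded between a start codon and a stop codon.

    Assumption: both coordinates are 1-based positions of the first base
    of the respective codon, so the stop codon itself is not translated.
    """
    coding_bases = orf_stop - orf_start
    if coding_bases <= 0 or coding_bases % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_bases // 3


# (ORF start, ORF stop, listed protein length in aa) as given in Table 21A.
LISTED_ORFS = {
    "NOV21d": (52, 2284, 744),
    "NOV21e": (90, 2034, 648),
    "NOV21f": (149, 1826, 559),
}

for name, (start, stop, listed_aa) in LISTED_ORFS.items():
    # Each listed length should equal the codon count of its ORF.
    assert orf_protein_length(start, stop) == listed_aa, name
```

Under this reading of the coordinates, the three listed lengths (744, 648 and 559 aa) each equal one third of the distance between the start and stop positions.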



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 21B. 
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Table 21B. Comparison of NOV21a against NOV21b through NOV21f. 


Protein Sequence | NOV21a Residues/Match Residues | Identities/Similarities for the Matched Region 
NOV21b | 1..648 / 188..836 | 608/649 (93%) / 608/649 (93%) 
NOV21c | 1..648 / 221..855 | 595/648 (91%) / 595/648 (91%) 
NOV21d | 300..648 / 395..744 | 349/350 (99%) / 349/350 (99%) 
NOV21e | 1..648 / 1..648 | 608/648 (93%) / 608/648 (93%) 
NOV21f | 1..529 / 1..516 | 476/529 (89%) / 476/529 (89%) 
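
The identity percentages in Table 21B can be reproduced from the listed counts. The sketch below is illustrative only: it assumes the percentage is the integer part of 100 × identities / alignment length, which is consistent with every row of Table 21B (rounding to the nearest integer would give different values). It is an observation about the listed numbers, not a statement of how the alignment program computes them, and the function names are hypothetical.

```python
def percent_identity(identities: int, alignment_length: int) -> int:
    """Integer percentage, truncated rather than rounded."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return (100 * identities) // alignment_length


def span_length(first: int, last: int) -> int:
    """Number of residues in an inclusive range such as 300..648."""
    return last - first + 1


# (identities, alignment length, listed percentage) from Table 21B.
TABLE_21B = {
    "NOV21b": (608, 649, 93),
    "NOV21c": (595, 648, 91),
    "NOV21d": (349, 350, 99),
    "NOV21e": (608, 648, 93),
    "NOV21f": (476, 529, 89),
}

for name, (identities, length, listed) in TABLE_21B.items():
    assert percent_identity(identities, length) == listed, name

# The NOV21d row aligns 349 NOV21a residues against 350 NOV21d residues.
assert span_length(300, 648) == 349
assert span_length(395, 744) == 350
```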



Further analysis of the NOV21a protein yielded the following properties shown in 
Table 21C. 



Table 21C. Protein Sequence Properties NOV21a 


PSort analysis: 


0.7000 probability located in plasma membrane; 0.3000 probability located in 
microbody (peroxisome); 0.3000 probability located in nucleus; 0.2000 probability 
located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV21a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 21D. 
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Table 21D. Geneseq Results for NOV21a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV21a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value 
AAY42696 | Rat serine-threonine protein kinase PK80 sequence - Rattus norvegicus, 733 aa. [WO9950395-A1, 07-OCT-1999] | 1..648 / 98..733 | 495/649 (76%) / 535/649 (82%) | 0.0 
AAB65622 | Novel protein kinase, SEQ ID NO: 148 - Mus musculus, 806 aa. [WO200073469-A2, 07-DEC-2000] | 1..647 / 172..805 | 472/648 (72%) / 521/648 (79%) | 0.0 
AAY42697 | Mouse serine-threonine protein kinase PK80 sequence - Mus musculus, 732 aa. [WO9950395-A1, 07-OCT-1999] | 98..731 | 472/648 (72%) / 521/648 (79%) | 0.0 
AAB65621 | Novel protein kinase, SEQ ID NO: 147 - Homo sapiens, 686 aa. [WO200073469-A2, 07-DEC-2000] | 276..648 / 313..686 | 370/374 (98%) / 371/374 (98%) | 0.0 
AAY54579 | A rat calcium/calmodulin dependent protein kinase designated CaMK-VI - Rattus sp, 422 aa. [EP978562-A1, 09-FEB-2000] | 348..641 / 75..367 | 154/294 (52%) / 212/294 (71%) | 3e-87 



In a BLAST search of public sequence databases, the NOV21a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 21E. 
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Table 21E. Public BLASTP Results for NOV21a 


Protein Accession Number | Protein/Organism/Length | NOV21a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value 
Q9C098 | KIAA1765 PROTEIN - Homo sapiens (Human), 608 aa (fragment). | 41..648 / 1..608 | 608/608 (100%) / 608/608 (100%) | 0.0 
O15075 | Serine/threonine-protein kinase DCAMKL1 (EC 2.7.1.-) (Doublecortin-like and CAM kinase-like 1) - Homo sapiens (Human), 740 aa. | 348..641 / 382..674 | 155/294 (52%) / 212/294 (71%) | 2e-87 
Q9JLM6 | CPG16 - Mus musculus (Mouse), 433 aa. | 348..641 / 75..367 | 154/294 (52%) / 212/294 (71%) | 7e-87 
O08875 | Serine/threonine-protein kinase DCAMKL1 (EC 2.7.1.-) (Doublecortin-like and CAM kinase-like 1) (Calcium/calmodulin-dependent protein kinase type I-like CPG16) - Rattus norvegicus (Rat), 433 aa. | 348..641 / 75..367 | 154/294 (52%) / 212/294 (71%) | 7e-87 
Q9JLM8 | Serine/threonine-protein kinase DCAMKL1 (EC 2.7.1.-) (Doublecortin-like and CAM kinase-like 1) - Mus musculus (Mouse), 756 aa. | 348..641 / 398..690 | 154/294 (52%) / 212/294 (71%) | 7e-87 



Pfam analysis predicts that the NOV21a protein contains the domains shown in 
Table 21F. 



Table 21F. Domain Analysis of NOV21a 


Pfam Domain | NOV21a Match Region | Identities/Similarities for the Matched Region | Expect Value 
Pkinase | 356..613 | 107/297 (36%) / 219/297 (74%) | 2.3e-101 



Example 22. 



The NOV22 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 22A. 
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Table 22A. NOV22 Sequence Analysis 



NOV22a, 
CG57366-01 DNA Sequence 



SEQ ID NO: 63 



TACATTCTTCATCGTTTAGATCAAGAAGAAGCTTTGCGACAACACCTCACAAAAGAAR 
CTGCAGAGATGTTAAATCAACTGCACATTAAAAGCAGTGGATGCTTTCTTTACCTAGA 
ACGAGTTTTAGATGGAGTTGTAGAAAATTTTATTATGTTAAGAGAAATTCGTGACATC 
CCAGGAACTCTAAATGGTTTATATCTCTGGCTGTGCCAAAGACTTTTTGTAAGAAAAC 
AATTTGCAAAGGTTCAGCCTATTTTGAATGTGATTCTTGCAGCCTGCCGACCTTTGAC 
CATAACGGAATTATATCACGCAGTATGGACCAAAAACATGTCGTTAACTTTGGAAGAT 
TTTCAACGCAAGTTAGATATCCTCTCCAAACTTCTTGTTGATGGACTAGGAAATACAA 
AAATACTGTTTCATTATAGTTTTGCCGAGTGGCTTCTGGATGTGAAACACTGTACTCA 
GAAGTATTTATGTAATGCAGCAGAAGGACACAGAATGTTGGCTATGAGTTATACCTGT 
CAAGCCAAGAATTTAACACCATTGGAAGCACAAGAATTTGCATTGCACTTAATTAACT 
CAAACTTACAATTAGAGACAGCGGAGTTAGCTCTGTGGATGATATGGAATGGTACACC 
TGTCAGAGATTCCCTTTCTACTTTGATACCCAAGGAACAAGAAGTGCTACAGCTGTTG 
GTTAAAGCTGGGGCTCATGTCAACAGTGAAGACGATCGCACATCATGCATAGTTCGAC 
AAGCCTTAGAAAGAGAGGATTCCATTCGGACATTATTAGATAATGGAGCTTCAGTAAA 
TCAGTGTGATTCAAATGGGAGAACATTATTGGCTAATGCTGCATATAGTGGCAGTCTT 
GATGTAGTCAATTTACTTGTCTCTAGGGGAGCAGATTTAGAGATAGAAGATGCTCATG 
GACATACACCACTCACTCTAGCGGCTAGACAGGGACATACCAAGGTGGTTAATTGTTT 
GATTGGGTGTGGAGCAAATATTAATCATACTGATCAAGATGGTTGGACAGCATTAAGA 



AAGTGGATTGTGCAGATGCTGATAGCCGAACAGCTTTGAGAGCAGCAGCATGGGGAGG 
ACACGAGGATATTGTACTGAATTTGCTACAACATGGCGCTGAAGTGAACAAAGCTGAT 
AATGAAGGTAGAACTGCTTTGATAGCAGCAGCATACATGGGACATAGAGAGATTGTGG 
AACACCTACTGGACCATGGAGCAGAAGTAAATCATGAGGATGTTGATGGCAGGACTGC 
ACTCTCTGTAGCTGCACTTTGTGTGCCTGCAAGTAAAGGGCACGCATCAGTTGTTAGC 
CTTTTAATTGATCGAGGTGCTGAAGTAGATCATTGTGATAAAGATGGCATGACTCCAC 
TGCTGGTAGCTGCCTATGAAGGACATGTTGATGTGGTTGACTTGCTTCTAGAAGGGGG 
AGCAGATGTAGATCACACAGATAACAATGGCCGTACACCCCTCTTAGCAGCAGCGTCT 
ATGGGTCATGCATCAGTTGTAAATACACTTTTGTTTTGGGGTGCAGCTGTGGATAGTA 
TTGATAGTGAAGGTAGGACAGTCCTCAGTATAGCTTCAGCACAAGGAAATGTTGAGGT 
GGTACGTACTCTACTGGATAGAGGGTTAGATGAAAATCACAGAGATGATGCTGGATGG 
ACACCTTTGCACATGGCAGCTTTTGAAGGGCACAGATTGATATGTGAAGCACTTATTG 
AACAAGGTGCTAGAACAAATGAGATTGACAATGATGGACGAATCCCTTTCATATTAGC 
TTCACAAGAGGGTCATTATGATTGTGTTCAAATATTACTGGAAAACAAATCCAACATT 
GATCAAAGAGGTTATGATGGAAGAAATGCACTGCGGGTTGCTGCATTAGAAGGGCACA 
GGGACATTGTTGAATTGCTTTTTAGCCATGGTGCTGATGTTAACTGCAAAGATGCTGA 
TGGTCGGCCTACACTTTATATCTTGGCCTTAGAAAATCAGCTTACAATGGCCGAATAT 
TTTTTAGAAAATGGTGCAAACGTAGAAGCAAGTGATGCTGAAGGAAGGACAGCACTTC 
ATGTGTCTTGTTGGCAAGGCCATATGGAAATGGTGCAGGTCCTGATAGCATACCATGC 
TGACGTCAATGCTGCAGACAATGAAAAGCGCTCTGCTTTGCAGTCTGCAGCCTGGCAG 
GGCCATGTAAAAGTGGTTCAGCTTCTGATTGAGCATGGTGCTGTAGTTGACCATACAT 
GTAACCAAGGTGCAACTGCACTCTGTATTGCAGCCCAGGAAGGGCACATTGATGTTGT 
TCAGGTCTTATTAGAGCATGGTGCTGATCCAAACCATGCTGATCAATTTGGACGCACT 
GCTATGCGTGTTGCAGCCAAAAATGGACATTCTCAGATAATTAAATTATTAGAAAAAT 
ATGGTGCATCTAGTTTGAATGGCTGTTCCCCATCTCCTGTTCACACAATGGAGCAAAA 
ACCTCTACAGTCATTGTCTTCAAAAGTGCAGTCATTAACAATTAAATCAAATAGCTCT 
GGTAGTACTGGTGGAGGGGATATGCAGCCTTCGTTACGTGGTTTACCTAATGGGCCTA 
CTCATGCTTTTAGTTCTCCTTCAGAATCTCCAGATTCTACAGTTGACCGGCAGAAGTC 
ATCACTGTCAAATAATTCCCTGAAAAGCTCAAAAAATTCATCTTTGAGAACTACTTCA 
TCTACAGCAACGGCTCAAACAGTGCCAATTGATAGCTTTCATAACTTGTCATTTACAG 
AACAAATTCAGCAGCATTCATTGCCACGCAGTAGAAGTCGACAGTCAATTGTTTCCCC 
ATCTTCCACAACACAGTCCTTAGGACAGAGTCATAATTCACCAAGTAGTGAATTTGAG 
TGGAGTCAAGTAAAGCCCAGTTTGAAGTCAACTAAAGCAAGTAAAGGGGGGAAATCAG 
AAAATTCTGCCAAGTCTGGATCAGCTGGGAAAAAAGCGAAACAAAGTAATTCTTCACA 
GCCAAAGGTTTTAGAATATGAAATGACTCAGTTTGATAGAAGAGGACCTATAGCCAAA 
TCCGGGACTGCTGCACCACCTAAACAAATGCCAGCAGAATCTCAATGCAAAATTATGA 
TACCTTCAGCTCAGCAGGAAATTGGTCGATCTCAACAGCAGTTTCTTATTCACCAACA 
AAGTGGGGAACAGAAGAAGAGAAATGGAATAATGACAAATCCAAATTATCATCTTCAG 
AGCAACCAGGTTTTTCTTGGTAGGGTTTCAGTCCCACGAACAATGCAAGATAGAGGGC 
ATCAGGAAGTGTTGGAGGGATACCCTTCCTCAGAGACAGAATTAAGCCTTAAACAAGC 
TCTGAAGCTTCAGATTGAAGGTTCTGACCCTAGCTTCAACTATAAAAAGGAAACACCA 
TTATAAAAGGTAATATTTTGTCAACATAAAGAGTAAAAAACAAAAGTAGAGTTCCCTC 
TACTTGGAATGCTCT 
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ORF Start: ATG at 67 


ORF Stop: TAA at 3484 




SEQ ID NO: 64 


1139 aa | MW at 123942.1kD 


NOV22a, 
CG57366-01 Protein Sequence 


MLNQLHIKSSGCFLYLERVLDGVVENFIMLREIRDIPGTLNGLYLWLCQRLFVRKQFA 
KVQPILNVILAACRPLTITELYHAVWTKNMSLTLEDFQRKLDILSKLLVDGLGNTKIL 
FHYSFAEWLLDVKHCTQKYLCNAAEGHRMLAMSYTCQAKNLTPLEAQEFALHLINSNL 
QLETAELALWMIWNGTPVRDSLSTLIPKEQEVLQLLVKAGAHVNSEDDRTSCIVRQAL 
EREDSIRTLLDNGASVNQCDSNGRTLLANAAYSGSLDVVNLLVSRGADLEIEDAHGHT 
PLTLAARQGHTKVVNCLIGCGANINHTDQDGWTALRSAAWGGHTEVVSALLYAGVKVD 
CADADSRTALRAAAWGGHEDIVLNLLQHGAEVNKADNEGRTALIAAAYMGHREIVEHL 
LDHGAEVNHEDVDGRTALSVAALCVPASKGHASVVSLLIDRGAEVDHCDKDGMTPLLV 
AAYEGHVDVVDLLLEGGADVDHTDNNGRTPLLAAASMGHASVVNTLLFWGAAVDSIDS 
EGRTVLSIASAQGNVEVVRTLLDRGLDENHRDDAGWTPLHMAAFEGHRLICEALIEQG 
ARTNEIDNDGRIPFILASQEGHYDCVQILLENKSNIDQRGYDGRNALRVAALEGHRDI 
VELLFSHGADVNCKDADGRPTLYILALENQLTMAEYFLEHGANVEASDAEGRTALHVS 
CWQGHMEMVQVLIAYHADVNAADNEKRSALQSAAWQGHVKVVQLLIEHGAVVDHTCNQ 
GATALCIAAQEGHIDVVQVLLEHGADPNHADQFGRTAMRVAAKNGHSQIIKLLEKYGA 
SSLNGCSPSPVHTMEQKPLQSLSSKVQSLTIKSNSSGSTGGGDMQPSLRGLPNGPTHA 
FSSPSESPDSTVDRQKSSLSNNSLKSSKNSSLRTTSSTATAQTVPIDSFHNLSFTEQI 
QQHSLPRSRSRQSIVSPSSTTQSLGQSHNSPSSEFEWSQVKPSLKSTKASKGGKSENS 
AKSGSAGKKAKQSNSSQPKVLEYEMTQFDRRGPIAKSGTAAPPKQMPAESQCKIMIPS 
AQQEIGRSQQQFLIHQQSGEQKKRNGIMTNPNYHLQSNQVFLGRVSVPRTMQDRGHQE 
VLEGYPSSETELSLKQALKLQIEGSDPSFNYKKETPL 
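
The relationship between a listed DNA sequence, its ORF start position and the encoded polypeptide can be illustrated with a short translation routine. This is a minimal sketch that assumes the standard genetic code and a 1-based ORF start coordinate; the function name `translate_orf` is hypothetical, and codons containing characters other than A, C, G or T are reported as `X` rather than guessed.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(dna: str, orf_start: int) -> str:
    """Translate from the 1-based position orf_start to the first stop codon.

    Translation also ends if the sequence runs out before a stop codon.
    """
    seq = dna.upper()
    residues = []
    for i in range(orf_start - 1, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Fragment around the NOV22a start codon (the ATG is at position 9 of this fragment).
five_prime = "CTGCAGAGATGTTAAATCAACTGCACATTAAAAGCAGT"
assert translate_orf(five_prime, 9) == "MLNQLHIKSS"

# Fragment around the NOV22a stop codon (TAA).
three_prime = "GAAACACCATTATAAAAGG"
assert translate_orf(three_prime, 1) == "ETPL"
```

The two fragments are taken from the NOV22a DNA sequence in Table 22A, and their translations agree with the first ten and last four residues of the listed NOV22a protein sequence.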



Further analysis of the NOV22a protein yielded the following properties shown in 
Table 22B. 



Table 22B. Protein Sequence Properties NOV22a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV22a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 22C. 
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Table 22C. Geneseq Results for NOV22a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV22a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value 
AAU20535 | Human secreted protein, Seq ID No 527 - Homo sapiens, 447 aa. [WO200155326-A2, 02-AUG-2001] | 699..1139 / 7..447 | 439/441 (99%) / 440/441 (99%) | 0.0 
ABB64823 | Drosophila melanogaster polypeptide SEQ ID NO 21261 - Drosophila melanogaster, 2119 aa. [WO200171042-A2, 27-SEP-2001] | 2..1079 / 885..2074 | 413/1213 (34%) / 621/1213 (51%) | e-177 
AAU25462 | Human mddt protein from clone LI:334386.1:2000MAY01 - Homo sapiens, 256 aa. [WO200162922-A2, 30-AUG-2001] | 466..702 / 2..238 | 231/237 (97%) / 233/237 (97%) | e-132 
AAU20651 | Human secreted protein, Seq ID No 643 - Homo sapiens, 182 aa. [WO200155326-A2, 02-AUG-2001] | 661..818 / 1..158 | 156/158 (98%) / 156/158 (98%) | 7e-86 
ABB67412 | Drosophila melanogaster polypeptide SEQ ID NO 29028 - Drosophila melanogaster, 1549 aa. [WO200171042-A2, 27-SEP-2001] | 236..813 / 215..785 | 196/578 (33%) / 308/578 (52%) | 2e-83 



In a BLAST search of public sequence databases, the NOV22a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 22D. 
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Table 22D. Public BLASTP Results for NOV22a 


Protein Accession Number | Protein/Organism/Length | NOV22a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value 
Q9ULJ7 | Hypothetical protein KIAA1223 - Homo sapiens (Human), 768 aa (fragment). | 372..1139 / 1..768 | 768/768 (100%) / 768/768 (100%) | 0.0 
AAH24725 | KIAA1223 PROTEIN - Homo sapiens (Human), 743 aa. | 397..1139 / 1..743 | 743/743 (100%) / 743/743 (100%) | 0.0 
AAL89945 | SD03956P - Drosophila melanogaster (Fruit fly), 1282 aa. | 2..1079 / 48..1237 | 413/1213 (34%) / 621/1213 (51%) | e-177 
AAL39916 | SD01389P - Drosophila melanogaster (Fruit fly), 2119 aa. | 2..1079 / 885..2074 | 413/1213 (34%) / 621/1213 (51%) | e-177 
Q9VAU5 | CG10011 PROTEIN - Drosophila melanogaster (Fruit fly), 2119 aa. | 2..1079 / 885..2074 | 413/1213 (34%) / 621/1213 (51%) | e-177 



Pfam analysis predicts that the NOV22a protein contains the domains shown in 
Table 22E. 
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Table 22E. Domain Analysis of NOV22a 


Pfam Domain | NOV22a Match Region | Identities/Similarities for the Matched Region | Expect Value 
Ank | 254..286 | 16/33 (48%) / 26/33 (79%) | 2.4e-06 
Ank | 287..319 | 17/33 (52%) / 27/33 (82%) | 9.1e-09 
Ank | 320..352 | 17/33 (52%) / 23/33 (70%) | 7.1e-06 
Ank | 353..385 | 16/33 (48%) / 23/33 (70%) | 2.6e-06 
Ank | 386..418 | 17/33 (52%) / 26/33 (79%) | 4.5e-09 
Ank | 419..456 | 17/38 (45%) / 31/38 (82%) | 2.9e-05 
Ank | 457..489 | 20/33 (61%) / 30/33 (91%) | 1.4e-10 
Ank | 490..522 | 17/33 (52%) / 22/33 (67%) | 6.2e-06 
Ank | 523..555 | 16/33 (48%) / 27/33 (82%) | 3.9e-06 
Ank | 556..588 | 15/33 (45%) / 24/33 (73%) | 9e-08 
Ank | 589..621 | 13/33 (39%) / 26/33 (79%) | 7.3e-06 
Ank | 622..654 | 17/33 (52%) / 26/33 (79%) | 3e-08 
Ank | 655..687 | 13/33 (39%) / 23/33 (70%) | 0.036 
Ank | 688..720 | 16/33 (48%) / 27/33 (82%) | 8.4e-10 
Ank | 721..753 | 14/33 (42%) / 26/33 (79%) | 2e-05 
Ank | 754..786 | 17/33 (52%) / 29/33 (88%) | 2.1e-10 
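
The Ank match regions in Table 22E form an uninterrupted run of repeats, which can be checked directly from the listed coordinates. The sketch below is illustrative only; the helper names are hypothetical, and the coordinates are those listed in Table 22E, with each denominator in the Identities column equal to the length of its region.

```python
# Ank match regions (first residue, last residue) as listed in Table 22E.
ANK_REGIONS = [
    (254, 286), (287, 319), (320, 352), (353, 385),
    (386, 418), (419, 456), (457, 489), (490, 522),
    (523, 555), (556, 588), (589, 621), (622, 654),
    (655, 687), (688, 720), (721, 753), (754, 786),
]


def region_length(region: tuple[int, int]) -> int:
    """Number of residues in an inclusive (first, last) range."""
    first, last = region
    return last - first + 1


def is_contiguous(regions: list[tuple[int, int]]) -> bool:
    """True when every region starts one residue after the previous one ends."""
    return all(nxt[0] == prev[1] + 1 for prev, nxt in zip(regions, regions[1:]))


lengths = [region_length(r) for r in ANK_REGIONS]

assert is_contiguous(ANK_REGIONS)
# Fifteen repeats span 33 residues; the one at 419..456 spans 38.
assert lengths.count(33) == 15 and lengths.count(38) == 1
# Together the sixteen repeats cover residues 254..786 without gaps.
assert sum(lengths) == region_length((254, 786))
```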



Example 23. 



The NOV23 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 23A. 
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Table 23A. NOV23 Sequence Analysis 




SEQ ID NO: 65 | 3321 bp 


NOV23a, 
CG57368-01 DNA Sequence 


GCCTCTTCAGCCCCCGGCCGCCCCCCAGCGAAGACCTCTTCTACGAGACCTACTACAG 
CCTGAGCCAGCAGTACCCGCTGCTGCTGCTGCTGCTGGGGATCGTGCTCTGTGCGCTC 
GCGGCGCTGCTCGCAGTGGCCTGGGCCAGCGGCAGGGAGCTGACCTCAGACCCGAGCT 




CCGGGAGCAGCGACTGCAGCGCTGGACGCGTCCCCTGTCCGGCTTGGTATGGGTCGCG 
CTGCTAGCGCTAGGCCACGCCTTCCTGTTCACCGGGGGCGTGGTGAGCGCCTGGGACC 
AGGTGTCCTATTTTCTCTTCGTCATCTTCACGGCGTATGCCATGCTGCCCTTGGGCAT 




CTGTATCTTGGGCCACAGCCGGACTCACGGCCTGCACTGCTGCCGCAGTTGGCAGCAA 
ACGCAGTGCTGTTCCTGTGCGGGAACGTGGCAGGAGTGTACCACAAGGCGCTGATGGA 
GCGCGCCCTGCGGGCCACGTTCCGGGAGGCACTCAGCTCCCTGCACTCACGCCGGCGG 
CTGGACACCGAGAAGAAGCACCAGGAACACCTTCTCTTGTCCATCCTTCCTGCCTACC 
TGGCCCGAGAGATGAAGGCAGAGATCATGGCACGGCTGCAGGCAGGACAGGGGTCACG 
GCCAGAGAGCACTAACAATTTCCACAGCCTCTATGTCAAGAGGCACCAGGGAGTCAGC 
GTGCTGTATGCTGACATCGTGGGCTTCACGCGGCTGGCCAGCGAGTGTTCCCCTAAGG 
AGCTGGTGCTCATGCTCAATGAGCTCTTTGGCAAGTTCGACCAGATTGCCAAGGAGCA 
TGAATGCATGCGGATCAAGATCCTGGGGGACTGTTACTACTGTGTCTCTGGGCTGCCA 
CTCTCACTGCCAGACCATGCCATCAACTGCGTGCGCATGGGCCTGGACATGTGCCGGG 
CCATCAGGAAACTGCGGGCAGCCACTGGCGTGGACATCAACATGCGTGTGGGCGTGCA 
CTCAGGCAGCGTACTGTGTGGAGTCATCGGGCTGCAGAAGTGGCAGTACGACGTTTGG 
TCACATGATGTCACACTGGCTAACCACATGGAGGCAGGCGGTGTACCAGGGCGAGTGC 
ACATCACAGGGGCTACCCTGGCCCTGCTGGCAGGGGCTTATGCTGTGGAGGACGCAGG 
CATGGAGCATCGGGACCCCTACCTTCGGGAGCTAGGGGAGCCTACCTATCTGGTCATC 
GATCCACGGGCAGAGGAGGAGGATGAGAAGGGCACTGCAGGAGGCTTGCTGTCCTCGC 
TTGAGGGCCTCAAGATGCGTCCATCACTGCTGATGACCCGTTACCTGGAGTCCTGGGG 
CGCAGCCAAGCCTTTTGCCCACCTGAGCCACGGAGACAGCCCTGTGTCCACCTCCACC 
CCTCTCCCGTGGCTCTCACTGTGCAGGAGCCGTACCCCCCGGGGACTAGATGATGAAC 
TGGACACCGGGGATGCCAAGTTCTTCCAGGTCATTGAGCAGCTCAACTCGCAGAAACA 
GTGGAAGCAGTCGAAGGACTTCAACCCACTGACACTGTACTTCAGAGAGAAGGAGATG 
GAGAAAGAGTACCGACTCTCTGCAATCCCCGCCTTCAAATACTATGAAGCCTGCACCT 
TCCTGGTTTTTCTCTCCAACTTCATCATCCAGATGCTAGTGACAAACAGGCCCCCAGC 




TTCTCAGAG AC( , 1 ir i CCTGAAAGGCCCCAAGATGCTGCACTGGCTGC 
CTGCACTGTCTGGCCTGGTGGCCACACGACCAGGACTGAGAATAGCCTTGGGCACCGC 
CACCATCCTCCTTGTCTTTGCCATGGCCATTACCAGCCTGTTCTTCTTCCCAACATCA 
TCAGACTGCCCTTTCCAAGCTCCCAATGTGTCCTCCATGATTTCCAACCTCTCCTGGG 
AGCTCCCTGGGTCTCTGCCTCTCATCAGTGTCCCATACTCCATGCACTGCTGCACGCT 
GGGCTTCCTCTCCTGCTCCCTCTTTCTGCACATGAGCTTCGAGCTGAAGCTGCTGCTG 




CGGAATGCCTCATCGTCCGCCTCTATCTGGGCCCCTTGGACTCCAGGCCCGGAGTGCT 
GAAGGAGCCCAAACTGATGGGTGCTATCTCCTTCTTCATCTTCTTCTTCACCCTCCTT 
GTCCTGGCTCGCCAGAATGAGTACTACTGCCGCCTGGACTTCCTGTGGAAGAAGAAGC 
TGAGGCAGGAGAGGGAGGAGACAGAGACGATGGAGAACCTGACTCGGCTGCTCTTGGA 
GAACGTGCTCCCTGCACACGTGGCCCCCCAGTTCATTGGCCAGAACCGGCGCAACGAG 
GATCTCTACCACCAGTCCTATGAATGCGTTTGTGTCCTCTTCGCCTCAGTCCCAGACT 
TCAAGGAGTTCTACTCTGAATCCAACATCAATCATGAGGGCCTAGAGTGTCTGAGGCT 
GCTCAATGAGATAATTGCTGATTTTGATGAGCTGCTCTCCAAGCCCAAGTTCAGTGGG 
GTGGAGAAGATCAAGACCATCGGCAGCACCTACATGGCAGCCACAGGCTTAAATGCCA 
CCTCTGGACAGGATGCACAACAGGATGCTGAACGGAGCTGCAGCCACCTTGGCACTAT 
GGTGGAATTTGCCGTGGCCCTGGGGTCTAAGCTGGACGTCATCAACAAGCATTCATTC 
AACAACTTCCGCCTGCGAGTGGGGTTGAACCATGGACCCGTAGTAGCTGGAGTTATTG 
GGGCCCAGAAGCGGCAATATGACATTTGGGGCAACACAGTGAACGTGGCCAGCCGCAT 
GGAGAGTACAGGAGTCCTTGGCAAAATCCAAGTGACTGAGGAGACAGCATGGGCCCTA 




AGCTCTGCACCTACTTCCTGAACACAGACTTGACACGAACTGGACCTCCTTCAGCTAC 
CCTAGGCTGAGATTGCACTCGCCTTCTAAGAACCTCAATAAAGAGACTCTGGGGTGTC 
TGGAGCCCATTGATG 
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ORF Start: ATG at 52 


ORF Stop: TGA at 3256 




SEQ ID NO: 66 


1068 aa | MW at 118800.7kD 


NOV23a, 
CG57368-01 Protein Sequence 


MARLFSPRPPPSEDLFYETYYSLSQQYPLLLLLLGIVLCALAALLAVAWASGRELTSD 

WDQVSYFLFVIFTAYAMLPLGMRDAAVAGLASSLSHLLVLGLYLGPQPDSRPALLPQL 
AANAVLFLCGNVAGVYHKALMERALRATFREALSSLHSRRRLDTEKKHQEHLLLSILP 
AYLAREMKAEIMARLQAGQGSRPESTNNFHSLYVKRHQGVSVLYADIVGFTRLASECS 
PKELVLMLNELFGKFDQIAKEHECMRIKILGDCYYCVSGLPLSLPDHAINCVRMGLDM 
CRAIRKLRAATGVDINMRVGVHSGSVLCGVIGLQKWQYDVWSHDVTLANHMEAGGVPG 
RVHITGATLALLAGAYAVEDAGMEHRDPYLRELGEPTYLVIDPRAEEEDEKGTAGGLL 
SSLEGLKMRPSLLMTRYLESWGAAKPFAHLSHGDSPVSTSTPLPWLSLCRSRTPRGLD 
DELDTGDAKFFQVIEQLNSQKQWKQSKDFNPLTLYFREKEMEKEYRLSAIPAFKYYEA 
CTFLVFLSNFIIQMLVTNRPPALAITYSITFLLFLLILFVCFSEDLMRCVLKGPKMLH 
WLPALSGLVATRPGLRIALGTATILLVFAMAITSLFFFPTSSDCPFQAPNVSSMISNL 
SWELPGSLPLISVPYSMHCCTLGFLSCSLFLHMSFELKLLLLLLWLAASCSLFLHSHA 
WLSECLIVRLYLGPLDSRPGVLKEPKLMGAISFFIFFFTLLVLARQNEYYCRLDFLWK 
KKLRQEREETETMENLTRLLLENVLPAHVAPQFIGQNRRNEDLYHQSYECVCVLFASV 
PDFKEFYSESNINHEGLECLRLLNEIIADFDELLSKPKFSGVEKIKTIGSTYMAATGL 
NATSGQDAQQDAERSCSHLGTMVEFAVALGSKLDVINKHSFNNFRLRVGLNHGPVVAG 
VIGAQKRQYDIWGNTVNVASRMESTGVLGKIQVTEETAWALQSLGYTCYSRGVIKVKG 
KGQLCTYFLNTDLTRTGPPSATLG 




SEQ ID NO: 67 


995 bp 


NOV23b, 
CG57368-02 DNA Sequence 


ATCATGGCCCGCCTCTTCAGCCCCCGGCCGCCCCCCAGCGAAGACCTCTTCTACGAGA 

CCTACTACAGCCTGAGCCAGCAGTACCCGCTGCTGCTGCTGCTGCTGGGGATCGTGCT 

CTGTGCGCTCGCGGCGCTGCTCGCAGTGGCCTGGGCCAGCGGCAGGGAGCTGACCTCA 

GACCCGAGCTTCCTGACCACTGTGCTGTGCGCGCTGGGCGGCTTCTCGrTG( [ ( I 

GCCTCGCTTCCCGGGAGCAGCGACTGCAGCGCTGGACGCGTCCCCTGTCCGGCTTGGT 

ATGGGTCGCGCTGCTAGCGCTAGGCCACGCCTTCCTGTTCACCGGGGGCGTGGTGAGC 

GCCTGGGACCAGGTGTCCTATTTTCTCTTCGTCATCTTCACGGCGTATGCCATGCTGC 

CCTTGGGCATGCGGGACGCCGCCGTCGCGGGCCTCGCCTCCTCACTCTCGCATCTGCT 

GGTCCTCGGGCTGTATCTTGGGCCACAGCCGGACTCACGGCCTGCACTGCTGCCGCAG 

TTGGCAGCAAACGCAGTGCTGTTCCTGTGCGGGAACGTGGCAGGAGTGTACCACAAGG 

CGCTGATGGAGCGCGCCCTGCGGGCCACGTTCCGGGAGGCACTCAGCTCCCTGCACTC 

ACGCCGGCGGCTGGACACCGAGAAGAAGCGCCAGGAACACCTTCTCTTGTCCATCCTT 

CCTGCCTACCTGGCCCGAGAGATGAAGGCAGAGATCATGGCACGGCTGCAGGCAGGAC 

AGGGGTCACGGCCAGAGAGCACTAACAATTTCCACAGCCTCTATGTCAAGAGTACAGG 

AGTCCTTGGCAAAATCCAAGTGACTGAGGAGACAGCATGGGCCCTACAGTCCCTGGGC 

TACACCTGCTACAGCCGGGGTGTCATCAAGGTGAAAGGCAAAGGGCAGCTCTGCACCT 

ACTTCCTGAACACAGACTTGACACGAACTGGACCTCCTTCAGCTACCCTAGGCTGAGA 

TTGCACTCG 




ORF Start: ATG at 4 


ORF Stop: TGA at 982 




SEQ ID NO: 68 


326 aa | MW at 35652.1kD 


NOV23b, 
CG57368-02 Protein Sequence 


MARLFSPRPPPSEDLFYETYYSLSQQYPLLLLLLGIVLCALAALLAVAWASGRELTSD 
PSFLTTVLCALGGFSLLLGLASREQRLQRWTRPLSGLVWVALLALGHAFLFTGGVVSA 
WDQVSYFLFVIFTAYAMLPLGMRDAAVAGLASSLSHLLVLGLYLGPQPDSRPALLPQL 
AANAVLFLCGNVAGVYHKALMERALRATFREALSSLHSRRRLDTEKKRQEHLLLSILP 
AYLAREMKAEIMARLQAGQGSRPESTNNFHSLYVKSTGVLGKIQVTEETAWALQSLGY 
TCYSRGVIKVKGKGQLCTYFLNTDLTRTGPPSATLG 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 23B. 



Table 23B. Comparison of NOV23a against NOV23b. 


Protein Sequence | NOV23a Residues/Match Residues | Identities/Similarities for the Matched Region 
NOV23b | 1..267 / 1..267 | 208/267 (77%) / 208/267 (77%) 
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Further analysis of the NOV23a protein yielded the following properties shown in 
Table 23C. 



Table 23C. Protein Sequence Properties NOV23a 


PSort analysis: 


0.8000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 51 and 52 



A search of the NOV23a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 23D. 



Table 23D. Geneseq Results for NOV23a 


Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV23a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value 
AAU01924 | Human adenylate cyclase polypeptide - Homo sapiens, 1077 aa. [WO200125448-A1, 12-APR-2001] | 1..1068 / 1..1077 | 1066/1078 (98%) / 1066/1078 (98%) | 0.0 
AAB02008 | Type IV adenylyl cyclase - Homo sapiens, 1064 aa. [US6107076-A, 22-AUG-2000] | 1..1066 / 1..1064 | 976/1073 (90%) / 999/1073 (92%) | 0.0 
AAB02006 | Adenylyl cyclase type II-C2 C2 alpha domain - Homo sapiens, 1090 aa. [US6107076-A, 22-AUG-2000] | 12..1060 / 28..1082 | 605/1064 (56%) / 772/1064 (71%) | 0.0 
AAR94560 | Rat adenylyl cyclase - Rattus sp, 1090 aa. [WO9608260-A1, 21-MAR-1996] | 12..1060 / 28..1082 | 605/1064 (56%) / 772/1064 (71%) | 0.0 
AAE02938 | Human adenylate cyclase 25678 - Homo sapiens, 1086 aa. [WO200144453-A1, 21-JUN-2001] | 10..1060 / 22..1078 | 605/1066 (56%) / 771/1066 (71%) | 0.0 



In a BLAST search of public sequence databases, the NOV23a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 23E. 
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Table 23E. Public BLASTP Results for NOV23a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV23a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC37757 


SEQUENCE 2 FROM PATENT 
WOO 125448 - Homo sapiens 
(Human), 1077 aa. 


1..1068 
1..1077 


1066/1078 
(98%) 

1066/1078 
(98%) 


0.0 


Q91WF3 


SIMILAR TO ADENYLYL 
CYCLASE 4 (ADENYLYL 
CYCLASE TYPE 4) (EC 4.6.1.1) - 
Mus musculus (Mouse), 1077 aa. 


1..1066 
1..1077 


988/1078 

(91%) 
1010/1078 

(93%) 


0.0 


P26770 


Adenylate cyclase, type IV (EC 
4.6. 1.1) (ATP pyrophosphate- lyase) 
(Adenylyl cyclase) - Rattus 
norvegicus (Rat), 1064aa. 


1..1066 
1.1064 


976/1073 
(90%) 

999/1073 
(92%) 


0.0 


P26769 


Adenylate cyclase, type II (EC 
4.6. 1 . 1 ) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Rattus 
norvegicus (Rat), 1 090 aa. 


12.. 1060 
28..1082 


605/1064 
(56%) 

772/1064 
(71%) 


0.0 


Q08462 


Adenylate cyclase, type II (EC 
4.6.1.1) (ATP pyrophosphate-lyase)
(Adenylyl cyclase) - Homo sapiens 

(Human), 887 aa (fragment). 


188..1060
1..879 


526/888 (59%) 
658/888 (73%) 


0.0 



PFam analysis predicts that the NOV23a protein contains the domains shown in
the Table 23F. 



Table 23F. Domain Analysis of NOV23a 


Pfam Domain 


NOV23a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Sodcu 


376..383


6/8 (75%) 
8/8 (100%) 


0.73 


guanylate_cyc


264..448 


71/226(31%) 
156/226(69%) 


4.2e-66 


guanylate_cyc 


855..1055


94/228 (41%) 
177/228 (78%) 


2.7e-85 



Example 24. 



The NOV24 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 24A. 
Table 24A. NOV24 Sequence Analysis




SEQ ID NO: 69 


997 bp


NOV24a, 
CG59955-01 DNA Sequence 


TTCTAATCTGTGGTXTCTTCACATCAACTGAAACAATGCAGCAAAATAACAGTGTGAC 
TGAATTCATACTGTTAGGATTAACACAGGATCCCTTGAGGCAGAAAATAGTGTTTGTA 




TCAAGTCCAGCCGGACACTAGGAAGCCCCATGTACTTCTTTCTATTTTATTTGTCCTT 
TGCAGATTCTTGCTTTTCAACTTCCACAGCCCCTAGATTAATTGTGGATGCTCTCTCT 
GAAAAGAAAATTATAACCTACAATGAGTGCATGACACAAGTCTTTGCACTACATTTAT 
TTGGCTGCATGGAGATCTTTGTCCTCATTCTCATGGCTGTTGATCGCTATGTGGCCAT 
CTGTAAGCCCTTGCGTTACCCAACCATCATGAGCCAGCAGGTCTGCATCATCCTGATT 
GTTCTTGCCTGGATAGGGTCTTTAATACACTCTACAGCTCAGATTATCCTGGCCTTAA 
GATTGCCTTTCTGTGGACCCTATTTGATTGATCATTATTGCTGTGATTTGCAGCCCTT 
GTTGAAACTTGCCTGCATGGACACTTACATGATCAACCTGCTGTTGGTGTCTAACAGT 
GGGGCAATTTGCTCAAGTAGTTTCATGATTTTGATAATTTCATATATTGTCATCTTGC 
ATTCACTGAGAAACCACAGTGCCAAAGGGAAGAAAAAGGCTCTCTCCGCTTGCACGTC 
TCACATAATTGTAGTCATCTTATTCTTTGGCCCATGTATATTCATATATACACGCCCC 
CCGACCACTTTCCCCATGGACAAGATGGTGGCAGTATTTTATACTATTGGACCACCCT 
TTCTCAATCCACTCATCTACACACTGAGGAATGCAGAAGTGAAAAATGCCATGAGAAA 
GTTATGGCATGGCAAAATTATTTCAGAAAACAAAGGATAAATTGAGGGCCTGACCTGA 
TTACTTTTTCA




ORF Start: ATG at 36 


ORF Stop: TAA at 966 




SEQ ID NO: 70 


310 aa | MW at 35002.7kD 


NOV24a, 
CG59955-01 Protein Sequence 


MQQNNSVTEFILLGLTQDPLRQKIVFVIFLIFYMGTWGNMLIIVTIKSSRTLGSPMY 
FFLFYLSFADSCFSTSTAPRLIVDALSEKKIITYNECMTQVFALHLFGCMEIFVLILM 
AVDRYVAICKPLRYPTIMSQQVCIILIVLAWIGSLIHSTAQIILALRLPFCGPYLIDH
YCCDLQPLLKLACMDTYMINLLLVSNSGAICSSSFMILIISYIVILHSLRNHSAKGKK
KALSACTSHIIVVILFFGPCIFIYTRPPTTFPMDKMVAVFYTIGPPFLNPLIYTLRNA
EVKNAMRKLWHGKIISENKG



Further analysis of the NOV24a protein yielded the following properties shown in 
Table 24B. 



Table 24B. Protein Sequence Properties NOV24a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.0300 
probability located in mitochondrial inner membrane 


SignalP analysis: 


Cleavage site between residues 40 and 41 



A search of the NOV24a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 24C. 
Table 24C. Geneseq Results for NOV24a


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV24a Residues/ 
Match Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


AAG72121 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1802 - 
Homo sapiens, 299 aa. 
[WO200127158-A2, 19-APR- 
2001] 


1..298 
2..299


296/298 
(99%) 

296/298 
(99%) 


e-170 


AAU24690 


Human olfactory receptor 
AOLFR189 - Homo sapiens, 298 
aa. [WO200168805-A2, 20-SEP- 
2001] 


1..286 
1..286 


284/286 
(99%) 

284/286 
(99%) 


e-163 


AAE09969 


G-protein coupled-receptor 
(GPCR) NOV5 protein - 
Unidentified, 310 aa. 
[WO200166746-A2, 13-SEP-
2001]


1..309 
1..309 


210/309 
(67%) 

262/309 
(83%) 


e-125 


AAG72412 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2093 -
Homo sapiens, 310 aa.
[WO200127158-A2, 19-APR-
2001]


1..309 
1..309 


210/309 
(67%) 

262/309 
(83%) 


e-125 


AAG72314 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1995 - 
Homo sapiens, 310 aa. 
[WO200127158-A2, 19-APR- 
2001] 


1..309 
1..309 


210/309 
(67%) 

262/309 
(83%) 


e-125 
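The identity percentages in Table 24C can be checked against their fractions. The following is a minimal sketch, assuming that the percentage is obtained by integer truncation rather than rounding; this assumption is inferred from the entry 210/309 being reported as 67% and is not stated in the text. The function name `truncated_percent` is hypothetical. The similarity column does not always follow the same rule, so the sketch is applied to identity entries only.

```python
def truncated_percent(cell: str) -> int:
    """Return the truncated integer percentage for a cell such as '210/309 (67%)'.

    Only the 'matches/columns' fraction is read; any printed percentage
    after it is ignored so that the value can be recomputed independently.
    """
    fraction = cell.split("(")[0].strip()
    matches, columns = fraction.split("/")
    # Integer division truncates: 210 * 100 // 309 gives 67, not 68.
    return int(matches) * 100 // int(columns)


# Identity entries taken from Table 24C.
for entry in ("296/298 (99%)", "284/286 (99%)", "210/309 (67%)"):
    print(entry, "->", truncated_percent(entry))
```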



In a BLAST search of public sequence databases, the NOV24a protein was found to
have homology to the proteins shown in the BLASTP data in Table 24D. 
Table 24D. Public BLASTP Results for NOV24a


Protein 

Accession 
Number 


Protein/Organism/Length 


NOV24a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q8VGF4 


OLFACTORY RECEPTOR 
MOR230-3 - Mus musculus 
(Mouse), 307 aa. 


1..307 
1..307


272/307 (88%) 
294/307 (95%) 


e-160 


Q8VGF5 


OLFACTORY RECEPTOR 
MOR230-2 - Mus musculus
(Mouse), 307 aa. 


1..305 


261/305 (85%) 
285/305 (92%) 


e-155 


Q8VGF6 


OLFACTORY RECEPTOR 
MOR230-1 - Mus musculus 
(Mouse), 307 aa. 


1..307
1..307 


259/307 (84%) 
286/307 (92%) 


e-154 


Q8VFF8 


OLFACTORY RECEPTOR 
MOR230-6 - Mus musculus 
(Mouse), 303 aa. 


4..304 
3..303


238/301 (79%) 
273/301 (90%) 


e-143 


Q8VFF9 


OLFACTORY RECEPTOR 
MOR230-5 - Mus musculus 
(Mouse), 303 aa. 


4..304 
3..303 


237/301 (78%) 
275/301 (90%) 


e-141 
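The Expect values in these tables use shorthand such as `e-160` alongside forms such as `4.2e-66` and `0.0`. The sketch below shows one way to convert them to numbers for sorting or filtering. It assumes that a bare `e-160` stands for 1e-160, which is the usual reading of this BLAST shorthand; the function name `parse_expect` is hypothetical.

```python
def parse_expect(text: str) -> float:
    """Convert an Expect value string from the tables into a float."""
    cleaned = text.strip().lower()
    # A bare exponent such as 'e-160' has no mantissa; assume a mantissa of 1.
    if cleaned.startswith("e"):
        cleaned = "1" + cleaned
    return float(cleaned)


# Expect values as printed in Table 24D, lowest (most significant) first.
values = ["e-160", "e-155", "e-154", "e-143", "e-141"]
print(sorted(values, key=parse_expect))
```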



PFam analysis predicts that the NOV24a protein contains the domains shown in
the Table 24E. 



Table 24E. Domain Analysis of NOV24a 


Pfam Domain 


NOV24a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_1


39..199 


36/181 (20%) 
120/181 (66%) 


4.4e-27 



Example 25. 



The NOV25 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 25A. 
Table 25A. NOV25 Sequence Analysis




SEQ ID NO: 71 | 847 bp


NOV25a, 
CG89211-01 DNA Sequence 


AACAATGCAGTGCAGGATGGCATACCATGGCAACAGGGGCACTTCTCACCCAGCCACA 
TTTTTTCTCATTGGAATCCCAGGTCTGGAAGACGTCCATATGAGAATCTCCCTGCATT 
TCTGCTCTGTTTACCTTTTGGCTTTGCTGGGAAATGCTACCATTCTGTTAGTCATCAA 
GGCAGAACAGACCCTCCGGGAGCCCATGTTCTACTTTCTGGCCATCCTTTCCACAATT 
GATTTGGCCCTTTCTACAACCTCTGTGCCTCGTACGCTGGGTATCTTCTGGTTTGATG 
TTCATGAGATTAACTTTGGAGCTTGTGTGGCCCAGATGTTTCTGATCCATGCCTTCAC 
TGGCATGGAGGCTGAGGTCCTGGTGGCCATGGCCTTTGACCATTACGTGGCCATCTGC 






GCATTTTAATTCGTCCAGTTCTGTTTACACTCCCGATAATCTATCTCATCTACCGTTT
ACCATTTTGTCAGGCTCATATAATAGCCCATTCCTACTATGAGCACATGGGCATTGCA
AAATTGTCCTGTGGAAACATCCGTGTCAATGCTATCTATGGGCTCTTTGTGGTCTCCC 
TCTTTCTCCTGAACCTGGTCCTTATTGTTATCTCATATGTTTACATTCTCTGTGCTGT 




TTGGACACAACATTCCACATTACATTCACATTCTT 




ORF Start: ATG at 5 | ORF Stop: TGA at 809




SEQ ID NO: 72 | 268 aa | MW at 30031.5kD


NOV25a, 
CG89211-01 Protein Sequence


MQCRMAYHGNRGTSHPATFFLIGIPGLEDVHMRISLHFCSVYLLALLGNATILLVIKA 
EQTLREPMFYFLAILSTIDLALSTTSVPRTLGIFWFDVHEINFGACVAQMFLIHAFTG 
MEAEVLVAMAFDHYVAICNPLHYTNILTCRVLVGITMCILIRPVLFTLPIIYLIYRLP
FCQAHIIAHSYYEHMGIAKLSCGNIRVNAIYGLFVVSLFLLNLVLIVISYVYILCAVF
CLPSHDARLKALSTCGSHVGVICVFYIPSVFSFLTH



Further analysis of the NOV25a protein yielded the following properties shown in 
Table 25B. 



Table 25B. Protein Sequence Properties NOV25a 


PSort analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi body; 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 59 and 60 



A search of the NOV25a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 25C. 
Table 25C. Geneseq Results for NOV25a


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV25a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG71560 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1241 

- Homo sapiens, 321 aa. 
[WO200127158-A2, 19-APR- 
2001] 


5..268 
1..264 


253/264 (95%) 
255/264 (95%) 


e-146 


ABB44529 


Human GPCR5b polypeptide 
SEQ ID NO 18 - Homo
sapiens, 311 aa.
[WO200174904-A2, 11-OCT-
2001]


5..268
1..263


225/264 (85%) 
239/264 (90%) 


e-127 


ABB44530 


Human GPCR5c polypeptide 
SEQ ID NO 20 - Homo 
sapiens, 311 aa.
[WO200174904-A2, 11-OCT- 
2001] 


5..268


225/264 (85%) 
238/264 (89%) 


e-127 


AAG71562 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1243 

- Homo sapiens, 311 aa.
[WO200127158-A2, 19-APR- 
2001] 


5..268
1..263 


225/264 (85%) 
237/264 (89%) 


e-127 


AAU24572 


Human olfactory receptor 
AOLFR62 - Homo sapiens, 
311 aa. [WO200168805-A2, 
20-SEP-2001] 


5..268
1..263 


225/264 (85%) 
237/264 (89%) 


e-127 



In a BLAST search of public sequence databases, the NOV25a protein was found to
have homology to the proteins shown in the BLASTP data in Table 25D. 
Table 25D. Public BLASTP Results for NOV25a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV25a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q8VGV8 


OLFACTORY RECEPTOR 
MOR32-3 - Mus musculus 
(Mouse), 317 aa. 


15..268 
10..263 


148/254 (58%) 
194/254 (76%) 


9e-86 


Q8VGX4 


OLFACTORY RECEPTOR 
MOR32-2 - Mus musculus 
(Mouse), 312 aa.


10..268
5..263 


151/259 (58%) 
194/259 (74%) 




Q8VG26 


OLFACTORY RECEPTOR 
MOR32-5 - Mus musculus 
(Mouse), 313 aa. 


8..268
3..263


146/261 (55%) 
196/261 (74%) 


4e-84 


Q8VF06 


OLFACTORY RECEPTOR 
MOR32-11 - Mus musculus
(Mouse), 312 aa. 


15..268
10..263 


144/254 (56%) 
191/254 (74%) 


2e-83 


Q8VG28 


OLFACTORY RECEPTOR 
MOR32-4 - Mus musculus 
(Mouse), 312 aa.


9..268
4..263


148/260 (56%) 
188/260 (71%) 


2e-80 



PFam analysis predicts that the NOV25a protein contains the domains shown in 
the Table 25E. 



Table 25E. Domain Analysis of NOV25a 


Pfam Domain 


NOV25a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_1


48..157 


30/112 (27%)
76/112 (68%) 


2.2e-09 



Example 26. 



The NOV26 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 26A. 
Table 26A. NOV26 Sequence Analysis




SEQ ID NO: 73 | 614 bp


NOV26a, 
CG90530-02 DNA Sequence 


CTTAGTGTGGAGCAGTGAACTGTGTGTGGTTCCTTCTACTTGGGGATCATGCAGAGAG 
CTTCACGTCTGAAGAGAGAGCTGCACATGTTAGCCACAGAGCCACCCCCAGGCATCAC 
ATGTTGGCAAGATAAAGACCAAATGGATGACCTGCGAGCTCAAATATTAGGTGGAGCC 
AACACACCTTATGAGAAAGGTGTTTTTAAGCTAGAAGTTATCATTCCTGAGAGGTACC 
CATTTGAACCTCCTCAGATCCGATTTCTCACTCCAATTTATCATCCAAACATTGATTC 
TGCTGGAAGGATTTGTCTGGATGTTCTCAAATTGCCACCAAAATCCTCAGAATTTAAA 
TATAATAAGCCAGCCTTCCTCAAGAATGCCAGACAGTGGACAGAGAAGCATGCAAGAC 
AGAAACAAAAGGCTGATGAGGAAGAGATGCTTGATAATCTACCAGAGGCTGGTGACTC 
CAGAGTACACAACTCAACACAGAAAAGGAAGGCCAGTCAGCTAGTAGGCATAGAAAAG 
AAATTTCATCCTGATGTTTAGGGGACTTGTCCTGGTTCATCTTAGTTAATGTGTTCTT 
TGCCAAGGTATCTAAGTTGCCTACCTTGAATTTT 




ORF Start: ATG at 49


ORF Stop: TAG at 541 




SEQ ID NO: 74 


164 aa | MW at 18958.6kD


NOV26a, 
CG90530-02 Protein Sequence 


MQRASRLKRELHMLATEPPPGITCWQDKDQMDDLRAQILGGANTPYEKGVFKLEVIIP 
ERYPFEPPQIRFLTPIYHPNIDSAGRICLDVLKLPPKSSEFKYNKPAFLKNARQWTEK 
HARQKQKADEEEMLDNLPEAGDSRVHNSTQKRKASQLVGIEKKFHPDV 
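The relationship between the ORF coordinates and the protein length in Table 26A can be checked with a short calculation. The sketch below is illustrative only: it assumes 1-based coordinates, that the stop position marks the first base of the stop codon, and the standard genetic code. The helper names `translate` and `orf_length_aa` are hypothetical, and only the opening of the NOV26a DNA sequence is used.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' marks unreadable codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def orf_length_aa(start: int, stop: int) -> int:
    """Residue count for an ORF given 1-based start and stop codon positions."""
    return (stop - start) // 3


# First line and the start of the second line of the NOV26a DNA sequence.
first_line = "CTTAGTGTGGAGCAGTGAACTGTGTGTGGTTCCTTCTACTTGGGGATCATGCAGAGAG"
second_line_prefix = "CTTCACGTCTGAAGAGAGAG"

orf_start = first_line.index("ATG") + 1                  # 1-based position
fragment = (first_line + second_line_prefix)[orf_start - 1:]

print(orf_start)                # start position of the first ATG
print(translate(fragment))      # first residues of the encoded protein
print(orf_length_aa(49, 541))   # residues implied by the ORF coordinates
```

Under these assumptions the first ATG falls at position 49, the translated fragment agrees with the start of the NOV26a protein sequence, and the coordinates 49 and 541 imply 164 residues, in agreement with the stated length.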



Further analysis of the NOV26a protein yielded the following properties shown in 
Table 26B. 



Table 26B. Protein Sequence Properties NOV26a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 
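The PSort entry in Table 26B is a semicolon-separated list of probabilities and locations. As a rough illustration, the sketch below extracts the pairs and ranks them. The regular expression and the function name `rank_locations` are assumptions made for this example; they are not part of PSort itself.

```python
import re

# Matches fragments such as '0.4500 probability located in cytoplasm'.
PATTERN = re.compile(r"(\d+\.\d+) probability located in ([^;]+)")


def rank_locations(text: str) -> list[tuple[float, str]]:
    """Return (probability, location) pairs, highest probability first."""
    flattened = " ".join(text.split())  # undo line wrapping
    hits = [(float(p), loc.strip()) for p, loc in PATTERN.findall(flattened)]
    # sorted() is stable, so equal probabilities keep their printed order.
    return sorted(hits, key=lambda hit: hit[0], reverse=True)


psort_nov26a = (
    "0.4500 probability located in cytoplasm; 0.3000 probability located in "
    "microbody (peroxisome); 0.1000 probability located in mitochondrial "
    "matrix space; 0.1000 probability located in lysosome (lumen)"
)

for probability, location in rank_locations(psort_nov26a):
    print(f"{probability:.4f}  {location}")
```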



A search of the NOV26a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 26C. 
Table 26C. Geneseq Results for NOV26a


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV26a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAB51233 


Human ubiquitin-conjugating 
enzyme 2 protein sequence SEQ 
ID NO: 7 - Homo sapiens, 197 aa. 
[CN1268564-A, 04-OCT-2000] 


1..164
1..197


164/197 (83%) 
164/197 (83%) 


2e-89 


AAM41455


Human polypeptide SEQ ID NO 
6386 - Homo sapiens, 207 aa. 
[WO200153312-A1, 26-JUL-
2001]


1..164
11..207


164/197 (83%) 
164/197 (83%) 


2e-89 


AAM39669


Human polypeptide SEQ ID NO 
2814 - Homo sapiens, 197 aa. 
[WO200153312-A1, 26-JUL-
2001]


1..164
1..197


164/197 (83%) 
164/197 (83%) 


2e-89 


AAU23245 


Novel human enzyme 
polypeptide #33 i - Homo 
sapiens, 207 aa. [WO200155301-
A2, 02-AUG-2001]


1..164
11..207


164/197 (83%) 
164/197 (83%) 


2e-89 


AAY23157


Human ubiquitin-like 
conjugating protein (UBCLE) - 

Homo sapiens, 197 aa. 
[WO9931252-A1, 24-JUN-1999]


1..164
1..197


164/197 (83%) 
164/197 (83%) 


2e-89 



In a BLAST search of public sequence databases, the NOV26a protein was found to
have homology to the proteins shown in the BLASTP data in Table 26D. 
Table 26D. Public BLASTP Results for NOV26a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV26a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect
Value


Q9NPD8 


UBIQUITIN-CONJUGATING 
ENZYME ISOLOG (HSPC150)
(UBIQUITIN-CONJUGATING
ENZYME E2) (CDNA FLJ20497
FIS, CLONE KAT08890) (HSPC150
PROTEIN SIMILAR TO 
UBIQUITIN-CONJUGATING 
ENZYME) - Homo sapiens 
(Human), 197 aa. 


1..164
1..197


164/197 (83%) 
164/197 (83%) 


4e-89 


Q9CQ37 


2700084L22RIK PROTEIN - Mus 
musculus (Mouse), 204 aa. 


1..164
1..196


136/197 (69%) 
145/197 (73%) 


7e-70 


Q9XHP3 


UBIQUITIN-CONJUGATING 
ENZYME E2 - Catharanthus roseus 
(Rosy periwinkle) (Madagascar
periwinkle), 153 aa. 


6..91 
9..94 


49/86 (56%) 
60/86 (68%) 


2e-23 


Q8W0II 


UBIQUITIN-CONJUGATING 
ENZYME E2 - Oryza sativa (Rice), 
153 aa. 


6..91 
9..94 


46/86 (53%) 
58/86 (66%) 


7e-21 


Q94A97 


AT1G78870/F9K20_8 - Arabidopsis 
thaliana (Mouse-ear cress), 153 aa. 


6..91 
9..94


46/86 (53%) 
57/86 (65%) 


7e-21 



PFam analysis predicts that the NOV26a protein contains the domains shown in 
the Table 26E. 



Table 26E. Domain Analysis of NOV26a 


Pfam Domain 


NOV26a Match Region 


Identities/
Similarities
for the Matched Region


Expect Value 


UQ_con 


1..116 


51/165 (31%) 
91/165 (55%) 


2.9e-23 



Example 27. 



The NOV27 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 27A. 
Table 27A. NOV27 Sequence Analysis




SEQ ID NO: 75 | 973 bp 


NOV27a, 
CG93076-01 DNA Sequence


G^TGAACCAGACTTTGAATAGCAGTGGGACCGTGGAGTCAGCCCTAAACTATTCCAG 


CTGTGCGGGATGGCAGGCAACAGCATGGTGATCTGGCTGCTGGGCTTTCGAATGCACA 
GGAACCCCTTCTGCATCTATATCCTCAACCTGGCGGCAGCCGACCTCCTCTTCCTCTT 
CAGCATGGCTTCCACGCTCAGCCTGGAAACCCAGCCCCTGGTCAATACCACTGACAAG 
GTCCACGAGCTGATGAAGAGACTGATGTACTTTGCCTACACAGTGGGCCTGAGCCTGC 
TGACGGCCATCAGCACCCAGCGCTGTCTCTCTGTCCTCTTCCCTATCTGGTTCAAGTG 

CTGATGAACGGGTTGACCTCTTCCTTCTGCAGCAAGTTCTTGAAATTCAATGAAGATC 
GGTGCTTCAGGGTGGACATGGTCCAGGCCGCCCTCATCATGGGGGTCTTAACCCCAGT 
GATGACTCTGTCCAGCCTGACCCTCTTTGTCTGGGTGCGGAGGGGCTCCCAGCAGTGG 




TCTGTTCCCTGCCTCTGAGCATCTACTGGTTTGTGCTCTACTGGTTGAGCCTGCCGCC 
CGAGATGCAGGTCCTGTGCTTCAGCTTGTCACGCCTCTCCTCGTCCGTAAGCAGCAGC 
GCCAACCCCGTCATCTACTTCCTGGTGGGCAGCCGGAGGAGCCACAGGCTGCCCACCA 

GGAGACGCCCACCGTGGGCACCAATGAGATGGGGGCTTGAGAGCC 




ORF Start: ATG at 3 


ORF Stop: TGA at 966 




SEQ ID NO: 76 


321 aa | MW at 36087.1kD


NOV27a, 
CG93076-01 Protein Sequence 


MNQTLNSSGTVESALNYSRGSTVHTAYLVLSSLAMFTCLCGMAGNSMVIWLLGFRMHR 
NPFCIYILNLAAADLLFLFSMASTLSLETQPLVNTTDKVHELMKRLMYFAYTVGLSLL 
TAISTQRCLSVLFPIWFKCHRPRHLSAWVCGLLWTLCLLMNGLTSSFCSKFLKFNEDR 
CFRVDMVQAALIMGVLTPVMTLSSLTLFVWVRRGSQQWRRQPTRLFWVLASVLVFLI
CSLPLSIYWFVLYWLSLPPEMQVLCFSLSRLSSSVSSSANPVIYFLVGSRRSHRLPTR
SLGTVLQQAT.REEPELEGGETPTVGTNEMGA 



Further analysis of the NOV27a protein yielded the following properties shown in 
Table 27B. 



Table 27B. Protein Sequence Properties NOV27a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 45 and 46 



A search of the NOV27a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 27C. 
Table 27C. Geneseq Results for NOV27a


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV27a 
Residues/ 
Match Residues 


Identities/
Similarities
for the 
Matched 
Region 


Expect 
Value 


ABB04658 


Human G protein-coupled receptor TGR7 
SEQ ID NO: 1 - Homo sapiens, 321 aa. 
[WO200183748-A1, 08-NOV-2001] 


1..321
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAU19292


Human G protein-coupled receptor 
nGPCR-74 - Homo sapiens, 321 aa. 
[WO200166750-A2, 13-SEP-2001] 


1..321 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAG64124 


Human G protein-coupled receptor 
GPRv51 - Homo sapiens, 321 aa. 
[WO200148188-A1, 05-JUL-2001]


1..321 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAU04366 


Human G-protein coupled receptor, 
hRUP12 - Homo sapiens, 321 aa. 
[WO200136471-A2, 25-MAY-2001] 


1..321
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 


AAE06768 


Human G-protein coupled receptor-18 
(GCREC-18) protein - Homo sapiens, 
321 aa. [WO200157085-A2, 09-AUG- 
2001] 


1..321 
1..321 


320/321 
(99%) 

320/321 
(99%) 


0.0 



In a BLAST search of public sequence databases, the NOV27a protein was found to
have homology to the proteins shown in the BLASTP data in Table 27D. 
Table 27D. Public BLASTP Results for NOV27a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV27a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q91ZB8 


G PROTEIN-COUPLED 
RECEPTOR - Mus musculus 
(Mouse), 321 aa. 


1..318 
1..319


189/321 (58%)
237/321 (72%) 


3e-98 


Q96LA9 


G PROTEIN-COUPLED 
RECEPTOR - Homo sapiens 
(Human), 322 aa. 


1..310 
1..305


134/317(42%) 
189/317 (59%) 


7e-54 


AAL86883 


G PROTEIN-COUPLED 
RECEPTOR SNSR6 - Homo 
sapiens (Human), 322 aa. 


1..310 
1..305 


133/317(41%) 
188/317 (58%) 


2e-53 


AAL86882 


G PROTEIN-COUPLED 
RECEPTOR SNSR5 - Homo 
sapiens (Human), 322 aa. 


1..310 
1..305 


134/317 (42%) 
188/317(59%) 


3e-53 


AAL86880 


G PROTEIN-COUPLED 
RECEPTOR SNSR3 - Homo 
sapiens (Human), 322 aa. 


1..309 
1..304 


129/314(41%) 
188/314 (59%) 


2e-52 



PFam analysis predicts that the NOV27a protein contains the domains shown in 
the Table 27E. 



Table 27E. Domain Analysis of NOV27a 


Pfam Domain 


NOV27a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_1


44..78 


17/37 (46%) 
33/37(89%) 


1.4e-09 


7tm_1


104..276 


39/205 (19%) 
119/205(58%) 


7.3e-09 



Example 28. 



The NOV28 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 28A. 
Table 28A. NOV28 Sequence Analysis




SEQ ID NO: 77 | 2856 bp 


NOV28a, 
CG94235-01 DNA Sequence 


TTCCCGACTGCACCCTGGCTCACTTCGCCCTAGGCGCCGACGCCCCCGGCGACGCAGA
CGCCCCCGACCCCCGCCTGGCGGCGCTGCTGGGGCCCCCGGAGCGCAGCTACTCGCTG




ACCAGCGCCTGCTGCACCAGCTGCGCCGCGGCCCCTTCCAGCGGTGCCAGCTGCTCAG 

CGCGACCCCCTGGATGACCCTGACACCCGGCAAGCGCTGCTCGAGCTGCTGGGCGCCT 
GTCAGGAGGCACCACGCCCGCACTTGGGCGAGTTCGAGGCCGACCCGCGCGGCCAGCT 
GTGGCAGCGCCTCTGGGAGGTGCAAGACGGCAGGCGGCTGCAGGTGGGCTGCGCACAG 
GTCGTGCCCGTCCCGGAGCCCCCGCTGCACCCGGTGGTGCCAGACTTGCCCAGTTCCG 
TGGTCTTCCCGGACCGGGAAGCCGCCCGGGCCGTTTTGGAGGAGTGTACCTCCTTTAT 
TCCTGAAGCCCGGGCAGTGCTTGACCTGGTCGACCAGTGCCCAAAACAGATCCAGAAA 
GGAAAGTTCCAGGTTGTTGCCATCGAAGGACTGGATGCCACGGGTGGTAAAACCACGG 
TGACCCAGTCAGTGGCAGATTCACTTAAGGCTGTCCTCTTAAAGTCACCACCCTCTTG 
CATTGGCCAGTGGAGGAAGATCTTTGATGATGAACCAACTATCATTAGAAGAGCTTTT 
TACTCTTTGGGCAATTATATTGTGGCCTCCGAAATAGCTAAAGAATCTGCCAAATCTC 
CTGTGATTGTAGACAGGCACAGCACGGCCACCTATGCCATAGCCACTGAGGTGAGTGG 
GGGTCTCCAGCACCTGCCCCCAGCCCATCACCCTGTGTACCAGTGGCCAGAGGACCTG 
CTCAAACCTGACCTTATCCTGCTGCTCACTGTGAGTCCTGAGGAGAGGTTGCAGAGGC 
TGCAGGGCCGGGGCATGGAGAAGACCAGGGAAGAAGCAGAACTTGAGGCCAACAGTGT 
GTTTCGTCAAAAGGTAGAAATGTCCTACCAGCGGATGGAGAATCCTGGCTGCCATGTG 
GTTGATGCCAGCCCCTCCAGAGAAAAGGTCCTGCAGACGGTATTAAGCCTAATCCAGA 
ATAGTTTTAGTGAACCGTAGTTACTCTGGCCAGGTGCCACGTCTAACTAGATTAGATG 
TTGTTTGAAACATCTACATCCACCATTTGTTATGCAGTGTTCCCAAATTTCTGTTCTA 
CARGCATGTTGTGTGGCAGAAAACTGGAGACCAGGCATCTTAATTTTACTTCAGCCAT 
CGTACCCTCTTCTGACTGATGGACCCGTCATCACAAAGGTCCCTCTCTYTCATGTTCCA 
GTGAGAGGCCAGCGATTGCTTTCTTCCTGGCATAGTAAACATTTTCTTGGAACATATG 
TTTCACTTAATCACTACCAAATATCTGGAAGACCTGTCTTACTCAGACAGCACCAGGT 

TCTCCTACACTGGCATGCTGATGAGATCGTGACATGCCCACATTGGCTTCTTCCACAT

CAAGTGTTCCTGCAGAGGCTGTCTATGTGTCCTGGCTGCCCAAGGACACTCCTGCAGA 

GAGCCCTTTTGATAGGCTTCTGAGTCATATATAAAGACATTCAAGCCAAGATGCTCCA 
ACTGCAAATATACCAACCTTCTCTGAATTATATTTTGCTTATTTATATTTCTTTTCTT 
TTTTTCTAAAGTATGGCTCTGAATAGAATGCACATTTTCCATTGAACTGGATGCATTT 
CATTTAGCCAATCCAGTAATTTATTTATATTAATCTATACATAATATGTTTCCTCAGC 
ATAGGAGCTATGATTCATTAATTAAAAGTGGAGTCAAAACGCTAAATGCAATGTTTGT 




GAGTGGGATTTCTTGGTAAATTATCTTGCACTTGAATGTCTCATGATTACATATGAAA 
TCGCTTTGACATATCTTTAGACAGAAAAAAGTAGCTGAGTGAGGGGGAAATTATAGAG 
CTGTGTGACTTTAGGGAGTAGGTTGAACCAGGTGATTACCTAAAATTCCTTCCAGTTC 
AAAGGCAGATAAATCTGTAAATTATTTTATCCTATCTACCATTTCTTAAGAAGACATT 
ACTCCAAAATAATTAAATTTAAGGCTTTATCAGGTCTGCATATAGAATCTTAAATTCT 
AATAAAGTTTCATGTTAATGTCATAGGATTTTTAAAAGAGCTATAGGTAATTTCTATA 
TAATATGTGTATATTA^iAATGTAATTGATTTCAGTTGAAAGTATTTTAAAGCTGATAA 
ATAGCATTAGGGTTCTTTGCAATGTGGTATCTAGCTGTATTATTGGTTTTATTTACTT 
TAAACATTTTGAAAAGCTTATACTGGCAGCCTAGAAAAACAAACAATTAATGTATCTT 
TATGTCCCTGGCACATGAATAAACTTTGCTGTGGTTTACTAATCTAAAAAAAAAAAAA 
AAAGGGCGGCCGCT 




ORF Start: ATG at 28 


ORF Stop: TAG at 1294 




SEQ ID NO: 78 


422 aa | MW at 46476.6kD


NOV28a, 
CG94235-01 Protein Sequence 


MAPPRRFVLELPDCTLAHFALGADAPGDADAPDPRLAALLGPPERSYSLCVPVTPDAG 
CGARVRAARLHQRLLHQLRRGPFQRCQLLRLLCYCPGGQAGGAQQGFLLRDPLDDPDT
RQALLELLGACQEAPRPHLGEFEADPRGQLWQRLWEVQDGRRLQVGCAQVVPVPEPPL 
HPVVPDLPSSVVFPDREAARAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQVVAIE
GLDATGGKTTVTQSVADSLKAVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIVA 
SEIAKESAKSPVIVDRHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLILLL
TVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDASPSREK
VLQTVLSLIQNSFSEP 
SEQ ID NO: 79 | 2331 bp


NOV28b, 
CG94235-02 DNA Sequence 



GTCTGCGCTGGGGCCATGGCTCCGCCGCGCCGCTTCGTCCTGGAGCTTCCTGACTGCA 
CCCTGGCTCACTTCGCCCTAGGCGCCGTTTTGGAGGAGTGTACCTCCTTTATTCCTGA 
AGCCCGGGCAGTGCTTGACCTGGTCGACCAGTGCCCAAAACAGATCCAGAAAGGAAAG 
TTCCAGGTTGTTGCCATCGAAGGACTGGATGCCACGGGTAAAACCACGGTGACCCAGT 
CAGCGGCAGATTCACTTAAGGCTGTCCTCTTAAAGTCACCACCCTCTTGCATTGGCCA 
GTGGAGGAAGATCTTTGATGATGAACCAACTATCATTAGAAGAGCTTTTTACTCTTTG 
GGCAATTATATTGTGGCCTCCGAAATAGCTAAAGAATCTGCCAAATCTCCTGTGATTG 
TAGACAGGTACTGGCACAGCACGGCCACCTATGCCATAGCCACTGAGGTGAGTGGGGG
TCTCCAGCACCTGCCCCCAGCCCATCACCCTGTGTACCAGTGGCCAGAGGACCTGCTC 
AAACCTGACCTTATCCTGCTGCTCACTGTGAGTCCTGAGGAGAGGTTGCAGAGGCTGC 
AGGGCCGGGGCATGGAGAAGACCAGGGAAGAAGCAGAACTTGAGGCCAACAGTGTGTT 
TCGTCAAAAGGTAGAAATGTCCTACCAGCGGATGGAGAATCCTGGCTGCCATGTGGTT 
GATGCCAGCCCCTCCAGAGAAAAGGTCCTGCAGACGGTATTAAGCCTAATCCAGAATA 
GTTTTAGTGAACCGTAGTTACTCTGGCCAGGTGCCACGTCTAACTAGATTAGATGTTG 
TTTGAAACATCTACATCCACCATTTGTTATGCAGTGTTCCCAAATTTCTGTTCTACAA 
GCATGTTGTGTGGCAGAAAACTGGAGACCAGGCATCTTAAGTTTACTTCAGCCATCGT
ACCCTCTTCTGACTGATGGACCCGTCATCACAAAGGTCCCTCTCATCATGTTCCAGTG 
AGAGGCCAGCGATTGCTTTCTTCCTGGCATAGTAAACATTTTCTTGGAACATATGTTT 
CACTTAATCACTACCAAATATCTGGAAGACCTGTCTTACTCAGACAGCACCAGGTGTA 
CAGAAGCAGCAGACAAGATCTTCCAGATCAGCAGGGAGACCCCGGAGCCTCTGCTTCT 
CCTACACTGGCATGCTGATGAGATCGTGACATGCCCACATTGGCTTCTTCCACATCTG 
GTTGCACTCGTCATGATGGGCTCGCTGCATCTCCCTCAGTCCCAAATTCTAGAGCCAA 
GTGTTCCTGCAGAGGCTGTCTATGTGTCCTGGCTGCCCAAGGACACTCCTGCAGAGCC 
ATTTTTGGGTAAGGAACACTTACAAAGAAGGCATTGATCTTGTGTCTGAGGCTCAGAG 
CCCTTTTGATAGGCTTCTGAGTCATATATAAAGACATTCAAGCCAAGATGCTCCAACT 
GCAAATATACCAACCTTCTCTGAATTATATTTTGCTTATTTATATTTCTTTTCTTTTT 
TTCTAAAGTATGGCTCTGAATAGAATGCACATTTTCCATTGAACTGGATGCATTTCAT 
TTAGCCAATCCAGTAATTTATTTATATTAATCTATACATAATATGTTTCCTCAGCATA
GGAGCTATGATTCATTAATTAAAAGTGGAGTCAAAACGCTAAATGCAATGTTTGTTGT 
GTATTTTCATTACACAAACTTA^TTTGTCTTGTTAAATAAGTACAGTGGATCTTGGAG 
TGGGATTTCTTGGTAAATTATCTTGCACTTGAATGTCTCATGATTACATATGAAATCG 
CTTTGACATATCTTTAGACAGAAAAAAGTAGCTGAGTGAGGGGGAAATTATAGAGCTG 
TGTGACTTTAGGGAGTAGGTTGAACCAGGTGATTACCTAAAATTCCTTCCAGTTCAAA 
GGCAGATAAATCTGTAAATTATTTTATCCTATCTACCATTTCTTAAGAAGACATTACT 
CCAAAATAATTAAATTTAAGGCTTTATCAGGTCTGCATATAGAATCTTAAATTCTAAT 
AAAGTTTCATGTTAATGTCATAGGATTTTTAAAAGAGCTATAGGTAATTTCTGTATAA 
TATGTGTATATTAAAATGTAATTGATTTCAGTTGAAAGTATTTTAAAGCTGATAAATA 

ACATTTTGAAAAGCTTATACTGGCAGCCTAGAAAAACAAACAATTAATGTATCTTTAT 
GTCCCTGGCACATGAATAAACTTTGCTGTGGTTTACTAAAAAAAAAAAAAAAAAAAAA 
GGGCGGCCGCT 




ORF Start: ATG at 16 


ORF Stop: TAG at 769 




SEQ ID NO: 80 


251 aa | MW at 27980.8kD


NOV28b, 
CG94235-02 Protein Sequence 


MAPPRRFVLELPDCTLAHFALGAVLEECTSFIPEARAVLDLVDQCPKQIQKGKFQVVA
IEGLDATGKTTVTQSAADSLKAVLLKSPPSCIGQWRKIFDDEPTIIRRAFYSLGNYIV 
ASEIAKESAKSPVIVDRYWHSTATYAIATEVSGGLQHLPPAHHPVYQWPEDLLKPDLI
LLLTVSPEERLQRLQGRGMEKTREEAELEANSVFRQKVEMSYQRMENPGCHVVDASPS
REKVLQTVLSLIQNSFSEP



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 28B. 



Table 28B. Comparison of NOV28a against NOV28b. 


Protein Sequence


NOV28a Residues/
Match Residues


Identities/
Similarities for the Matched Region


NOV28b


192..422
20..251


217/233 (93%)
217/233 (93%)
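The residue ranges and the identity fraction in Table 28B can be related to one another. The sketch below assumes the usual BLAST convention that the denominator of the identity fraction is the number of alignment columns, gaps included; the text does not state this convention. The function names `span` and `gap_columns` are hypothetical.

```python
def span(residue_range: str) -> int:
    """Number of residues covered by a range written as 'start..end'."""
    start, end = (int(part) for part in residue_range.split(".."))
    return end - start + 1


def gap_columns(query_range: str, match_range: str, columns: int) -> tuple[int, int]:
    """Gap columns implied in the query row and the match row of an alignment."""
    return columns - span(query_range), columns - span(match_range)


# Table 28B: NOV28a residues 192..422 aligned to NOV28b residues 20..251
# over 233 alignment columns.
print(span("192..422"), span("20..251"))
print(gap_columns("192..422", "20..251", 233))
```

Under this assumption the 231 residues of NOV28a and the 232 residues of NOV28b are spread over 233 columns, which implies two gap positions in the NOV28a row and one in the NOV28b row.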
Further analysis of the NOV28a protein yielded the following properties shown in
Table 28C. 



Table 28C. Protein Sequence Properties NOV28a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1939 probability located in lysosome (lumen); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV28a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 28D. 



Table 28D. Geneseq Results for NOV28a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV28a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB57297 


Mouse ischaemic condition related 
protein sequence SEQ ID NO:834 - Mus 
musculus, 431 aa. [WO200188188-A2,
22-NOV-2001] 


199..384
157..344


131/193 (67%) 
150/193 (76%) 


3e-64 


AAU34536 


E. coli cellular proliferation protein #117 
- Escherichia coli, 213 aa. 
[WO200170955-A2, 27-SEP-2001] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 


AAB72201 


E. coli thymidylate kinase amino acid 
sequence - Escherichia coli, 213 aa. 
[WO200111025-A2, 15-FEB-2001] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 


AAU34536 


E. coli cellular proliferation protein #117
- Escherichia coli, 213 aa. 
[WO200170955-A2, 27-SEP-2001] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 


AAY28786 


E. coli thymidylate kinase-1 - Escherichia
coli, 213 aa. [WO9941404-A2, 19-AUG- 
1999] 


229..421 
6..210 


51/212(24%) 
88/212(41%) 


0.054 



In a BLAST search of public sequence databases, the NOV28a protein was found to
have homology to the proteins shown in the BLASTP data in Table 28E. 
Table 28E. Public BLASTP Results for NOV28a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV28a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q9DC34 


1200004E04RIK PROTEIN -
Mus musculus (Mouse), 395 
aa. 


42..418 
16..393 


292/379 (77%) 
328/379 (86%) 


e-168 


Q62316 


THYMIDYLATE KINASE 

HOMOLOGUE - Mus 
musculus (Mouse), 431 aa. 


199..384 
157..344 


131/193 (67%) 
150/193 (76%) 


7e-64 


Q96AL8 


HYPOTHETICAL 6.7 KDA 
PROTEIN - Homo sapiens 
(Human), 58 aa (fragment). 


365..422
1..58 


58/58 (100%) 
58/58 (100%) 


3e-25 


O60970


TKRP1 - Leishmania major, 
274 aa. 


199..418 
44..254 


73/220 (33%) 
108/220 (48%) 


9e-22 


Q9CKE9 


Thymidylate kinase (EC 
2.7.4.9) (dTMP kinase) - 
Pasteurella multocida, 209 aa. 


224..416 
4..203 


59/210 (28%) 
87/210 (41%) 


4e-04 



PFam analysis predicts that the NOV28a protein contains the domains shown in 
the Table 28F. 



Table 28F. Domain Analysis of NOV28a 


Pfam Domain 


NOV28a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Thymidylate_kin 


231..411 


53/208 (25%) 
131/208(63%) 


4.9e-08 



Example 29. 



The NOV29 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 29A.
Table 29A. NOV29 Sequence Analysis




SEQ ID NO: 81 


867 bp 


NOV29a, 
CG94692-01 DNA Sequence 


ATGCCGGTGGAGGAATTTGTGGCTGGCTGGATCTCTGGCGCTCTGGGCTTGGTCCTGG 
GACACCCGTTTGACACTGTAAAGGTGAGGCTGCAGACCCAGACCACCTACCGGGGCAT 
CGTTGATTGCATGGTCAAGATTTACCGCCATGAGTCCCTCCTGGGCTTCTTCAAGGGA 




CAGCTACATGCACATCTTCCTAGCGGGCTGCACCGGGGGGTTCCTGCAGGCCTACTGT 
CTGGCTCCTTTTGACCTCATCAAAGTCCGGCTACAAAACCAGACAGAGCCAAGGGCCC 
AGCCAGGGAGCCCCCCACCCCGGTACCAGGGGCCCGTGCACTGTGCAGCCTCCATCTT 
CCGGGAGGAGGGGCCCCGGGGGCTGTTCCGAGGAGCCTGGGCCCTGACGCTGAGGGAC 




CAGAAGGCCAGAATCCCAGCTCAGCCACGGTGCTGGTGGCAGGGGGCTTTGCAGGCAT 
TGCTTCCTGGGTGGCAGCCACGCCCTTAGACGTGATCAAGTCCCGGATGCAGATGGAT 
GGACTGAGACGCAGAGTGTACCAGGGGATGCTGGACTGCATGGTGAGCAGCATCCGGC 
AGGAAGGACTGGGAGTCTTCTTCCGGGGGGTCACCATCAACAGTGCCCGCGCCTTTCC 




ORF Start: ATG at 1 


ORF Stop: TGA at 865 




SEQ ID NO: 82 


288 aa | MW at 31873.6kD


NOV29a, 
CG94692-01 Protein Sequence 


MPVEEFVAGWISGALGLVLGHPFDTVKVRLQTQTTYRGIVDCMVKIYRHESLLGFFKG
MSFPIASIAVVNSVLFGVYSNTLLVLTATSHQERRAQPPSYMHIFLAGCTGGFLQAYC
LAPFDLIKVRLQNQTEPRAQPGSPPPRYQGPVHCAASIFREEGPRGLFRGAWALTLRD
TPTVGIYFITYEGLCRQYTPEGQNPSSATVLVAGGFAGIASWVAATPLDVIKSRMQMD
GLRRRVYQGMLDCMVSSIRQEGLGVFFRGVTINSARAFPVNAVTFLSYEYLLRWWG




SEQ ID NO: 83 | 867 bp


NOV29b, 
CG94692-02 DNA Sequence 


ATGCCGGTGGAGGAATTTGTGGCTGGCTGGATCTCTGGCGCTCTGGGCTTGGTCCTGG 
GACACCCGTTTGACACTGTAAAGGTGAGGCTGCAGACCCAGACCACCTACCGGGGCAT 
CGTTGATTGCATGGTCAAGATTTACCGCCATGAGTCCCTCCTGGGCTTCTTCAAGGGA
ATGAGCTTCCCCATTGCCAGCATAGCTGTGGTCAACTCTGTCCTGTTTGGGGTCTATA 
GCAACACCCTGCTGGTGCTCACGGCCACCTCCCACCAGGAGCGGCGGGCCCAGCCGCC 
CAGCTACATGCACATCTTCCTAGCGGGCTGCACCGGGGGGTTCCTGCAGGCCTACTGT 
CTGGCTCCTTTTGACCTCATCAAAGTCCGGCTACAAAACCAGACAGAGCCAAGGGCCC 
AGCCAGGGAGCCCCCCACCCCGGTACCAGGGGCCCGTGCACTGTGCAGCCTCCATCTT 
CCGGGAGGAGGGGCCCCGGGGGCTGTTCCGAGGAGCCTGGGCCCTGACGCTGAGGGAC 
ACCCCCACGGTGGGGATCTACTTCATCACCTATGAAGGGCTCTGTCGCCAGTACACAC 
CAGAAGGCCAGAATCCCAGCTCAGCCACGGTGCTGGTGGCAGGGGGCTTTGCAGGCAT 
TGCTTCCTGGGTGGCAGCCACGCCCTTAGACGTGATCAAGTCCCGGATGCAGATGGAT 
GGACTGAGACGCAGAGTGTACCAGGGGATGCTGGACTGCATGGTGAGCAGCATCCGGC 
AGGAAGGACTGGGAGTCTTCTTCCGGGGGGTCACCATCAACAGTGCCCGCGCCTTTCC 
CGTCAATGCTGTCACCTTCCTCAGCTACGAATATCTCCTCCGCTGGTGGGGATGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 865




SEQ ID NO: 84 


288 aa | MW at 31873.6kD


NOV29b, 
CG94692-02 Protein Sequence 


MPVEEFVAGWISGALGLVLGHPFDTVKVRLQTQTTYRGIVDCMVKIYRHESLLGFFKG 
MSFPIASIAVVNSVLFGVYSNTLLVLTATSHQERRAQPPSYMHIFLAGCTGGFLQAYC
LAPFDLIKVRLQNQTEPRAQPGSPPPRYQGPVHCAASIFREEGPRGLFRGAWALTLRD
TPTVGIYFITYEGLCRQYTPEGQNPSSATVLVAGGFAGIASWVAATPLDVIKSRMQMD
GLRRRVYQGMLDCMVSSIRQEGLGVFFRGVTINSARAFPVNAVTFLSYEYLLRWWG
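The ORF coordinates and protein lengths listed with each sequence are related by simple arithmetic: an ORF that starts at position 1 and has its stop codon at position 865 spans (865 - 1) / 3 = 288 codons, which matches the 288 aa reported for NOV29b. The following Python is a minimal sketch of that check and is not part of the original disclosure. It assumes the standard genetic code; the helper names `orf_length_aa` and `translate_orf` are hypothetical, and the fragment translated is only the first line of the NOV29b DNA sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the sequences in the tables use the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def orf_length_aa(orf_start, orf_stop):
    """Number of residues encoded between a 1-based ORF start and the
    1-based position of the first base of the stop codon."""
    return (orf_stop - orf_start) // 3


def translate_orf(dna, orf_start):
    """Translate from the 1-based orf_start until the first stop codon.

    Returns (protein, stop_position); stop_position is None when the
    fragment ends before a stop codon is reached.
    """
    dna = dna.upper()
    protein = []
    for i in range(orf_start - 1, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            return "".join(protein), i + 1
        protein.append(aa)
    return "".join(protein), None


# First line of the NOV29b DNA sequence (58 bases, so 19 complete codons).
fragment = "ATGCCGGTGGAGGAATTTGTGGCTGGCTGGATCTCTGGCGCTCTGGGCTTGGTCCTGG"
protein, stop = translate_orf(fragment, 1)
print(protein)                 # MPVEEFVAGWISGALGLVL
print(orf_length_aa(1, 865))   # 288
```

The same arithmetic can be applied to any of the ORF Start/ORF Stop pairs in these tables to cross-check the stated protein length.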



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 29B. 



Table 29B. Comparison of NOV29a against NOV29b.

Protein Sequence | NOV29a Residues/Match Residues | Identities/Similarities for the Matched Region
NOV29b | 1..288 / 1..288 | 288/288 (100%) / 288/288 (100%)
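The identity figures in Table 29B have the form matches/aligned length (percent). As an illustrative sketch only, the Python below computes such a figure for two sequences that are already aligned to the same length. The function name `identity_summary` is hypothetical, and truncating the percentage to an integer is an assumption rather than something the text states. The similarity figures also depend on a substitution matrix that is not given here, so they are not computed.

```python
def identity_summary(seq_a, seq_b):
    """Return 'matches/length (pct%)' for two equal-length, pre-aligned
    sequences; gap characters ('-') never count as matches."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != "-")
    pct = 100 * matches // len(seq_a)  # integer truncation (assumption)
    return f"{matches}/{len(seq_a)} ({pct}%)"


# Toy example using the first ten residues of NOV29a.
print(identity_summary("MPVEEFVAGW", "MPVEEFVAGW"))  # 10/10 (100%)
print(identity_summary("MPVEEFVAGW", "MPVEDFVAGW"))  # 9/10 (90%)
```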



Further analysis of the NOV29a protein yielded the following properties shown in 
Table 29C. 



Table 29C. Protein Sequence Properties NOV29a

PSort analysis: | 0.7900 probability located in plasma membrane; 0.6400 probability located in microbody (peroxisome); 0.3000 probability located in Golgi body; 0.2000 probability located in endoplasmic reticulum (membrane)
SignalP analysis: | No Known Signal Sequence Predicted
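The PSort entry in Table 29C lists candidate locations with associated probabilities. As a hedged sketch, the Python below parses a line in that format and ranks the locations; the function name `parse_psort` is hypothetical, and the regular expression assumes the exact "probability located in" wording used in the table.

```python
import re


def parse_psort(text):
    """Extract (probability, location) pairs from a PSort summary line."""
    pattern = re.compile(r"(\d\.\d+) probability located in ([^;]+)")
    return [(float(p), loc.strip()) for p, loc in pattern.findall(text)]


psort_line = (
    "0.7900 probability located in plasma membrane; "
    "0.6400 probability located in microbody (peroxisome); "
    "0.3000 probability located in Golgi body; "
    "0.2000 probability located in endoplasmic reticulum (membrane)"
)

# Sort so that the most probable location comes first.
ranked = sorted(parse_psort(psort_line), reverse=True)
print(ranked[0])  # (0.79, 'plasma membrane')
```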
A search of the NOV29a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 29D.



Table 29D. Geneseq Results for NOV29a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV29a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAE10330 | Human transporter and ion channel-7 (TRICH-7) protein - [WO200162923-A2, 30-AUG-2001] | 1..288 / 1..288 | 288/288 (100%) / 288/288 (100%) | e-170
ABG27643 | Novel human diagnostic protein #27634 - Homo sapiens, 346 aa. [WO200175067-A2, 11-OCT-2001] | 1..288 / 59..346 | 286/288 (99%) / 286/288 (99%) | e-168
ABG27643 | Novel human diagnostic protein #27634 - Homo sapiens, 346 aa. [WO200175067-A2, 11-OCT-2001] | 1..288 / 59..346 | 286/288 (99%) / 286/288 (99%) | e-168
AAE16774 | Human transporter and ion channel-11 (TRICH-11) protein - Homo sapiens, 181 aa. [WO200192304-A2, 06-DEC-2001] | 113..288 / 6..181 | 176/176 (100%) / 176/176 (100%) | e-101
AAM39422 | Human polypeptide SEQ ID NO 2567 - Homo sapiens, 311 aa. [WO200153312-A1, 26-JUL-2001] | 5..285 / 6..300 | 144/295 (48%) / 183/295 (61%) | 4e-73
In a BLAST search of public sequence databases, the NOV29a protein was found to
have homology to the proteins shown in the BLASTP data in Table 29E.



Table 29E. Public BLASTP Results for NOV29a

Protein Accession Number | Protein/Organism/Length | NOV29a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q8VBZ7 | HYPOTHETICAL 32.4 KDA PROTEIN - Mus musculus (Mouse), 294 aa (fragment). | 1..287 / 7..293 | 246/287 (85%) / 267/287 (92%) | e-147
CAC24997 | SEQUENCE 41 FROM PATENT WO0100806 PRECURSOR - Homo sapiens (Human), 308 aa. | 5..285 / 2..301 | 128/301 (42%) / 187/301 (61%) | 1e-66
Q922X4 | SIMILAR TO CG4995 GENE PRODUCT - Mus musculus (Mouse), 306 aa. | 5..286 / 4..275 | 117/288 (40%) / 166/288 (57%) | 1e-53
Q9VKZ5 | CG4995 PROTEIN (GH13054P) - Drosophila melanogaster (Fruit fly), 399 aa. | 3..284 / 41..309 | 112/287 (39%) / 155/287 (53%) | 7e-51
Q9AX03 | PUTATIVE CARNITINE/ACYLCARNITINE TRANSLOCASE - Oryza sativa (Rice), 322 aa. | 4..286 / 14..307 | 119/300 (39%) / 166/300 (54%) | 3e-50



PFam analysis predicts that the NOV29a protein contains the domains shown in
Table 29F.



Table 29F. Domain Analysis of NOV29a

Pfam Domain | NOV29a Match Region | Identities/Similarities for the Matched Region | Expect Value
mito_carr | 1..90 | 31/125 (25%) / 72/125 (58%) | 5.1e-22
mito_carr | 98..198 | 34/126 (27%) / 84/126 (67%) | 7.1e-29
mito_carr | 200..288 | 33/125 (26%) / 75/125 (60%) | 7.1e-24

Example 30.

The NOV30 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 30A. 



Table 30A. NOV30 Sequence Analysis 




SEQ ID NO: 85 


947 bp 


NOV30a, 
CG94724-01 DNA Sequence


ACTGACCATGGCCGACCAGCCAAAGCCCATCAGCCTGCTCAAGAACCTGCTGGCCAGT
GGCTTTGGTGGCATGTGCCTGGTGTTCATGGGGCACCCTCTGGACACTGTCAAGGTCC 
GACTGCAGACACAGCCACCGAGTTTGCCTAGACAGCCTCCCATGTACTCTGGGACCTT 
TGACTCTCTCCCAAAGACTCTTAGAAGGGACATCACAGGGCTATATAAGGGAATGGCT 
GCCCCTATCATCGGTGTCACCCCCATCTTTGCTGTGTGCTTCTTTGGGTTTGGTTTGA 
GGAAGAAACTAGGACAGAAACACCCAGAAGATGTGCTCAGCTATCCCCAGCTGTTTGC 
AGCTGGGATGTTATCTGGTACATTCACCACAGGCATCGCGACCCCTGGAGAACCCATC 

ACTGTGCAAAGAAGTTGTACCAGGAGTTTCAGATCCGAGGCATCTACAAAGGGACCGT 
GCTTACCCTTATGAGAGACGTTCCAGCTAGCGGAACGTATTTCATGACAAATGAATGG 
CTAAAAAATATCTTCACTCCGGAAGGAAAGAGGGTCTGTGAGCTCAGTGTGCCTAGAA 
TCCTGGTGGCTGGGTGCATTGCAGGGATCTTCAACTGGGCAATGGCAGTCCCGCAAGA 
TGTGCTCAAGTATCCCTTCCAGACGGCACCTCCTGGGAAATATCCTAACAGTTTTGGA 
GATGTGCTGAGGGAGCTGATCTGGGATGAAGGAATCACATCCTTGTCTAAAGGGTCTG 
ATGCAGTGATGACCCGAGCCTTCCCAGCCAATGCAGCCTGTTTCCTTGGCCTTGAAGT 
TGCCATGAAGTTTCTTAATTGGGCCACCCCCAACTTGTGAGGCAGAAGGCTGCTCAAG 
ACTTCTGGATGCTGGAACT 




ORF Start: ATG at 8 


ORF Stop: TGA at 908 




SEQ ID NO: 86 


300 aa | MW at 32855.1kD


NOV30a,
CG94724-01 Protein Sequence 


MADQPKPISLLKNLLASGFGGMCLVFMGHPLDTVKVRLQTQPPSLPRQPPMYSGTFDS 
LPKTLRRDITGLYKGMAAPIIGVTPIFAVCFFGFGLRKKLGQKHPEDVLSYPQLFAAG 
MLSGTFTTGIATPGEPIKSLLHFQPSSGETKYTGTLDCAKKLYQEFQIRGIYKGTVLT
LMRDVPASGTYFMTNEWLKNIFTPEGKRVCELSVPRILVAGCIAGIFNWAMAVPQDVL 
KYPFQTAPPGKYPNSFGDVLRELIWDEGITSLSKGSDAVMTRAFPANAACFLGLEVAM 
KFLNWATPNL 



Further analysis of the NOV30a protein yielded the following properties shown in 
Table 30B. 



Table 30B. Protein Sequence Properties NOV30a

PSort analysis: | 0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.1000 probability located in mitochondrial inner membrane
SignalP analysis: | No Known Signal Sequence Predicted



A search of the NOV30a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 30C.

Table 30C. Geneseq Results for NOV30a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV30a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAM79434 | Human protein SEQ ID NO 3080 - Homo sapiens, 318 aa. [WO200157190-A2, 09-AUG-2001] | 1..300 / 18..318 | 258/301 (85%) / 268/301 (88%) | e-146
AAM78450 | Human protein SEQ ID NO 1112 - Homo sapiens, 318 aa. [WO200157190-A2, 09-AUG-2001] | 1..300 / 18..318 | 258/301 (85%) / 268/301 (88%) | e-146
AAY25740 | Human secreted protein encoded from gene 30 - Homo sapiens, 228 aa. [WO9938881-A1, 05-AUG-1999] | 74..300 / 1..227 | 196/227 (86%) / 202/227 (88%) | e-110
ABB59582 | Drosophila melanogaster polypeptide SEQ ID NO 5538 - Drosophila melanogaster, 306 aa. [WO200171042-A2, 27-SEP-2001] | 11..299 / 16..305 | 137/291 (47%) / 190/291 (65%) | 1e-73
ABB59928 | Drosophila melanogaster polypeptide SEQ ID NO 6576 - Drosophila melanogaster, 299 aa. [WO200171042-A2, 27-SEP-2001] | 11..293 / 15..297 | 121/285 (42%) / 172/285 (59%) | 7e-57



In a BLAST search of public sequence databases, the NOV30a protein was found to
have homology to the proteins shown in the BLASTP data in Table 30D.

Table 30D. Public BLASTP Results for NOV30a

Protein Accession Number | Protein/Organism/Length | NOV30a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
O43772 | Mitochondrial carnitine/acylcarnitine carrier protein (Carnitine/acylcarnitine translocase) (CAC) - Homo sapiens (Human), 301 aa. | 1..300 / 1..301 | 258/301 (85%) / 268/301 (88%) | e-146
Q9Z2Z6 | MCAC PROTEIN - Mus musculus (Mouse), 301 aa. | 1..300 / 1..301 | 243/301 (80%) / 257/301 (84%) | e-138
P97521 | Mitochondrial carnitine/acylcarnitine carrier protein (Carnitine/acylcarnitine translocase) (CAC) - Rattus norvegicus (Rat), 301 aa. | 1..300 / 1..301 | 233/301 (77%) / 254/301 (83%) | e-134
Q9VQG4 | COLT PROTEIN (GM13207P) - Drosophila melanogaster (Fruit fly), 306 aa. | 11..299 / 16..305 | 137/291 (47%) / 190/291 (65%) | 3e-73
O01396 | COLT PROTEIN (CONGESTED-LIKE TRACHEA PROTEIN) - Drosophila melanogaster (Fruit fly), 306 aa. | 11..299 / 16..305 | 136/291 (46%) / 188/291 (63%) | 4e-72



PFam analysis predicts that the NOV30a protein contains the domains shown in
Table 30E.



Table 30E. Domain Analysis of NOV30a

Pfam Domain | NOV30a Match Region | Identities/Similarities for the Matched Region | Expect Value
mito_carr | 9..104 | 37/125 (30%) / 79/125 (63%) | 3.7e-23
mito_carr | 108..202 | 34/125 (27%) / 76/125 (61%) | 2.4e-19
mito_carr | 207..299 | 28/125 (22%) / 70/125 (56%) | 1.6e-11



Example 31. 



The NOV31 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 31A.

Table 31A. NOV31 Sequence Analysis

SEQ ID NO: 87 | 1171 bp

NOV31a,
CG94871-01 DNA Sequence


GTTGGCTCCAGACAAATAAACATGGAGTCCATCTTCCACGAGAAACAAGAAGGCTCAC
TTTGTGCTCAACATTGCCTGAATAACTTATTGCAAGGAGAATATTTTAGCCCTGTGGA
ATTATCCTCAATTGCACATCAGCTGGATGAGGAGGAGAAGATGAGAATGGCAGAAGGA 
GGAGTTACTAGTGAAGATTATCGCACGTTTTTACAGCAGCCTTCTGGAAATATGGATG 
ACAGTGGTTTTTTCTCTATTCAGGTTATAAGCAATGCCTTGAAAGTTTGGGGTTTAGA 
ACTAATCCTGTTCAACAGTCCAGAGTATCAGAGGCTCAGGATCGATCCTATAAATGAA 
AGATCATTTATATGCAATTATAAGGAACACTGGTTTACAGTTAGAAAATTAGGAAAAC 
AGTGGTTTAACTTGAATTCTCTCTTGACGGGTCCAGAATTAATATCAGATACATATCT 
TGCACTTTTCTTGGCTCAATTACAACAGGAAGGTTATTCTATATTTGTTGTTAAGGGT 
GATCTGCCAGATTGCGAAGCTGACCAACTCCTGCAGATGATTAGGGTCCAACAGATGC 
ATCGACCAAAACTTATTGGAGAAGAATTAGCACAACTAAAAGAGCAAAGAGTCCATAA 
AACAGACCTGGAACGAGTGTTAGAAGCAAATGATGGCTCAGGAATGTTAGACGAAGAT 
GAGGAGGATTTGCAGAGGGCTCTGGCACTAAGTCGCCAAGAAATTGACATGGAAGATG 
AGGAAGCAGATCTCCGCAGGGCTATTCAGCTAAGTATGCAAGGTAGTTCCAGAAACAT 
ATCTCAAGATATGACACAGACATCAGGTACAAATCTTACTTCAGAAGAGCTTCGGAAG 
AGACGAGAAGCCTACTTTGAAAAACAGCAGCAGCAGCAGGGGGACCTATCAGGACAGA 
GTTCACATCCATGTGAAAGGCCAGCCACCAGTTCAGGAGCACTTGGGAGTGATCTAGG 
TGATGCTATGAGTGAAGAAGACATGCTTCAGGCAGCTGTGACCATCTCTTTAGAAACT 
GTCAGAAATGATTTGAAAACAGAAGGAAAAAAATAATACCTTTAAAAAATAATTTAGA 
TATTCATACTTTCCAACATTATCCTGTGTGATTACAGCATAGGGTCCACTTTGGTAAT
GTGTCAAAGAG 




ORF Start: ATG at 22 


ORF Stop: TAA at 1078




SEQ ID NO: 88 


352 aa | MW at 40068.4kD


NOV31a, 
CG94871-01 Protein Sequence 


MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEEKMRMAEGGVTSEDY 
RTFLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNY
KEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEA
DQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRA
LALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFE
KQQQQQGDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLQAAVTMSLETVRNDLKT
EGKK




SEQ ID NO: 89 | 1219 bp 


NOV31b, 
CG94871-02 DNA Sequence 


GTTGGCTCCAGACAAATAAACATGGAGTCCATCTTCCACGAGAAACAAGAAGGCTCAC
TTTGTGCTCAACATTGCCTGAATAACTTATTGCAAGGAGAATATTTTAGCCCTGTGGA 
ATTATCCTCAATTGCACATCAGCTGGATGAGGAGGAGAGGATGAGAATGGCAGAAGGA 
GGAGTTACTAGTGAAGATTATCGCACGTTTTTACAGCAGCCTTCTGGAAATATGGATG 
ACTGTGGTTTTTTCTCTATTCAGGTTATAAGCAATGCCTTGAAAGTTTGGGGTTTAGA 
ACTAATCCTGTTCAACAGTCCAGAGTATCAGAGGCTCAGGATCGATCCTATAAATGAA 
AGATCATTTATATGCAATTATAAGGAACACTGGTTTACAGTTAGAAAATTAGGAAAAC 
AGTGGTTTAACTTGAATTCTCTCTTGACGGGTCCAGAATTAATATCAGATACATATCT 
TGCACTTTTCTTGGCTCAATTACAACAGGAAGGTTATTCTATATTTGTTGTTAAGGGT 
GATCTGCCAGATTGCGAAGCTGACCAACTCCTGCAGATGATTAGGGTCCAACAGATGC 
ATCGACCAAAACTTATTGGAGAAGAATTAGCACAACTAAAAGAGCAAAGAGTCCATAA
AACCGACCTGGAACGAATGTTAGAAGCAAATGATGGCTCAGGAATGTTAGACGAAGAT 
GAGGAGGATTTGCAGAGGGCTCTGGCACTAAGTCGCCAAGAAATTGACATGGAAGATG 
AGGAAGCAGATCTCCGCAGGGCTATTCAGCTAAGTATGCAAGGTAGTTCCAGAAACAT 
ATCTCAAGATATGACACAGACATCAGGTACAAATCTTACTTCAGAAGAGCTTCGGAAG 
AGACGAGAAGCCTACTTTGAAAAACAGCAGCAAAAGCAGCAACAGCAGCAGCAGCAGC 
AGCAGCAGCAGCAGCAGCAGCAGCAGCAGCGGGACCTATCAGGACAGAGTTCACATCC 
ATGTGAAAGGCCAGCCACCAGTTCAGGAGCACTTGGGAGTGATCTAGGTGATGCTATG
AGTGAAGAAGACATGCTTCAGGCAGCTGTGACCATGTCTTTAGAAACTGTCAGAAATG 
ATTTGAAAACAGAAGGAAAAAAATAATACCTTTAAAAAATAATTTAGATATTCATACT 
TTCCAACATTATCCTGTGTGATTACAGCATAGGGTCCACTTTGGTAATGTGTCAAAGA




ORF Start: ATG at 22 


ORF Stop: TAA at 1126 




SEQ ID NO: 90 


368 aa | MW at 42277.8kD


NOV31b, 
CG94871-02 Protein Sequence 

MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDY
RTFLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNY
KEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEA
DQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRA
LALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFE
KQQQKQQQQQQQQQQQQQQQQQRDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLQ
AAVTMSLETVRNDLKTEGKK

SEQ ID NO: 91 | 1153 bp

NOV31c,
CG94871-04 DNA Sequence


GTTGGCTCCAGACAAATAAACATGGAGTCCATCTTCCACGAGAAACAAGAAGGCTCAC 
TTTGTGCTCAACATTGCCTGAATAACTTATTGCAAGGAGAATATTTTAGCCCTGTGGA 
ATTATCCTCAATTGCACATCAGCTGGATGAGGAGGAGAGGATGAGAATGGCAGAAGGA 
GGAGTTACTAGTGAAGATTATCGCACGTTTTTACAGGTTATAAGCAATGCCTTGAAAG 
TTTGGGGTTTAGAACTATTCCTGTTCAACAGTCCAGAGTATCAGAGGCTCAGGATCGA 
TCCTATAAATGAAAGATCATTTATATGCAATTATAAGGAACACTGGTTTACAGTTAGA 
AAATTAGGAAAACAGTGGTTTAACTTGAATTCTCTCTTGACGGGTCCAGAATTAATAT 
CAGATAGATATCTTGCACTTTTCTTGGCTCAATTACAACAGGAAGGTTATTCTATATT 
TGTCGTTAAGGGTGATCTGCCAGATTGCGAAGCTGACCAACTCCTGCAGATGATTAGG 
GTCCAACAGATGCATCGACCAAAACTTATTGGAGAAGAATTAGCACAACTAAAAGAGC 
AAAGAGTCCATAAAACAGACCTGGAACGAGTGTTAGAAGCAAATGATGGCTCAGGAAT 
GTTAGACGAAGATGAGGAGGATTTGCAGAGGGCTCTGGCACTAAGTCGCCATGAAATT 
GACATGGAAGATGAGGAAGCAGATCTCCGCAGGGCTATTCAGCTAAGTATGCAAGGTA 
GTTCCAGAAACATATCTCAAGATATGACACAGACATCAGGTACAAATCTTACTTCAGA 
AGAGCTTCGGAAGAGACGAGAAGCCTACTTTGAAAAACAGCAGCAAAAGCAGCAACAG 
CAGCAGCAGCAGCAGCAGCAGGGGGACCTATCAGGACAGAGTTCACATCCATGTGAAA 
GGCCAGCCACCAGTTCAGGAGCACTTGGGAGTGATCTAGGTGATGCTATGAGTGAAGA 
AGACATGCTTCGGGCAGCTGTGACCATGTCTTTAGAAACTGTCAGAAATGATTTGAAA 
ACAGAAGGAAAAAAATAATACCTTTAAAAAATAATTTAGATATTCATACTTTCCAACA 
TTATCCTGTGTGATTACAGCATAGGGTCCACTTTGGTAATGTGTCAAAGAG 




ORF Start: ATG at 22 


ORF Stop: TAA at 1060 




SEQ ID NO: 92 


346 aa | MW at 39764.1kD


NOV31c, 
CG94871-04 Protein Sequence 


MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDY 
RTFLQVISNALKVWGLELFLFNSPEYQRLRIDPINERSFICNYKEHWFTVRKLGKQWF
NLNSLLTGPELISDRYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQMIRVQQMHRP
KLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRALALSRHEIDMEDEEA 
DLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQQQQQQ 
GDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLRAAVTMSLETVRNDLKTEGKK 




SEQ ID NO: 93 | 1000 bp


NOV31d,
CG94871-05 DNA Sequence 


GTTGGCTCCAGACAAATAAACATGGAGTCCATCTTCCACGAGAAACAAGAAGGCTCAC 
TTTGTGCTCAACATTGCCTGAATAACTTATTGCAAGGAGAATATTTTAGCCCTGTGGA 
ATTATCCTCAATTGCACATCAGCTGGATGAGGAGGAGAGGATGAGAATGGCAGAAGGA
GGAGTTACTAGTGAAGATTATCGCACGTTTTTACAGTGGTTTAACTTGAATTCTCTCT 
TGACGGGTCCAGAATTAATATCAGATACATATCTTGCACTTTTCTTGGCTCAATTACA 
ACAGGAAGGTTATTCTATATTTGTCGTTAAGGGTGATCTGCCAGATTGCGAAGCTGAC 
CAACTCCTGCAGATGATTAGGGTCCAACAGATGCATCGACCAAAACTTATTGGAGAAG 
AATTAGCACAACTAAAAGAGCAAAGAGTCCATAAAACAGACCTGGAACGAGTGTTAGA 
AGCAAATGATGGCTCAGGAATGTTAGACGAAGATGAGGAGGATTTGCAGAGGGCTCTG 
GCACTAAGTCGCCAAGAAATTGACATGGAAGATGAGGAAGCAGATCTCCGCGGGGCTA 
TTCAGCTAAGTATGCAAGGTAGTTCCAGAAACATATCTCAAGATATGACACAGACATC 
AGGTACAAATCTTACTTCAGAAGAGCTTCGGAAGAGACGAGAAGCCTACTTTGAAAAA 
CAGCAGCAAAAGCAGCAACAGCAGCAGCAGCAGCAGCAGCAGGGGGACCTATCAGGAC 
AGAGTTCACATCCATGTGAAAGGCCAGCCACCAGTTCAGGAGCACTTGGGAGTGATCT 
AGGTGATGCTATGAGTGAAGAAGACATGCTTCGGGCAGCTGTGACCATGTCTTTAGAA 
ACTGTCAGAAATGATTTGAAAACAGAAGGAAAAAAATAATACCTTTAAAAAATAATTT 
AGATATTCATACTTTCCAACATTATCCTGTGTGATTACAGCATAGGGTCCACTTTGGT 
AATGTGTCAAAGAG




ORF Start: ATG at 22 


ORF Stop: TAA at 907 




SEQ ID NO: 94 


295 aa | MW at 33420.8kD 


NOV31d, 
CG94871-05 Protein Sequence 


MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDY
RTFLQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQMIR
VQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRALALSRQEI 
DMEDEEADLRGAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQ 
QQQQQQQGDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLRAAVTMSLETVRNDLK 
TEGKK 



Sequence comparison of the above protein sequences yields the following
sequence relationships shown in Table 31B.

Table 31B. Comparison of NOV31a against NOV31b through NOV31d.

Protein Sequence | NOV31a Residues/Match Residues | Identities/Similarities for the Matched Region
NOV31b | 1..352 / 1..368 | 343/368 (93%) / 345/368 (93%)
NOV31c | 1..352 / 1..346 | 326/361 (90%) / 328/361 (90%)
NOV31d | 129..352 / 63..295 | 216/233 (92%) / 217/233 (92%)



Further analysis of the NOV31a protein yielded the following properties shown in
Table 31C.



Table 31C. Protein Sequence Properties NOV31a

PSort analysis: | 0.3000 probability located in nucleus; 0.1000 probability located in mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 0.0000 probability located in endoplasmic reticulum (membrane)
SignalP analysis: | No Known Signal Sequence Predicted



A search of the NOV31a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 31D.

Table 31D. Geneseq Results for NOV31a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV31a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAY33492 | Human Machado-Joseph disease associated protein - Homo sapiens, 360 aa. [WO9945944-A1, 16-SEP-1999] | 1..326 / 1..347 | 321/347 (92%) / 323/347 (92%) | 0.0
AAR96128 | Human Machado-Joseph disease-related protein - Homo sapiens, 291 aa. [JP08092289-A, 09-APR-1996] | 1..291 / 1..291 | 289/291 (99%) / 291/291 (99%) | e-166
ABG22866 | Novel human diagnostic protein #22857 - Homo sapiens, 454 aa. [WO200175067-A2, 11-OCT-2001] | 129..291 / 292..454 | 163/163 (100%) / 163/163 (100%) | 2e-87
ABG22866 | Novel human diagnostic protein #22857 - Homo sapiens, 454 aa. [WO200175067-A2, 11-OCT-2001] | 129..291 / 292..454 | 163/163 (100%) / 163/163 (100%) | 2e-87
ABG17452 | Novel human diagnostic protein #17443 - Homo sapiens, 731 aa. [WO200175067-A2, 11-OCT-2001] | 155..321 / 16..191 | 163/176 (92%) / 165/176 (93%) | 1e-85



In a BLAST search of public sequence databases, the NOV31a protein was found to
have homology to the proteins shown in the BLASTP data in Table 31E.

Table 31E. Public BLASTP Results for NOV31a

Protein Accession Number | Protein/Organism/Length | NOV31a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q96TC4 | ATAXIN-3 - Homo sapiens (Human), 361 aa. | 1..352 / 1..361 | 351/361 (97%) / 352/361 (97%) | 0.0
O15284 | JOSEPHIN MJD1 - Homo sapiens (Human), 361 aa. | 1..352 / 1..361 | 350/361 (96%) / 351/361 (96%) | 0.0
O15286 | JOSEPHIN MJD1 - Homo sapiens (Human), 373 aa (fragment). | 2..352 / 1..373 | 347/373 (93%) / 349/373 (93%) | 0.0
Q96TC3 | ATAXIN-3 - Homo sapiens (Human), 364 aa. | 1..326 / 1..335 | 323/335 (96%) / 324/335 (96%) | 0.0
O15285 | JOSEPHIN MJD1 - Homo sapiens (Human), 364 aa. | 1..326 / 1..335 | 322/335 (96%) / 323/335 (96%) | 0.0



PFam analysis predicts that the NOV31a protein contains the domains shown in
Table 31F.



Table 31F. Domain Analysis of NOV31a

Pfam Domain | NOV31a Match Region | Identities/Similarities for the Matched Region | Expect Value
Josephin | 1..198 | 110/209 (53%) / 198/209 (95%) | 2.8e-139
UIM | 223..240 | 11/18 (61%) / 17/18 (94%) | 0.011
UIM | 243..260 | 8/18 (44%) / 16/18 (89%) | 0.0044
UIM | 325..342 | 9/18 (50%) / 17/18 (94%) | 0.02



Example 32. 



The NOV32 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 32A. 

Table 32A. NOV32 Sequence Analysis

SEQ ID NO: 95 | 6224 bp

NOV32a,
CG94946-01 DNA Sequence


CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 




CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 




ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 
CAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACC 
TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 
CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 




CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGT 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 




GGTCGAGACCAGTGCCCGGAGCCCTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTG 
GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 




CCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 
CCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGTGAGACCTGTGG 
AGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGG 
TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 
CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 
AGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGT 
GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG 
ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 
CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 
CAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCGCCTTCGCCCCGC 
TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 
TATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAAC 
CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 
CAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGG 
CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 




GTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 
ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 
GACCATCGCCTGCCGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAG 
GAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAG 
GGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAG 
CAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCGCGACCTCGGACCACTGCC 
AGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCC 
CTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAG 
CGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAG 
CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCT 
ACAACTCCGCGTTGGGCTGCTGCTCTGATGGGAAGACGCCCTCGCTGGACGCAGAGGG 
CTCCAACTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAG 
GGCCAGGAGCTGTTCTACACGCCCGAGATGGCTGACCCCAAGTCAGAACTGTTCGGGG 
AGACAGCCAGGAGCATTGAGAGCACCCTGGACGACCTCTTCCGGAATTCAGACGTCAA 
GAAGGATTTCCGGAGTGTCCGCTTGCGGGACCTGGGGCCCGGCAAATCCGTCCGCGCC 
ATTGTGGATGTGCACTTTGACCCCACCACAGCCTTCAGGGCACCCGACGTGGCCCGGG 




GCAGGAGCACGTGCGATTTATGGACTTTGACTGGTTTCCTGCGTTTATCACGGGGGCC 
ACGTCAGGAGCCATTGCTGCGGGAGCCACGGCCAGAGCCACCACTGCATCGCGCCTGC 








CCCGGACGTCGGCCCCCGGCCCCCCAGCAGCCTCCAAAGCCCTGTGACTCACAGCCCT
GCTTCCACGGGGGGACCTGCCAGGACTGGGCATTGGGCGGGGGCTTCACCTGCAGCTG 
CCCGGCAGGCAGGGGAGGCGCCGTCTGTGAGAAGGTGCTTGGCGCCCCTGTGCCGGCC 




TGGCACTGGAATTCCGGGCGCTGGAGCCTCAGGGGCTGCTGCTGTACAATGGCAACGC 
CCGGGGCAAGGACTTCCTGGCATTGGCGCTGCTAGATGGCCGCGTGCAGCTCAGGTTT 
GACACAGGTTCGGGGCCGGCGGTGCTGACCAGTGCCGTGCCGGTAGAGCCGGGCCAGT 
GGCACCGCCTGGAGCTGTCCCGGCACTGGCGCCGGGGCACCCTCTCGGTGGATGGTGA 
GACCCCTGTTCTGGGCGAGAGTCCCAGTGGCACCGACGGCCTCAACCTGGACACAGAC 
CTCTTTGTGGGCGGCGTACCCGAGGACCAGGCTGCCGTGGCGCTGGAGCGGACCTTCG 
TGGGCGCCGGCCTGAGGGGGTGCATCCGTTTGCTGGACGTCAACAACCAGCGCCTGGA 

CACCCCTGCCTGCCCAACCCCTGCCATGGCGGGGCCCCATGCCAGAACCTGGAGGCTG 
GAAGGTTCCATTGCCAGTGCCCGCCCGGCCGCGTCGGACCAACCTGTGCCGATGAGAA 

GGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCCT 
CGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGGA 
GCTGAGAGGCCTGCACACCATTGCACGGGACCTGGGGGAGAAGATGGCGCTGGAGGCC 
GTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGACG 
GCAAGGGGGACTTCGTGTCGCTGGCACTGCGGGACCGCCGCCTGGAGTTCCGCTACGA 
CCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCTGG 
ACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGACGGCC 
CCCGTGTGTTGGGGGAGTCCCCGAAATCCCGCAAGGTTCCGCACACCGTCCTCAACCT 
GAAGGAGCCGCTCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCT 
GCCGTGTCCTCTGGCTTCGACGGTGCCATCCAGCTGGTCTCCCTCGGAGGCCGCCAGC 
TGCTGACCCCGGAGCACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCC 
CTGCACCCGGGCCTCAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAG 

TGGTGGAGAAGTCAGCGGGGGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGT 
CGAGTACCTCAACGCTGTGACCGAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAA 
CTGAGCCTGCGCACTGAGGCCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGG 
AGCGGGCAGACTATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAA 

TTGCGGGTCGTGGCACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGG 
CCCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCT

ACAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGC 
TGGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCAC 
CAGAGCCCCGCGCCCGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 6196




SEQ ID NO: 96 


2053 aa | MW at 215628.0kD


NOV32a, 
CG94946-01 Protein Sequence 


MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT
RIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQC
SQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLL 
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT 
ACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPS 
ECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGS 
DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC 
RGPAFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPA 
TGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC 
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP 
MLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSA 
SVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLT 
VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP
PVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMAD 
PKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAIVDVHFDPTTAF 
RAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATAR 
ATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPAPQQPP 
KPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTL 
RAYHTLRLALEFRALEPQGLLLYNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTSA
VPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAA
VALERTFVGAGLRGCIRLLDVNNQRLELGIGPGAATRGSGVGECGDHPCLPNPCHGGA 
PCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNPCHGAAPCRVLPEGGAQCECPLGRE 
GTFCQTASGQDGSGPFLADFNGFSHLELRGLHTIARDLGEKMALEAVFLARGPSGLLL
YNGQKTDGKGDFVSLALRDRRLEFRYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKG
ALRVGDGPRVLGESPKSRKVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQL
VSLGGRQLLTPEHVLRQVDVTSFAGHPCTRASGHPCLNGASCVPREARYVCLCPGGFS
GPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESEKALQSNHFELSLRTEATQGLV
LWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREG
SLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAYGTGFVGCLRDVVV
GRHPLHLLEDAVTKPELRPCPTP




SEQ ID NO: 97 | 4760 bp


NOV32b, 
CG94946-02 DNA Sequence 


CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 
CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 
CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 

CAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACC 
TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 
CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 
ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTT 
CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGT 
GTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 




TGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 
GTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAG 
CCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 
CCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGTGAGACCTGTGG 
AGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGG 
TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 
CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 


GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG 
ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 
CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 


TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 




CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 
CAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGG 
CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 
GTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTG 
GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGC 
TGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC
GTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 
ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 
GACCATCGCCTGCCGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAG 
GAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAG 

CAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCGCGACCTCGGACCACTGCC 
AGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCC 
CTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAG 
CGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAG 
CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCT 
ACAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTGCTGCCCGAGGGTGGTGCTCAGTG 
CGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCCTCGGGGCAGGACGGC 
TCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGGAGCTGAGAGGCCTGC 
ACACCATTGCACGGGACCTGGGGGAGAAGATGGCGCTGGAGGCCGTGTTCCTGGCACG 
AGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGACGGCAAGGGGGACTTC 
GAGTCCCCGAAATCCCGCAAGGTTCCGCACACCGTCCTCAACCTGAAGGAGCCGCTCT




CACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGGCCT 
CAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGTGTG 
CCTGTGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAGTCA 
GCGGGGGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGTCGAGTACCTCAACG 
CTGTGACCGAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAACTGAGCCTGCGCAC 
TGAGGCCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGACTAT 
GTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCTCCCAGC 
CCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGTGGC 
ACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCGGC 
TCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGGGCC 
TGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGG 
CTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTC 
ACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGCGCC 
CGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 4732 




SEQ ID NO: 98 


1565 aa | MW at 163817.1kD


NOV32b, 
CG94946-02 Protein Sequence 


MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT
RIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQC
SQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLL 
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT 
ACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPS 
ECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGS
DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC 
RGPAFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPA
TGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC 
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP 
MLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSA 
SVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLT 
VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP 
PVERASCYNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLADFNGFS 
HLELRGLHTIARDLGEKMALEAVFLARGPSGLLLYNGQKTDGKGDFVSLALRDRRLEF
RYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPKSRKVPHTV 
LNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQLVSLGGRQLLTPEHVLRQVDVTSFA 
GHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGR 
TFVEYLNAVTESEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQL 
SYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTD
GALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP




SEQ ID NO: 99 


893 bp 


NOV32c, 
CG94946-03 DNA Sequence 


CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 
CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 
CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGGTGCTCTGGAGTGGCAAGGC 
CACGGAGCGGGCAGACTATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGC 
TACAACCTGGGCTCCCAGCCCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACC 
GCTGGTTGCGGGTCGTGGCACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAA 
TGAGGCCCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGA 
GCCCTGTGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCT 
ACGGCACAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCA 
CCTGCTGGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCT 
GGCACCAGAGCCCCGCGCCCGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 865 




SEQ ID NO: 100 


276 aa | MW at 29688.8kD 
NOV32c,
CG94946-03 Protein Sequence 



MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT
RIFFVNPAPPYLWPAHKNELVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRS
TVPVNTNRWLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPV
GPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP





SEQ ID NO: 101


1931 bp 


NOV32d, 
CG94946-04 DNA Sequence 


CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 
CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 

CCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 
CAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACC 
TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 




ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTT 




GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 




GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 
GCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCTG 
TGGCTTGGGGGCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCA 
CAGGCTTTGTGGGCTGCTTGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGCT 
GGAGGACGCCGTCACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACC 
AGAGCCCCGCGCCCGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 1903 




SEQ ID NO: 102


622 aa | MW at 66353.9kD


NOV32d, 
CG94946-04 Protein Sequence 


MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT
RIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQC
SQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLL 
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT 
ACTLGREIQVARKGPCDRCGQCRFGALPVTGSSPLGATQLDTDGALWLGGLPELPVGP 
ALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP
SEQ ID NO: 103 | 4697 bp


NOV32e, 
CG94946-05 DNA Sequence 


CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 
CACCCGGGCCCGCTGCGGGGCGGCCGCTGCTGCCTCTCCTTGTGGTGGCCGCGTGCGT 




GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 




TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 




ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 

CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGT 
GTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 






GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 
TGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 
GTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAG 




CCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGTGAGACCTGTGG 
AGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAGGGCAGTGTGTGTGTCCCCGG 




CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 
AGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGT 
GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG 
ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 
CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 




TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 

CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 
CAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGG 
CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 
GTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTG 




GTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 




GACCATCGCCTGCCGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAG 
GAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAG 
GGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAG 
CAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCGCGACCTCGGACCACTGCC 
AGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCC 
CTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAG 
CGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAG 
CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCT 
ACAACTCCGCGTTGGGCTGCTGCTCTGATGGGAAGACGCCCTCGCTCGACGCAGAGGG 
CTCCAACTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAG 
GGCCAGGAGCTGTTCTACACGCCCGAGATGGCTGACCCCAAGTCAGAACTGTTCGGGG 
AGACAGCCAGGAGCATTGAGAGCACCCTGGACGACCTCTTCCGGAATTCAGACGTCAA 
GAAGGATTTCCGGAGTGTCCGCTTGCGGGACCTGGGGCCCGGCAAATCCGTCCGCGCC 
ATTGTGGATGTGCACTTTGACCCCACCACAGCCTTCAGGGCACCCGACGTGGCCCGGG 
CCCTGCTCCGGCAGATCCAGGTGTCCAGGCGCCGGTCCTTGGGGGTGAGGCGGCCGCT 
GCAGGAGCACGTGCGATTTATGGACTTTGACTGGTTTCCTGCGTTTATCACGGGGGCC 
ACGTCAGGAGCCATTGCTGCGGGAGCCACGGCCAGAGCCACCACTGCATCGCGCCTGC 
CGTCCTCTGCTGTGACCCCTCGGGCCCCGCACCCCAGTCACACAAGCCAGCCCGTTGC 
CAAGACCACGGCAGCCCCCACCACACGTCGGCCCCCCACCACTGCCCCCAGCCGTGTG 
CCCGGACGTCGGCCCCCGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGTGTGCCTGT 
GTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAGTCAGCGGG
GGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGTCGAGTACCTCAACGCTGTG 
ACCGAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAACTGAGCCTGCGCACTGAGG
CCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGACTATGTGGC 
ACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCTCCCAGCCCGTG 




GGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACCGGCTCCTC 




TGCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTCACCAA 
GCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGCGCCCGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 4669 




SEQ ID NO: 104


1544 aa | MW at 162003.7kD


NOV32e, 
CG94946-05 Protein Sequence 


MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT
RIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQC
SQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLL 
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT
ACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPS 
ECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGS 
DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC
RGPAFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPA 
TGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP
MLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSA 
SVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLT 
VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP 
PVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMAD 
PKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAIVDVHFDPTTAF
RAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATAR
ATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPASCVPR 
EAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESEKALQSNHF 
ELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNR
WLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDTDGALWLGGLPELPVGPALPKAY
GTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP




SEQ ID NO: 105


6494 bp 


NOV32f,
CG94946-06 DNA Sequence


CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC


CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 
CAAGAAGAGCCCGTGCCCCAGCGTGGTGGCGCCTGTGTGTGGGTCGGACGCCTCCACC 
TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 

ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTT 
CGACGGCCCTTGTGACCCCTGTCAGGGCGCCCTCCCTGACCCGAGCCGCAGCTGCCGT 
GTGAACCCGCGCACGCGGCGCCCTGAGATGCTCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 
CCGATCGGGGGCCGCCCGGGGTCTCCTCCTGCAGAAAGTGCGCTCCGGCCAGTGCCAG 




GCCGTCCCCGCTGCTCCTGCGACCGCGTCACCTGTGACGGGGCCTACAGGCCCGTGTG 




GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
AGATGCCGTGTGTGCTTTTGGGGCTGTGTGCTCCGCAG

TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG
CCTGCGAGCTACGGGAAGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC
AGGGCCGTGCGAGCAGGCCGAGTGCGGTTCCGGAGGCTCTGGCTCTGGGGAGGACGGT 
GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG
ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 
CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 



TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 
TATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAAC 
CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 
CAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGG 
CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 
GTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTG 
GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGC 
TGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC 
GTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 
ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 
GACCATCGCCTGCCGCCAGGGCCTGCAAATCTCTATCCAGAGCCTGGGCCCGTGCCAG 
GAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGACTGTGACCACCCCAG 
GGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCTCCCCCTGGCTCCCAG 
CAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCGCGACCTCGGACCACTGCC 
AGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCCCCACGGCACCCTCCC 
ACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAGCACTGATGGAAGCAG 
CGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGCTCTGGGGGGCTCGAG 
CCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCGAGAGGGCTTCCTGCT 
ACAACTCCGCGTTGGGCTGCTGCTCTGATGGGAAGACGCCCTCGCTGGACGCAGAGGG 
CTCCAACTGCCCCGCCACCAAGGTGTTCCAGGGCGTCCTGGAGCTGGAGGGCGTCGAG 
GGCCAGGAGCTGTTCTACACGCCCGAGATGGCTGACCCGAAGTCAGAACTGTTCGGGG 
AGACAGCCAGGAGCATTGAGAGCACCCTGGACGACCTCTTCCGGAATTCAGACGTCAA 
GAAGGATTTCCGGAGTGTCCGCTTGCGGGACCTGGGGCCCGGCAAATCCGTCCGCGCC 
ATTGTGGATGTGCACTTTGACCCCACCACAGCCTTCAGGGCACCCGACGTGGCCCGGG 
CCCTGCTCCGGCAGATCCAGGTGTCCAGGCGCCGGTCCTTGGGGGTGAGGCGGCCGCT 
GCAGGAGCACGTGCGATTTATGGACTTTGACTGGTTTCCTGCGTTTATCACGGGGGCC
ACGTCAGGAGCCATTGCTGCGGGAGCCACGGCCAGAGCCACCACTGCATCGCGCCTGC 
CGTCCTCTGCTGTGACCCCTCGGGCCCCGCACCCCAGTCACACAAGCCAGCCCGTTGC
CAAGACCACGGCAGCCCCCACCACACGTCGGCCCCCCACCACTGCCCCCAGCCGTGTG 
CCCGGACGTCGGCCCCCGGCCCCCCAGCAGCCTCCAAAGCCCTGTGACTCACAGCCCT 
GCTTCCACGGGGGGACCTGCCAGGACTGGGCATTGGGCGGGGGCTTCACCTGCAGCTG 
CCCGGCAGGCAGGGGAGGCGCCGTCTGTGAGAAGGTGCTTGGCGCCCCTGTGCCGGCC 
TTCGAGGGCCGCTCCTTCCTGGCCTTCCCCACCCTCCGCGCCTACCACACGCTGCGCC 

CCGGGGCAAGGACTTCCTGGCATTGGCGCTGCTAGATGGCCGCGTGCAGCTCAGGTTT 



TGGGCGCCGGCCTGAGGGGGTGCATCCGTTTGCTGGACGTCAACAACCAGCGCCTGGA 
GCTTGGCATTGGGCCGGGGGCTGCCACCCGAGGCTCTGGCGTGGGCGAGTGCGGGGAC 
CACCCCTGCCTGCCCAACCCCTGCCATGGCGGGGCCCCATGCCAGAACCTGGAGGCTG 
GAAGGTTCCATTGCCAGTGCCCGCCCGGCCGCGTCGGACCAACCTGTGCCGATGAGAA 
GAGCCCCTGCCAGCCCAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTGCTGCCCGAG 

CGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGGA 
GCTGAGAGGCCTGCACACCATTGCACGGGACCTGGGGGAGAAGATGGCGCTGGAGGCC 
GTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGACG 
GCAAGGGGGACTTCGTGTCGCTGGCACTGCGGGACCGCCGCCTGGAGTTCCGCTACGA 



IAAATCCCGCAAGGTTCCGCACACCGTCCTCAACCT 
GAAGGAGCCGCTCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCT 
STGCCATCCAGCTGGTCTCCCTCGGAGGCCGCCAGC 
TGCTGACCCCGGAGCACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCC 
CTGCACCCGGGCCTCAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAG 
GCTGCCTATGTGTGCCTGTGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGC 
TGGTGGAGAAGTCAGCGGGGGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGT 
CGAGTACCTCAACGCTGTGACCGAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAA 
CTGAGCCTGCGCACTGAGGCCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGG 
AGCGGGCAGACTATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAA 
CCCCTGTGACCGGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCT
GTGGCTTGGTGAGTGTTTTGGGGAGACTAGAGAGGGATGCCCAAGGGTCTCAGATATC
CGAGGGACAGACTCCACCCCCCAGCGCCCACCCTTGAGTCAGGGTGCATGTGAGCCGG 

GAGGCGGGTGCCCAGGTGTGGGCCCCCTGCTGGTCACCTGCTCGTTGGGGTGCCCATC 
AGCATCACTGAGTCACAGCCGGGTGACTCCCACTGTCTGTGCTGCAGGGGCCTGCCGG 
AGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGTGGGCTGCTT 
GCGGGATGTGGTGGTGGGCCGGCACCCGCTGCACCTGCTGGAGGACGCCGTCACCAAG
CCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGCGCCCGCT 




ORF Start: ATG at 37 


ORF Stop: TGA at 6466 




SEQ ID NO: 106 


2143 aa | MW at 225054.6kD


NOV32f, 
CG94946-06 Protein Sequence 


MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT
RIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQC
SQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMLL
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT
ACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPS 
ECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGS 
DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC 
RGPAFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPA
TGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC 
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP 
MLTCPEANATKVCGSDGVTYGNECQLKTIACRQGLQISIQSLGPCQEAVAPSTHPTSA
SVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTVWPVLT 
VPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGLEPLEGSSVATPGP 
PVERASCYNSALGCCSDGKTPSLDAEGSNCPATKVFQGVLELEGVEGQELFYTPEMAD 
PKSELFGETARSIESTLDDLFRNSDVKKDFRSVRLRDLGPGKSVRAIVDVHFDPTTAF
RAPDVARALLRQIQVSRRRSLGVRRPLQEHVRFMDFDWFPAFITGATSGAIAAGATAR 
ATTASRLPSSAVTPRAPHPSHTSQPVAKTTAAPTTRRPPTTAPSRVPGRRPPAPQQPP
KPCDSQPCFHGGTCQDWALGGGFTCSCPAGRGGAVCEKVLGAPVPAFEGRSFLAFPTL
RAYHTLRLALEFRALEPQGLLLYNGNARGKDFLALALLDGRVQLRFDTGSGPAVLTSA 
VPVEPGQWHRLELSRHWRRGTLSVDGETPVLGESPSGTDGLNLDTDLFVGGVPEDQAA 
VALERTFVGAGLRGCIRLLDVNNQRLELGIGPGAATRGSGVGECGDHPCLPNPCHGGA 
PCQNLEAGRFHCQCPPGRVGPTCADEKSPCQPNPCHGAAPCRVLPEGGAQCECPLGRE 
GTFCQTASGQDGSGPFLADFNGFSHLELRGLHTIARDLGEKMALEAVFLARGPSGLLL 
YNGQKTDGKGDFVSLALRDRRLEFRYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKG 
ALRVGDGPRVLGESPKSRKVPHTVLNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQL
VSLGGRQLLTPEHVLRQVDVTSFAGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFS 
GPHCEKGLVEKSAGDVDTLAFDGRTFVEYLNAVTESEKALQSNHFELSLRTEATQGLV 
LWSGKATERADYVALAIVDGHLQLSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREG
SLQVGNEAPVTGSSPLGATQLDTDGALWLGECFGETREGCPRVSDIRGTDSTPQRPPL 
SQGACEPAGWALFSRCSPCSSQCCGAGRRVPRCGPPAGHLLVGVPISITESQPGDSHC 
LCCRGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP




SEQ ID NO: 107 | 5688 bp


NOV32g,
CG94946-07 DNA Sequence 


CCGGCGCGGCCCGCGCGCTCTTCCGCCGCCTCTCGCATGCGCCATGGCCGGCCGGTCC 

CCTGCCCGGAGCCGGCGGGACATGCCCGGAGCGCGCGCTGGAGCGGCGCGAGGAGGAG 
GCGAACGTGGTGCTCACCGGGACGGTGGAGGAGATCCTCAACGTGGACCCGGTGCAGC 
ACACGTACTCCTGCAAGGTTCGGGTCTGGCGGTACTTGAAGGGCAAAGACCTGGTGGC 
CCGGGAGAGCCTGCTGGACGGCGGCAACAAGGTGGTGATCAGCGGCTTTGGAGACCCC 
CTCATCTGTGACAACCAGGTGTCCACTGGGGACACCAGGATCTTCTTTGTGAACCCTG 
CACCCCCATACCTGTGGCCAGCCCACAAGAACGAGCTGATGCTCAACTCCAGCCTCAT 
GCGGATCACCCTGCGGAACCTGGAGGAGGTGGAGTTCTGTGTGGAAGATAAACCCGGG 
ACCCACTTCACTCCAGTGCCTCCGACGCCTCCTGATGCGTGCCGGGGAATGCTGTGCG 
GCTTCGGCGCCGTGTGCGAGCCCAACGCGGAGGGGCCGGGCCGGGCGTCCTGCGTCTG 




TACAGCAACGAATGCGAGCTGCAGCGGGCGCAGTGCAGCCAGCAGCGCCGCATCCGCC 
TGCTCAGCCGCGGGCCGTGCGGCTCGCGGGACCCCTGCTCCAACGTGACCTGCAGCTT 
CGGCAGCACCTGTGCGCGCTCGGCCGACGGGCTGACGGCCTCGTGCCTGTGCCCCGCG 
ACCTGCCGTGGCGCCCCCGAGGGGACCGTCTGCGGCAGCGACGGCGCCGACTACCCCG 
GCGAGTGCCAGCTCCTGCGCCGCGCCTGCGCCCGCCAGGAGAATGTCTTCAAGAAGTT 




GTGAACCCGCGCACGCGGCGCCCTGAGATGCGCCTACGGCCCGAGAGCTGCCCTGCCC 
GGCAGGCGCCAGTGTGTGGGGACGACGGAGTCACCTACGAAAACGACTGTGTCATGGG 









GGTCGAGACCAGTGCCCGGAGCCCTGCCGGTTCAATGCCGTGTGCCTGTCCCGCCGTG 
GCCGTCCCCGCTGCTCCTGCGACCGGGTCACCTGTGACGGGGCCTACAGGCCCGTGTG
TGCCCAGGACGGGCGCACGTATGACAGTGATTGCTGGCGGCAGCAGGCTGAGTGCCGG 
CAGCAGCGTGCCATCCCCAGCAAGCACCAGGGCCCGTGTGACCAGGCCCCGTCCCCAT 
GCCTCGGGGTGCAGTGTGCATTTGGGGCGACGTGTGCTGTGAAGAACGGGCAGGCAGC 
GTGTGAATGCCTGCAGGCGTGCTCGAGCCTCTACGATCCTGTGTGCGGCAGCGACGGC 
GTCACATACGGCAGCGCGTGCGAGCTGGAGGCCACGGCCTGTACCCTCGGGCGGGAGA 
TCCAGGTGGCGCGCAAAGGACCCTGTGACCGCTGCGGGCAGTGCCGCTTTGGAGCCCT 
GTGCGAGGCCGAGACCGGGCGCTGCGTGTGCCCCTCTGAATGCGTGGCTTTGGCCCAG 
CCCGTGTGTGGCTCCGACGGGCACACGTACCCCAGCGAGTGCATGCTGCACGTGCACG 
CCTGCACACACCAGATCAGCCTGCACGTGGCCTCAGCTGGACCCTGCGAGACCTGTGG 



TGTGAGCACCCCCCGCCCGGCCCCGTGTGTGGCAGCGACGGTGTCACCTACGGCAGTG 
CCTGCGAGCTACGGGARGCCGCCTGCCTCCAGCAGACACAGATCGAGGAGGCCCGGGC 



GACTGTGAGCAGGAGCTGTGCCGGCAGCGCGGTGGCATCTGGGACGAGGACTCGGAGG 

ACGGGCCGTGTGTCTGTGACTTCAGCTGCCAGAGTGTCCCAGGCAGCCCGGTGTGCGG 

CTCAGATGGGGTCACCTACAGCACCGAGTGTGAGCTGAAGAAGGCCAGGTGTGAGTCA 

CAGCGAGGGCTCTACGTAGCGGCCCAGGGAGCCTGCCGAGGCCCCACCTTCGCCCCGC 

TGCCGCCTGTGGCCCCCTTACACTGTGCCCAGACGCCCTACGGCTGCTGCCAGGACAA 

TATCACCGCAGCCCGGGGCGTGGGCCTGGCTGGCTGCCCCAGTGCCTGCCAGTGCAAC 

CCCCATGGCTCTTACGGCGGCACCTGTGACCCAGCCACAGGCCAGTGCTCCTGCCGCC 

CAGGTGTGGGGGGCCTCAGGTGTGACCGCTGTGAGCCTGGCTTCTGGAACTTTCGAGG 

CATCGTCACCGATGGCCGGAGTGGCTGTACACCCTGCAGCTGTGATCCCCAAGGCGCC 

GTGCGGGATGACTGTGAGCAGATGACGGGGCTGTGCTCGTGTAAGCCCGGGGTGGCTG 

GACCCAAGTGTGGGCAGTGTCCAGACGGCCGTGCCCTGGGCCCCGCGGGCTGTGAAGC 

TGACGCTTCTGCGCCTGCGACCTGTGCGGAGATGCGCTGTGAGTTCGGTGCGCGGTGC 

GTGGAGGAGTCTGGCTCAGCCCACTGTGTCTGCCCGATGCTCACCTGTCCAGAGGCCA 

ACGCTACCAAGGTCTGTGGGTCAGATGGAGTCACATACGGCAACGAGTGTCAGCTGAA 

GACCATCGCCTGCCGACGGTGTCACCTACGCCAGGGCCTGCAAATCTCTATCCAGAGC 

CTGGGCCCGTGCCAGGAGGCTGTTGCTCCCAGCACTCACCCGACATCTGCCTCCGTGA 

CTGTGACCACCCCAGGGCTCCTCCTGAGCCAGGCACTGCCGGCCCCCCCCGGCGCCCT

CCCCCTGGCTCCCAGCAGTACCGCACACAGCCAGACCACCCCTCCGCCCTCATCGCGA

CCTCGGACCACTGCCAGCGTCCCCAGGACCACCGTGTGGCCCGTGCTGACGGTGCCCC 

CCACGGCACCCTCCCCTGCACCCAGCCTGGTGGCGTCCGCCTTTGGTGAATCTGGCAG

CACTGATGGAAGCAGCGATGAGGAACTGAGCGGGGACCAGGAGGCCAGTGGGGGTGGC

TCTGGGGGGCCCGAGCCCTTGGAGGGCAGCAGCGTGGCCACCCCTGGGCCACCTGTCG 

AGAGGGCTTCCTGCTACAACCCCTGCCATGGGGCGGCGCCCTGCCGTGTGCTGCCCGA 

GGGTGGTGCTCAGTGCGAGTGCCCCCTGGGGCGTGAGGGCACCTTCTGCCAGACAGCC 

TCGGGGCAGGACGGCTCTGGGCCCTTCCTGGCTGACTTCAACGGCTTCTCCCACCTGG

AGCTGAGAGGCCTGCACACCTTTGCACGGGACCTGGGGGAGAAGATGGCGCTGGAGGT 

CGTGTTCCTGGCACGAGGCCCCAGCGGCCTCCTGCTCTACAACGGGCAGAAGACGGAC 

ACCTGGGCAAGGGGGCAGCGGTCATCAGGAGCAGGGAGCCAGTCACCCTGGGAGCCTG 
GACCAGGGTCTCACTGGAGCGAAACGGCCGCAAGGGTGCCCTGCGTGTGGGCGACGGC 
CCCCGTGTGTTGGGGGAGTCCCCGGTTCCGCACACCGTCCTCAACCTGAAGGAGCCGC 
TCTACGTAGGGGGCGCTCCCGACTTCAGCAAGCTGGCCCGTGCTGCTGCCGTGTCCTC 
TGGCTTCGACGGTGCCATCCAGCTGGTCTCCCTCGGAGGCCGCCAGCTGCTGACCCCG 
GAGCACGTGCTGCGGCAGGTGGACGTCACGTCCTTTGCAGGTCACCCCTGCACCCGGG 
CCTCAGGCCACCCCTGCCTCAATGGGGCCTCCTGCGTCCCGAGGGAGGCTGCCTATGT 
GTGCCTGTGTCCCGGGGGATTCTCAGGACCGCACTGCGAGAAGGGGCTGGTGGAGAAG 
TCAGCGGGGGACGTGGATACCTTGGCCTTTGACGGGCGGACCTTTGTCGAGTACCTCA 
ACGCTGTGACCGAGAGCGAGAAGGCACTGCAGAGCAACCACTTTGAACTGAGCCTGCG 
CACTGAGGCCACGCAGGGGCTGGTGCTCTGGAGTGGCAAGGCCACGGAGCGGGCAGAC 
TATGTGGCACTGGCCATTGTGGACGGGCACCTGCAACTGAGCTACAACCTGGGCTCCC 
AGCCCGTGGTGCTGCGTTCCACCGTGCCCGTCAACACCAACCGCTGGTTGCGGGTCGT 
GGCACATAGGGAGCAGAGGGAAGGTTCCCTGCAGGTGGGCAATGAGGCCCCTGTGACC 
GGCTCCTCCCCGCTGGGCGCCACGCAGCTGGACACTGATGGAGCCCTGTGGCTTGGGG 
GCCTGCCGGAGCTGCCCGTGGGCCCAGCACTGCCCAAGGCCTACGGCACAGGCTTTGT 



GTCACCAAGCCAGAGCTGCGGCCCTGCCCCACCCCATGAGCTGGCACCAGAGCCCCGC
GCCCGCTGTAATTATTTTCTATTTTTGTAAACTTGTTGCTTTTTGATATGATTTTCTT 
GCCTGAGTGTTGGCCGGAGGGACTGCTGGCCCGGCCTCCCTTCCGTCCAGGCAGCCGT 
GCTGCAGACAGACCTAGTGCTGAGGGATGGACAGGCGAGGTGGCAGCGTGGAGGGCTC 
GGCGTGGATGGCAGCCTCAGGACACACACCCCTGCCTCAAGGTGCTGAGCCCCCGCCT 
TGCACTGCGCCTGCCCCACGGTGTCCCCGCCGGGAAGCAGCCCCGGCTCCTGAATCAC 
CCTCGCTCCGTCAGGCGGGACTCGTGTCCCAAAAAGGAAGGGGCTGCTGAGGTCTGAT 
GGGGCCCTTCCTCCGGGTGACCCCACAGGGCCTTTCCAAGCCCCTATTTGAGCTGCTC 

AACAATGTTACTGCTGGGCACAGCTCTGCGTTGCTCCCGTGCTGCCTGCGCCAGCCCC 
AGGCTGCTGAGGAGCAGAGGCCAGACCAGGGCCGATCTGGGTGTCCTGACCCTCAGCT 
GGCCCTGCCCAGCCACCCTGGACATGACCGTATCCCTCTGCCACACCCCAGGCCCTGC 
GAGGGGCTATCGAGAGGAGCTCACTGTGGGATGGGGTTGACCTCTGCCGCCTGCCTGG
GTATCTGGGCCTGGCCATGGCTGTGTTCTTCATGTGTTGATTTTATTTGACCCCTGGA 
GTGGTGGGTCTCATCTTTCCCATCTCGCCrGAGAGCGGOTGAGGGCTGCCTCACTGCA 

CAGTGTCTGACCAAGGTCAAGGGGCAGGTGCAGAGGTGGCAGGGATGGCTCCGAAGCC 
AGAA 





ORF Start: ATG at 37 


ORF Stop: TGA at 4735 




SEQ ID NO: 108 


1566 aa | MW at 164102.4kD


NOV32g, 
CG94946-07 Protein Sequence 


MRHGRPVPPGPAAGRPLLPLLVVAACVLPGAGGTCPERALERREEEANVVLTGTVEEI
LNVDPVQHTYSCKVRVWRYLKGKDLVARESLLDGGNKVVISGFGDPLICDNQVSTGDT
RIFFVNPAPPYLWPAHKNELMLNSSLMRITLRNLEEVEFCVEDKPGTHFTPVPPTPPD 
ACRGMLCGFGAVCEPNAEGPGRASCVCKKSPCPSVVAPVCGSDASTYSNECELQRAQC
SQQRRIRLLSRGPCGSRDPCSNVTCSFGSTCARSADGLTASCLCPATCRGAPEGTVCG 
SDGADYPGECQLLRRACARQENVFKKFDGPCDPCQGALPDPSRSCRVNPRTRRPEMRL
RPESCPARQAPVCGDDGVTYENDCVMGRSGAARGLLLQKVRSGQCQGRDQCPEPCRFN 
AVCLSRRGRPRCSCDRVTCDGAYRPVCAQDGRTYDSDCWRQQAECRQQRAIPSKHQGP 
CDQAPSPCLGVQCAFGATCAVKNGQAACECLQACSSLYDPVCGSDGVTYGSACELEAT 
ACTLGREIQVARKGPCDRCGQCRFGALCEAETGRCVCPSECVALAQPVCGSDGHTYPS
ECMLHVHACTHQISLHVASAGPCETCGDAVCAFGAVCSAGQCVCPRCEHPPPGPVCGS
DGVTYGSACELREAACLQQTQIEEARAGPCEQAECGSGGSGSGEDGDCEQELCRQRGG 
IWDEDSEDGPCVCDFSCQSVPGSPVCGSDGVTYSTECELKKARCESQRGLYVAAQGAC 
RGPTFAPLPPVAPLHCAQTPYGCCQDNITAARGVGLAGCPSACQCNPHGSYGGTCDPA 
TGQCSCRPGVGGLRCDRCEPGFWNFRGIVTDGRSGCTPCSCDPQGAVRDDCEQMTGLC 
SCKPGVAGPKCGQCPDGRALGPAGCEADASAPATCAEMRCEFGARCVEESGSAHCVCP 
MLTCPEANATKVCGSDGVTYGNECQLKTIACRRCHLRQGLQISIQSLGPCQEAVAPST 
HPTSASVTVTTPGLLLSQALPAPPGALPLAPSSTAHSQTTPPPSSRPRTTASVPRTTV 
WPVLTVPPTAPSPAPSLVASAFGESGSTDGSSDEELSGDQEASGGGSGGPEPLEGSSV
ATPGPPVERASCYNPCHGAAPCRVLPEGGAQCECPLGREGTFCQTASGQDGSGPFLAD 
FNGFSHLELRGLHTFARDLGEKMALEVVFLARGPSGLLLYNGQKTDGKGDFVSLALRD
RRLEFRYDLGKGAAVIRSREPVTLGAWTRVSLERNGRKGALRVGDGPRVLGESPVPHT 
VLNLKEPLYVGGAPDFSKLARAAAVSSGFDGAIQLVSLGGRQLLTPEHVLRQVDVTSF 
AGHPCTRASGHPCLNGASCVPREAAYVCLCPGGFSGPHCEKGLVEKSAGDVDTLAFDG 
RTFVEYLNAVTESEKALQSNHFELSLRTEATQGLVLWSGKATERADYVALAIVDGHLQ
LSYNLGSQPVVLRSTVPVNTNRWLRVVAHREQREGSLQVGNEAPVTGSSPLGATQLDT
DGALWLGGLPELPVGPALPKAYGTGFVGCLRDVVVGRHPLHLLEDAVTKPELRPCPTP
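
The ORF coordinates and protein lengths reported for NOV32b through NOV32g in Table 32A can be cross-checked with a few lines of Python. The sketch below is illustrative only. It assumes that the reported start coordinate is the first base of the ATG and that the reported stop coordinate is the first base of the stop codon; the function and dictionary names are hypothetical and are not part of the original disclosure.

```python
def orf_codons(start, stop):
    """Return the number of sense codons implied by 1-based ORF coordinates.

    Assumption: `start` is the first base of the ATG and `stop` is the
    first base of the stop codon, so the coding region is start..stop-1.
    """
    coding_nt = stop - start
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError(f"coordinates {start}..{stop} do not span whole codons")
    return coding_nt // 3


# (ORF start, ORF stop, reported protein length in aa), as listed in Table 32A.
TABLE_32A_ORFS = {
    "NOV32b": (37, 4732, 1565),
    "NOV32c": (37, 865, 276),
    "NOV32d": (37, 1903, 622),
    "NOV32e": (37, 4669, 1544),
    "NOV32f": (37, 6466, 2143),
    "NOV32g": (37, 4735, 1566),
}


def check_orfs(table):
    """Map each clone name to True when the coordinates match the reported length."""
    return {
        name: orf_codons(start, stop) == aa
        for name, (start, stop, aa) in table.items()
    }


if __name__ == "__main__":
    for name, ok in check_orfs(TABLE_32A_ORFS).items():
        print(name, "consistent" if ok else "inconsistent")
```

Under the stated coordinate convention, each variant's reported length equals (stop - start) / 3.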



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 32B. 



Table 32B. Comparison of NOV32a against NOV32b through NOV32g. 


Protein Sequence | NOV32a Residues/Match Residues | Identities/Similarities for the Matched Region
NOV32b | 34..1536 / 34..1549 | 1072/1557 (68%) / 1128/1557 (71%)
NOV32c | 1913..2053 / 136..276 | 141/141 (100%) / 141/141 (100%)
NOV32d | 34..549 / 34..549 | 505/516 (97%) / 505/516 (97%)
NOV32e | 34..1536 / 34..1528 | 1170/1515 (77%) / 1201/1515 (79%)
NOV32f | 34..2002 / 34..2002 | 1758/1969 (89%) / 1758/1969 (89%)
NOV32g | 34..1536 / 34..1550 | 1081/1562 (69%) / 1134/1562 (72%)
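
The entries of Table 32B can be read programmatically. The following Python sketch is illustrative only: it computes the length of a residue span written as "start..end" and the percentage implied by an "identical/aligned" fraction. The helper names are hypothetical, and truncating the percentage to a whole number is an assumption about how such reports are commonly printed, not a statement about how the table was generated.

```python
def span_length(span):
    """Length of an inclusive residue range written as 'start..end'."""
    start, end = (int(part) for part in span.split(".."))
    if end < start:
        raise ValueError(f"range {span!r} runs backwards")
    return end - start + 1


def percent(fraction):
    """Percentage implied by a fraction written as 'matches/aligned_length'."""
    matches, aligned = (int(part) for part in fraction.split("/"))
    return 100.0 * matches / aligned


if __name__ == "__main__":
    # NOV32d row: both spans cover 516 residues and the alignment length is 516,
    # which is what an alignment without gaps would look like.
    print(span_length("34..549"))      # length of the NOV32a span
    print(int(percent("505/516")))     # truncated percentage
```

When the aligned length in the denominator exceeds the length of either residue span, as in the NOV32b row, the difference would be accounted for by gap positions in the alignment.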
Further analysis of the NOV32a protein yielded the following properties shown in
Table 32C.



Table 32C. Protein Sequence Properties NOV32a 


PSort analysis: | 0.7618 probability located in outside; 0.1900 probability located in lysosome (lumen); 0.1000 probability located in endoplasmic reticulum (membrane); 0.1000 probability located in endoplasmic reticulum (lumen)
SignalP analysis: | Cleavage site between residues 34 and 35

A search of the NOV32a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 32D.
Table 32D. Geneseq Results for NOV32a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV32a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAW26609 | Human agrin - Homo sapiens, 492 aa. [WO9721811-A2, 19-JUN-1997] | 1591..2053 / 22..492 | 460/471 (97%) / 461/471 (97%) | 0.0
AAB93754 | Human protein sequence SEQ ID NO: 13424 - Homo sapiens, 413 aa. [EP1074617-A2, 07-FEB-2001] | 583..968 / 1..386 | 381/386 (98%) / 384/386 (98%) | 0.0
AAY73993 | Human prostate tumor EST fragment derived protein #180 - Homo sapiens, 416 aa. [DE19820190-A1, 04-NOV-1999] | 1634..2053 / 1..416 | 414/420 (98%) / 414/420 (98%) | 0.0
AAB31889 | Amino acid sequence of a human protein - Homo sapiens, 4393 aa. [WO200105422-A2, 25-JAN-2001] | 1355..2052 / 3639..4393 | 252/794 (31%) / 352/794 (43%) | 7e-88
ABG23265 | Novel human diagnostic protein #23256 - Homo sapiens, 4436 aa. [WO200175067-A2, 11-OCT-2001] | 1355..2051 / 3672..4435 | 252/803 (31%) / 350/803 (43%) | 7e-85


In a BLAST search of public sequence databases, the NOV32a protein was found to
have homology to the proteins shown in the BLASTP data in Table 32E.



Table 32E. Public BLASTP Results for NOV32a 


Protein Accession Number | Protein/Organism/Length | NOV32a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
O00468 | AGRIN PRECURSOR - Homo sapiens (Human), 2026 aa (fragment). | 24..2053 / 1..2026 | 2022/2030 (99%) / 2022/2030 (99%) | 0.0
P25304 | Agrin precursor - Rattus norvegicus (Rat), 1959 aa. | 160..2053 / 51..1959 | 1558/1914 (81%) / 1663/1914 (86%) | 0.0
P31696 | Agrin precursor - Gallus gallus (Chicken), 1955 aa. | 128..2050 / 1..1952 | 1234/1970 (62%) / 1479/1970 (74%) | 0.0
Q90404 | Agrin - Discopyge ommata (Electric ray), 1328 aa (fragment). | 716..2051 / 1..1325 | 733/1353 (54%) / 932/1353 (68%) | 0.0
Q96IC1 | UNKNOWN (PROTEIN FOR IMAGE:3544662) - Homo sapiens (Human), 488 aa (fragment). | 1562..2053 / 1..488 | 486/492 (98%) / 486/492 (98%) | 0.0
Pfam analysis predicts that the NOV32a protein contains the domains shown in
Table 32F.



Table 32F. Domain Analysis of NOV32a 


Pfam Domain | NOV32a Match Region | Identities/Similarities for the Matched Region | Expect Value
Kazal | | 36/61 (59%) |
Kazal | | 21/62 (34%) / 33/62 (53%) | 5.1e-13
Kazal | | 18/61 (30%) / 33/61 (54%) | 7.9e-12
Kazal | | 38/61 (62%) | 4.1e-16
Kazal | | 38/61 (62%) |
 | | 19/61 (31%) / 38/61 (62%) |
Kazal | | 26/62 (42%) / 37/62 (60%) | 1.5e-17
Kazal | | 40/62 (65%) |
laminin_EGF | 797..848 | 28/61 (46%) / 46/61 (75%) |
laminin_EGF | | 37/59 (63%) |
Kazal | 927..973 | 25/62 (40%) / 41/62 (66%) |
SEA | 1134..1256 | 39/132 (30%) / 112/132 (85%) | 1.4e-36
EGF | 1337..1370 | 16/47 (34%) / 24/47 (51%) | 0.00054
laminin_G | 1404..1535 | 70/162 (43%) / 119/162 (73%) | 3.1e-53
EGF | 1557..1589 | 16/47 (34%) / 27/47 (57%) | 5.1e-06
EGF | 1596..1628 | 16/47 (34%) / 25/47 (53%) | 0.0002
laminin_G | 1672..1807 | 70/161 (43%) / 123/161 (76%) | 5.1e-51
EGF | 1826..1860 | 14/47 (30%) / 25/47 (53%) | 2.3e-06
laminin_G | 1905..2036 | 59/161 (37%) / 125/161 (78%) | 1.7e-50



Example 33. 

The NOV33 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 33A. 



Table 33A. NOV33 Sequence Analysis 




SEQ ID NO: 109 | 5354 bp


NOV33a, 
CG95165-01 DNA Sequence


TGTGGCAGGAGGCGATGCGGCGCCGCCGCTACCTGCGGGACCGCTCr-GAGGAGGr-GGr 
GGGCGGCGGAGACGGGCTGCCGCGGTCCCGGGACTGGCTCTACGAGTCCTACTACTGC 
ATGAGCCAGCAGCACCCGCTCATCGTCTTCCTGCTGCTCATCGTCATGGGCTCCTGCC 
TCGCCCTGCTCGCCGTCTTCTTCGCGCTCGGCCTGGAAGTTGAAGACCATGTGGCGTT 
TCTAATAACAGTTCCAACTGCCCTGGCGATTTTCTTTGCGATATTTATCCTGGTCTGC 
ATCGAGTCTGTGTTTAAGAAGCTGCTGCGCCTCTTCTCGTTGGTGATATGGATATGCC 
TTGTTGCCATGGGATACCTGTTCATGTGTTTTGGAGGCACCGTCTCTCCCTGGGACCA 
GGTATCGTTCTTCCTCTTCATCATCTTCGTGGTGTACACCATGCTGCCCTTCAACATG 
CGAGACGCCATCATTGCCAGCGTCCTCACCTCCTCCTCCCACACCATCGTGCTAAGCG 
TCTGCCTGTCTGCAACACCGGGAGGCAAGGAGCACCTGGTCTGGCAGATCCTGGCCAA 
TGTGATCATTTTCATCTGTGGGftACCTGGCGGGAGCCTACCATAAGCACCTCATGGAA 
CTCGCTCTTCAGCA.AACATATCAGGACACCTGTAATTGCATCAAGTCGCGGATCAAGT 




CGCCATGGAGATGAAAGCGGAGA7CATCCAGAGGCTGCAGGGCCCCAAGGCGGGCCAG 
ATGGAGAACACAAATAACTTCCACAACCTGTATGTGAAGCGGCATACAAACGTGAGCA 
TCTTATACGCTGACATCGTTGGCTTTACCCGGCTGGCAAGTGACTGCTCCCCGGGAGA 
ACTAGTCCACATGCTGAATGAGCTCTTTGGAAAGTTTGATCAAATTGCAAAGGAGAAT 
GAATGCATGAGAATTAAAATTTTAGGAGACTGCTACTACTGTGTATCTGGACTCCCTA 
TATCTCTCCCTAACCATGCCAAGAACTGTGTGAAAATGGGGCTGGACATGTGTGAAGC 
CATAAAGAAAGTGAGGGATGCTACTGGAGTTGATATCAACATGCGCGTGGGCGTGCAT 




CACATGATGTGACCTTGGCCAACCACATGGAAGCTGGAGGGGTCCCTGGACGTGTTCA 
CATTTCTTCTGTCACCCTGGAGCACTTGAATGGCGCTTATAAAGTGGAGGAGGGAGAT 
GGTGACATTAGGGACCCATATTTAAAACAGCACCTGGTGAAAACCTACTTTGTGATCA 
ACCCCAAGGGAGAACGACGGAGCCCCCAGCATCTCTTCAGACCTCGCCACACCCTTGA 
TGGAGCCAAAATGAGGGCCTCGGTCCGCATGACCCGGTACTTGGAGTCCTGGGGGGCA 
GCCAAGCCCTTTGCACACCTACATCACAGGGACAGCATGACCACAGAGAACGGCAAGA 
TCAGCACCACGGATGTACCCATGGGTCAGCATAATTTTCAAAATCGCACCTTAAGAAC 
CAAGTCACAAAAGAAGAGATTTGAAGAAGAATTGAATGAAAGGATGATTCAAGCAATT 
GATGGGATTAATGCACAGAAGCAATGGCTCAAGTCTGAAGACATTCAGAGAATCTCAC 
TGCTTTTCTATAACAAAGTACTAGAAAAAGAGTACCGGGCCACGGCACTGCCAGCGTT 
CAAGTATTATGTGACTTGTGCCTGTCTCATATTCTTCTGCATCTTCATTGTGCAGATT 
CTCGTGCTGCCAAAAACGTCTGTCCTGGGCATCTCCTTTGGGGCTGCGTTTCTCTTGC 
TGGCCTTCATCCTCTTCGTCTGCTTTGCTGGACAGCTTCTGCAATGCAGCAAAAAAGC 
CTCTCCCCTGCTCATGTGGCTTTTGAAGTCCTCGGGCATCATTGCCAACCGCCCCTGG 
CCACGGATCTCTCTCACGATCATCACCACAGCCATCATATTAATGATGGCCGTGTTCA 
ACATGTTTTTCCTGAGTGACTCAGAGGAAACAATCCCTCCAACTGCCAACACAACAAA 
CACAAGCTTTTCAGCCTCAAATAATCAGGTGGCGATTCTGCGTGCGCAGAATTTATTT 




TGCGGGTAAACTATGAGCTGAAGATGTTGATCATGATGGTGGCCTTGGTGGGCTACAA 
CACCATCCTACTCCACACCCACGCCCACGTCCTGGGCGACTACAGCCAGGTCTTATTT 
GAGAGACCAGGCATTTGGAAAGACCTGAAGACCATGGGCTCTGTGTCTCTCTCTATAT 
TCTTCATCACACTGCTTGTTCTGGGTAGACAGAATGAATATTACTGTAGGTTAGACTT 
CTTATGGAAGAACAAATTCAAAAAAGAGCGGGAGGAGATAGAGACCATGGAGAACCTG 
AACCGCGTGCTGCTGGAGAACGTGCTTCCCGCGCACGTGGCTGAGCACTTCCTGGCCA 
GGAGCCTGAAGAATGAGGAGCTATACCACCAGTCCTATGACTGCGTCTGTGTCATGTT 
TGCCTCCATTCCGGATTTCAAAGAATTTTATACAGAATCCGACGTGAACAAGGAGGGC 
TTGGAATGCCTTCGGCTCCTGAACGAGATCATCGCTGACTTTGATGATCTTCTTTCCA 
AGCCAAAATTCAGTGGAGTTGAAAAGATTAAGACCATTGGCAGCACATACATGGCAGC 
AACAGGTCTGAGCGCTGTGCCCAGCCAGGAGCACTCCCAGGAGCCCGAGCGGCAGTAC 
ATGCACATTGGCACCATGGTGGAGTTTGCTTTTGCCCTGGTAGGGAAGCTGGATGCCA 
TCAACAAGCACTCCTTCAACGACTTCAAATTGCGAGTGGGTATTAACCATGGACCTGT 
GATAGCTGGTGTGATTGGAGCTCAGAAGCCACAATATGATATCTGGGGCAACACTGTC 
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OSTGAAAGGAAAGGGGGACCTGAAGACGTACTTTGTAAACACAGAAATGTC^GGTCC 
CTTTCCCAGAGCAACGTGGCATCCTGAAGAGTCACCTTCATTTTGGCAAGAAGACTGT 
ATTTTCAGGAAGGTATCACACACTTTCTGACTGCAACTTCTGTCCCTT 





ORF Start: ATG at 15 


ORF Stop: TGA at 3273 




SEQ ID NO: 110 


1086 aa | MW at 122956.2kD


NOV33a, 
CG95165-01 Protein Sequence


MRRRRYLRDRSEEAAGGGDGLPRSRDWLYSSYYCMSQQHPLIVFLLLIVMGSCLALLA 
VFFALGLEVEDHVAFLITVPTALAIFFAIFILVCIESVFKKLLRLFSLVIWICLVAMG 
YLFMCFGGTVSPWDQVSFFLFIIFVVYTMLPFNMRDAIIASVLTSSSHTIVLSVCLSA
TPGGKEHLVWQILANVIIFICGNLAGAYHKHLMELALQQTYQDTCNCIKSRIKLEFEK 
RQQERLLLSLLPAHIAMEMKAEIIQRLQGPKAGQMENTNNFHNLYVKRHTNVSILYAD 
IVGFTRLASDCSPGELVHMLNELFGKFDQIAKENECMRIKILGDCYYCVSGLPISLPN 
HAKNCVKMGLDMCEAIKKVRDATGVDINMRVGVHSGNVLCGVIGLQKWQYDVWSHDVT 
LANHMEAGGVPGRVHISSVTLEHLNGAYKVEEGDGDIRDPYLKQHLVKTYFVINPKGE 
RRSPQHLFRPRHTLDGAKMRASVRMTRYLESWGAAKPFAHLHHRDSMTTENGKISTTD 
VPMGQHNFQNRTLRTKSQKKRFEEELNERMIQAIDGINAQKQWLKSEDIQRISLLFYN 
KVLEKEYRATALPAFKYYVTCACLIFFCIFIVQILVLPKTSVLGISFGAAFLLLAFIL 
FVCFAGQLLQCSKKASPLLMWLLKSSGIIANRPWPRISLTIITTAIILMMAVFNMFFL
SDSEETIPPTANTTNTSFSASNNQVAILRAQNLFFLPYFIYSCILGLISCSVFLRVHY 
ELKMLIMMVALVGYNTILLHTHAHVLGDYSQVLFERPGIWKDLKTMGSVSLSIFFITL 
LVLGRQNEYYCRLDFLWKNKFKKEREEIETMENLNRVLLENVLPAHVAEHFLARSLKN 
EELYHQSYDCVCVMFASIPDFKEFYTESDVNKEGLECLRLLNEIIADFDDLLSKPKFS
GVEKIKTIGSTYMAATGLSAVPSQEHSQEPERQYMHIGTMVEFAFALVGKLDAINKHS 
FNDFKLRVGINHGPVIAGVIGAQKPQYDIWGNTVNVASRMDSTGVLDKIQVTEETSLV 
LQTLGYTCTCRGIINVKGKGDLKTYFVNTEMSRSLSQSNVAS 
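The ORF coordinates and the protein length reported in Table 33A can be cross-checked with simple arithmetic. The sketch below assumes that both positions are 1-based and refer to the first base of the start and stop codons; the function name `orf_protein_length` is introduced here for illustration only.

```python
def orf_protein_length(start: int, stop: int) -> int:
    """Return the number of residues encoded by an open reading frame.

    Assumes `start` is the 1-based position of the first base of the start
    codon and `stop` is the 1-based position of the first base of the stop
    codon (an interpretation of the table entries, not stated in the text).
    """
    coding_bases = stop - start  # bases from the ATG up to, not including, the stop codon
    if coding_bases <= 0 or coding_bases % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_bases // 3


# Values reported for NOV33a: ORF Start ATG at 15, ORF Stop TGA at 3273.
print(orf_protein_length(15, 3273))  # 1086
```

Under these assumptions the result agrees with the 1086 aa length listed for SEQ ID NO: 110.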



Further analysis of the NOV33a protein yielded the following properties shown in 
Table 33B. 



Table 33B. Protein Sequence Properties NOV33a 


PSort analysis: 


0.8000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 65 and 66 



A search of the NOV33a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 33C.
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Table 33C. Geneseq Results for NOV33a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #,
Date]


NOV33a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the
Matched Region


Expect 
Value 


AAE02938


Human adenylate cyclase 25678 - 

Homo sapiens, 1086 aa. 
[WO200144453-A1, 21-JUN-2001]


1..1086
1..1086 


1086/1086 (100%) 
1086/1086(100%) 


0.0 


AAB02006


Adenylyl cyclase type II-C2 C2 alpha 
domain - Homo sapiens, 1090 aa. 
[US6107076-A, 22-AUG-2000] 


1..1086
1..1090 


1039/1090(95%) 
1062/1090(97%) 


0.0 


AAR94560


Rat adenylyl cyclase - Rattus sp, 1090 
aa. [WO9608260-A1, 21-MAR-1996]


1..1086
1..1090 


1039/1090(95%) 
1062/1090(97%) 


0.0 


AAU01924


Human adenylate cyclase polypeptide - 
Homo sapiens, 1077 aa. 
[WO200125448-A1, 12-APR-2001] 


22..1078
10..1069


605/1070 (56%) 
772/1070 (71%) 


0.0 


AAB02008


Type IV adenylyl cyclase - Homo 
sapiens, 1064 aa. [US6107076-A,
22-AUG-2000]


22..1082
10..1062


609/1072 (56%) 
769/1072 (70%) 


0.0 



In a BLAST search of public sequence databases, the NOV33a protein was found to
have homology to the proteins shown in the BLASTP data in Table 33D.
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Table 33D. Public BLASTP Results for NOV33a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV33a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P26769 


Adenylate cyclase, type II (EC 
4.6.1.1) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Rattus 
norvegicus (Rat), 1090 aa. 


1..1086
1..1090


1039/1090 (95%) 
1062/1090 (97%) 


0.0 


Q08462 


Adenylate cyclase, type II (EC 
4.6.1.1) (ATP pyrophosphate-lyase) 
(Adenylyl cyclase) - Homo sapiens 
(Human), 887 aa (fragment). 


200..1086
1..887 


887/887 (100%) 
887/887 (100%) 


0.0 


Q91WF3


SIMILAR TO ADENYLYL 
CYCLASE 4 (ADENYLYL 
CYCLASE TYPE 4) (EC 4.6.1.1) - 
Mus musculus (Mouse), 1077 aa. 


22..1082
10..1075 


612/1074 (56%) 
780/1074 (71%) 


0.0 


CAC37757


SEQUENCE 2 FROM PATENT 
WO0125448 - Homo sapiens
(Human), 1077 aa. 


22..1078
10..1069


605/1070 (56%) 
772/1070 (71%) 


0.0 


P26770 


Adenylate cyclase, type IV (EC 
4.6.1.1) (ATP pyrophosphate-lyase)
(Adenylyl cyclase) - Rattus 
norvegicus (Rat), 1064 aa. 


22..1082
10..1062




609/1072(56%) 
769/1072 (70%) 


0.0 
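The identity and similarity cells in the BLASTP tables follow a "matches/aligned length (percent)" pattern. The following sketch parses such a cell and recomputes the percentage; the helper names `parse_match` and `percent` are hypothetical and are not part of any BLAST tool.

```python
import re


def parse_match(cell: str) -> tuple[int, int, int]:
    """Split a cell such as '1039/1090 (95%)' into (matches, length, reported percent)."""
    m = re.fullmatch(r"\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)\s*", cell)
    if m is None:
        raise ValueError(f"unrecognized cell: {cell!r}")
    return int(m.group(1)), int(m.group(2)), int(m.group(3))


def percent(matches: int, length: int) -> float:
    """Recompute the percentage from the raw counts."""
    return 100.0 * matches / length


matches, length, reported = parse_match("1039/1090 (95%)")
print(matches, length, reported)            # 1039 1090 95
print(round(percent(matches, length), 2))   # 95.32
```

Recomputing a value this way is a quick check for transcription errors in a table. Note that 612/1074 evaluates to about 56.98%, so the 56% reported in Table 33D is consistent with truncation rather than rounding; that reading is an inference from the numbers, not something the text states.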



PFam analysis predicts that the NOV33a protein contains the domains shown in 
Table 33E.



Table 33E. Domain Analysis of NOV33a 


Pfam Domain 


NOV33a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Sodcu 


388..395


6/8 (75%) 
8/8(100%) 


0.73 


guanylate_cyc 


276..460


71/226(31%) 
151/226 (67%) 


2.7e-68 


guanylate_cyc 


873..1073 


89/227 (39%) 
186/227 (82%) 


2.1e-96 
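Match regions in the domain tables are written as "start..end" residue ranges. As a small illustration, the sketch below converts such a range into a residue count; `region_length` is a name chosen here, and the inclusive, 1-based interpretation of the range is an assumption.

```python
def region_length(region: str) -> int:
    """Return the number of residues in a 'start..end' range (inclusive, 1-based)."""
    start_text, end_text = region.replace(" ", "").split("..")
    start, end = int(start_text), int(end_text)
    if end < start:
        raise ValueError(f"end precedes start in {region!r}")
    return end - start + 1


# Second guanylate_cyc match region listed for NOV33a in Table 33E.
print(region_length("873..1073"))  # 201
```

Stripping spaces before splitting lets the same helper read ranges that were printed with stray whitespace, such as "942.. 1006".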



Example 34. 



The NOV34 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 34A.
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Table 34A. NOV34 Sequence Analysis 




SEQ ID NO: 111 | 3117 bp


NOV34a,
CG95175-01 DNA Sequence


CCTCCCCAGTAGCTGGGACTATGGGAGCGTGCCACCATGCCTGGTTAATTTTTGTATT


ATCCTCCTGGATTCCAAAGCCTCCCAGGCCGAGCTGGGCTGGACTGCACTGCCAAGTA 
ATGGGTGGGAGGAGATCAGCGGCGTGGATGAACACGACCGTCCCATCCGCACGTACCA 
AGTGTGCAATGTGCTGGAGCCCAACCAGGACAACTGGCTGCAGACTGGCTGGATAAGC 
CGTGGCCGCGGGCAGCGCATCTTCGTGGAACTGCAGTTCACACTCCGTGACTGCAGCA 




TGAGGCCGACCTGGGCCGTGGGCGTCCCCGCCTAGGCGGCAAAATCGACACGATCGCG 
GCGGACGAGAGCTTCACGCAGGGCGACCTGGGTGAGCGCAAGATGAAGCTGAACACAG 
AGGTGCGCGAGATCGGACCGCTCAGCCGGCGGGGTTTCCACCTGGCCTTTCAGGACGT 




GTGCGGGGCCTGGCCACGTTCCCAGCCACCGCAGCCGAGAGCGCCTTCTCCACACTGG 
TGGAAGTGGCCGGAACGTGCGTGGCGCACTCGGAAGGGGAGCCTGGCAGCCCCCCACG 




GCGGGATTCCAGGAGCGTGGTGACTTCTGCGAAGGTATCTGTCCCCCAGGGTTTTACA 
AGGTGTCCCCGCGGCGGCCCCTCTGCTCACCGTGCCCAGAGCACAGCCGGGCCCTGGA 

GCCGCTCGCCGCTGGTGCT^ 




CTCGGACGTCACCTACTCGCTGCTGTGCCTGCGCTGCGGCCGCGAGGGCCCGGCGGGC 

TCCACCGGGCCCGGGGGTAAGGCCGTCCGCGCCCCCCACCCCGAGGCCACCGCGCCTG 

GGAGGATGAGATCCGCAGGGACCGAGTGGAACCCCAGAGCGTGTCCCTGTCGTGGCGG 
GAGCCCATCCCTGCCGGAGCCCCTGGGGCCAATGACACGGAGTACGAGATCCGATACT 
ACGAGAAGGTGAGTGCGCAGAGTGAGCAGACTTACTCCATGGTGAAGACAGGGGCGCC 
CACAGTCACCGTGATTTTCCTCCCAGCTGCCTCAGGGTCCAGGGACCAGAGCCCCGCC 




TGCTGGCCATTTGGAGGAGGAGGCCCTGCAGCTATGGCAAAGGAGGAGGGGATGCCCA 
TGATGAAGAGGAGCTGTATTTCCACTGTGAGTTGGCTGGGAAAGTCCCAACACGTCGC 
ACATTCCTGGACCCCCAGAGCTGTGGGGACCTGCTGCAGGCTGTGCATCTGTTCGCCA 
AGGAACTGGATGCGAAAAGCGTCACGCTGGAGAGGAGCCTTGGAGGAGGCAAGCTGGG 




CTCGTAGCCGTGCACATGCTGAGGGACAGCGCCTCCGACTCACAGAGGCTCGGCTTCC 




CGTTGTTACCCGAGGTAGGGGAAGCACCTTGATGATTGTCACCGAGTACATGAGCCAT 




TGATGGGGTTGCTGCCTGGGCTGGCATCAGCCATGAAGTATCTGTCAGAGATGGGCTA 
CGTTCACCGGGGCCTGGCAGCTCGCCATGTGCTGGTCAGCAGCGACCTTGTCTGCAAG 
ATCTCTGGCTTCGGGCGGGGCCCCCGGGACCGATCAGAGGCTGTCTACACCACTATGG 
TGAGGCTACAGAGTGGCCGGAGCCCAGCGCTATGGGCCGCTCCCGAGACACTTCAGTT 
TGGCCACTTCAGCTCTGCCAGTGACGTGTGGAGCTTCGGCATCATCATGTGGGAGGTG 
ATGGCCTTTGGGGAGCGGCCTTACTGGGACATGTCTGGCCAAGACGTGGTGATCAAGG 
CTGTGGAGGATGGCTTCCGGCTGCCACCCCCCAGGAACTGTCCTAACCTTCTGCACCG 
ACTAATGCTCGACTGCTGGCAGAAGGACCCAGGTGAGCGGCCCAGGTTCTCCCAGATC 
CACAGCATCCTGAGCAAGATGGTGCAGGACCCAGAGCCCCCCAAGTGTGCCCTGACTA 
CCTGTCCCAGGCCTCTGACCCGCAGGCCTCCCACTCCACTAGCCGACCGTGCCTTCTC 




TACAAGGACAGCTTCGCGGCTGCTGGCTATGGGAGCCTGGAGGCCGTGGCCGAGATGA 
CTAGCCAGGACCTGGTGAGCCTAGGCATCTCTTTGGCTGAACATCGAGAGGCCCTCCT 
CAGCGGGATCAGCGCCCTGCAGGCACGAGTGCTCCAGCTGCAGGGCCAGGGGGTGCAG 
GTGTGAGTGGACCCCATTCTTCCAAGGCAGGACTCCGGTGGGG 
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ORF Start: ATG at 21 


ORF Stop: TGA at 3078 




SEQ ID NO: 112 


1019 aa | MW at 110412.5kD


NOV34a, 
CG95175-01 Protein Sequence


GVDEHDRPIRTYQVCNVLEPNQDNWLQTGWISRGRGQRIFVELQFTLRDCSSIPGAAG 
TCKETFNVYYLETEADLGRGRPRLGGKIDTIAADESFTQGDLGERKMKLNTEVREIGP
LSRRGFHLAFQDVGACVALVSVRVYYKQCRATVRGLATFPATRAESAFSTLVEVAGTC 
VAHSEGEPGSPPRMHCGADGEWLVPVGRCSCSAGFQERGDFCEGICPPGFYKVSPRRP 
LCSPCPEHSRALENASTFCVCQDSYARSPTDPPSASCTRPPSAPRDLOYSLSRSPLVL 
RLRWLPPADSGGRSDVTYSLLCLRCGREGPAGACEGPRVAFLPRQAGLRERAATLLHL 
RPGARYTVRVAALNGVSGPAAAAGTTYAQVTVSTGPGGKAVRAPHPEATAPAAPAPSW 
GRPVGPAGSAPKEEDEIRRDRVEPQSVSLSWREPIPAGAPGANDTEYEIRYYEKVSAQ 
SEQTYSMVKTGAPTVTVIFLPAASGSRDQSPAIWTWTISALLVLGSVMSVLAIWRR 
RPCSYGKGGGDAHDEEELYFHCELAGKVPTRRTFLDPQSCGDLLQAVHLFAKELDAKS 
VTLERSLGGGKLGGRFGELCCGCLQLPGRQELLVAVHMLRDSASDSQRLGFLAEALTL 
GQFDHSHIVRLEGVVTRGRGSTLMIVTEYMSHGALDGFLRQRHEGQLVAGQLMGLLPG
LASAMKYLSEMGYVHRGLAARHVLVSSDLVCKISGFGRGPRDRSEAVYTTMVRLQSGR
SPALWAAPETLQFGHFSSASDVWSFGIIMWEVMAFGERPYWDMSGQDVVIKAVEDGFR
LPPPRNCPNLLHRLMLDCWQKDPGERPRFSQIHSILSKMVQDPEPPKCALTTCPRPLT
RRPPTPLADRAFSTFPSFGSVGAWLEALDLCRYKDSFAAAGYGSLEAVAEMTSQDLVS
LGISLAEHREALLSGISALQARVLQLQGQGVQV 



Further analysis of the NOV34a protein yielded the following properties shown in 
Table 34B. 



Table 34B. Protein Sequence Properties NOV34a 


PSort analysis: 


0.4600 probability located in plasma membrane; 0.1000 probability located in 
endoplasmic reticulum (membrane); 0.1000 probability located in endoplasmic 
reticulum (lumen); 0.1000 probability located in outside 


SignalP analysis: 


Cleavage site between residues 43 and 44 
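The PSort entry is a semicolon-separated list of localization probabilities. A minimal sketch for reading such an entry and selecting the most probable compartment is given below; the function `parse_psort` and its regular expression are illustrative assumptions and do not reproduce the PSort program itself.

```python
import re


def parse_psort(text: str) -> list[tuple[str, float]]:
    """Extract (location, probability) pairs from a PSort summary string."""
    pattern = re.compile(r"(\d\.\d+)\s+probability\s+located\s+in\s+([^;]+)")
    # Collapse internal whitespace so that line-wrapped locations read cleanly.
    return [(" ".join(loc.split()), float(prob)) for prob, loc in pattern.findall(text)]


psort_nov34a = (
    "0.4600 probability located in plasma membrane; 0.1000 probability located in "
    "endoplasmic reticulum (membrane); 0.1000 probability located in endoplasmic "
    "reticulum (lumen); 0.1000 probability located in outside"
)

predictions = parse_psort(psort_nov34a)
best = max(predictions, key=lambda item: item[1])
print(best)  # ('plasma membrane', 0.46)
```

For NOV34a the highest-scoring location in Table 34B is the plasma membrane, which is what the sketch selects.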



A search of the NOV34a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 34C.
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Table 34C. Geneseq Results for NOV34a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV34a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM47209


Human NOV3 protein - Homo 

sapiens, 1000 aa. 
[WO200174851-A2, 11-OCT-2001]


7..1019 
9..1000


933/1054 (88%) 
936/1054 (88%) 


0.0 


AAU03553 


Human protein kinase #53 - 
Homo sapiens, 1009 aa. 
[WO200138503-A2, 31-MAY-2001]


24..1019 
18..1009


913/1048(87%) 
916/1048 (87%) 


0.0 


AAW03421 


Mouse developmental kinase 1 - 
Mus sp, 998 aa.
[WO9621013-A1, 11-JUL-1996]


7..1019
9..998 


522/1055 (49%) 
694/1055(65%) 


0.0 


AAR85092 


EPH-like receptor protein 
tyrosine kinase HEK11 - Homo
sapiens, 998 aa.
[WO9528484-A1, 26-OCT-1995]


7..1019
9..998 


520/1055 (49%) 
696/1055 (65%) 


0.0 


AAR85090 


EPH-like receptor protein 
tyrosine kinase HEK7 - Homo 
sapiens, 991 aa.
[WO9528484-A1, 26-OCT-1995]


28..1010
33..98S 


457/1018 (44%) 
641/1018(62%) 


0.0 



In a BLAST search of public sequence databases, the NOV34a protein was found to
have homology to the proteins shown in the BLASTP data in Table 34D.
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Table 34D. Public BLASTP Results for NOV34a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV34a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAD13085


SEQUENCE 5 FROM 
PATENT WO0174851 -
Homo sapiens (Human), 
1000 aa. 


7..1019 
9..1000


933/1054 (88%) 
936/1054 (88%) 


0.0 


CAD23752 


SEQUENCE 1 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
974 aa (fragment). 


33..1019 
1..974 


925/1027(90%) 
926/1027(90%) 


0.0 


CAD23753 


SEQUENCE 3 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
1022 aa (fragment). 


33..1019 
1..1022 


930/1071 (86%) 
931/1071 (86%) 


0.0 


CAD23754 


SEQUENCE 5 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
647 aa (fragment). 


33..653
1..623 


575/661 (86%) 
576/661 (86%) 


0.0 


CAD23755 


SEQUENCE 7 FROM 
PATENT WO0208253 - 
Homo sapiens (Human), 
695 aa (fragment). 


33..653
1..671


580/705 (82%) 
581/705 (82%) 


0.0 



PFam analysis predicts that the NOV34a protein contains the domains shown in 
Table 34E.



Table 34E. Domain Analysis of NOV34a 


Pfam Domain 


NOV34a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


EPH_lbd


32..203 


115/178(65%) 
156/178(88%) 


4.4e-119 


fn3


331..426


25/98 (26%) 
66/98 (67%) 


2e-06 


pkinase 


671..903


70/269 (26%) 
167/269(62%) 


1.1e-37


SAM 


942..1006


27/68 (40%) 
50/68 (74%) 


2.9e-15 
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Example 35. 

The NOV35 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 35A.



Table 35A. NOV35 Sequence Analysis 




SEQ ID NO: 113


765 bp 


NOV35a, 
CG95693-01 DNA Sequence 


ATGGTGCTCTTTGGCTGGCTTTTTCGCAGAGTTCGCAGAGAGAATGTTATCTTTGGCA 
TTCTAACAGTGATGTCAATACAAGGTTATGCAAACCTCTGTAATCAATGGAGCATAAC 
AGGAGAATTTAATGATTTGCCTCAGGAAGAACTTTTACAGTGGATCAAATACAATACC 
GTACCAGGTGACAACTCAGTGGGTCCAGCTCTTGCTGGACAGGCAGAGTGGTCTCTCC 
TCATTTCCACCACTACCACAGGCCCGTGGGAGGTAATGCCTGGCTACCGCCAACATTC 
ACTTAAGGCTGAAGGACCTTTCAGTCAGCTTGTGGTGAGTGCTGCCAGGCCTGGAACT 
CACCCTTCAGGGCAGTGGGCTCTCTTCCAGCCCAGGGCAGATGCTGTCTTTGCAGGTG 
CCATGCCTACAATGGCAAGTGTCAAGCTGTCTACACTTCATCCCATTGTGAATCATCC 
ACATTACGAAGATGCAGACTTGAGAAATTGTGAATTACGTTACTCACATGGAAAAAGG 
AAATTTGCAAATGTGATTAAATTAATGCTCATGAGATGTGGAGATTATCCTTATTATC 
CAGGGATCATGGGATGTGGTCTGCACACTCAACTGAGAAGGTCCAGTAATGGCAAGGC 
AGTCACCAGGGGACCCATGTCCTCAGAGGACCTGGCTCCTCAGAGAAACCGCAGGTTC 
ACTGGGGGAGATTATCAGGACTTTAGACCTCAGGATGAAGAGACACACTCCAGGAGGG 
GCTGTGGATAG 




ORF Start: ATG at 1 


ORF Stop: TAG at 763 




SEQ ID NO: 114


254 aa | MW at 28531.2kD


NOV35a, 
CG95693-01 Protein Sequence 


MVLFGWLFRRVRRENVIFGILTVMSIQGYANLCNQWSITGEFNDLPQEELLQWIKYNT
VPGDNSVGPALAGQAEWSLLISTTTTGPWKVMPGYRQHSLKAEGPFSQLVVSAARPGT
HPSGQWALFQPRADAVFAGAMPTMASVKLSTLHPIVNHPHYEDADLRNCELRYSHGKR
KFANVIKLMLMRCGDYPYYPGIMGCGLHTQLRRSSNGKAVTRGPMSSEDLAPQRNRRF
TGGDYQDFRPQDEETHSRRGCG
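The relationship between the DNA sequence and the encoded polypeptide in Table 35A can be illustrated by translating the ORF with the standard genetic code. The sketch below is an assumption-laden illustration: the `translate` helper is introduced here, only the standard nuclear code is considered, and translation simply stops at the first in-frame stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, start: int = 1) -> str:
    """Translate `dna` from the 1-based position `start` up to the first stop codon."""
    seq = dna.upper()
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the NOV35a DNA sequence (ORF Start: ATG at 1).
first_line = "ATGGTGCTCTTTGGCTGGCTTTTTCGCAGAGTTCGCAGAGAGAATGTTATCTTTGGCA"
print(translate(first_line))  # MVLFGWLFRRVRRENVIFG
```

The 58-base line yields 19 complete codons; the trailing base belongs to a codon completed on the next line of the listing.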



Further analysis of the NOV35a protein yielded the following properties shown in 
Table 35B. 



Table 35B. Protein Sequence Properties NOV35a 


PSort analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.3592 
probability located in lysosome (lumen); 0.2463 probability located in microbody 
(peroxisome); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 14 and 15 



A search of the NOV35a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 35C.
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Table 35C. Geneseq Results for NOV35a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #,
Date]


NOV35a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW85723 


Novel protein (Clone AX56_28) - 

Homo sapiens, 171 aa. 
[WO9920644-A1, 29-APR-1999] 


3..100
23..115 


59/98 (60%) 
64/98 (65%) 


4e-23 


AAG03191 


Human secreted protein, SEQ ID 
[EP1033401-A2, 06-SEP-2000]


3..58 


46/56 (82%) 
48/56 (85%) 


1e-19


AAM41346 


Human polypeptide SEQ ID NO 
6277 - Homo sapiens, 618 aa. 
[WO200153312-A1, 26-JUL-2001] 


3..60 
434..491 


36/58 (62%) 
45/58 (77%) 


4e-15 


AAM39560 


Human polypeptide SEQ ID NO 
2705 - Homo sapiens, 618 aa. 
[WO200153312-A1, 26-JUL-2001]


3..60
434..491 


36/58 (62%) 
45/58 (77%) 


4e-15 


AAG68230 


Cyclin-dependent human kinase 14
protein SEQ ID NO:2 - Homo
sapiens, 129 aa.
[WO200175013-A2, 11-OCT-2001]


130..180
5..56


38/52 (73%) 
41/52 (78%) 


3e-12 



In a BLAST search of public sequence databases, the NOV35a protein was found to
have homology to the proteins shown in the BLASTP data in Table 35D.
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Table 35D. Public BLASTP Results for NOV35a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV35a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96LZ9 


CDNA FLJ32949 FIS, CLONE 
TESTI2008020, WEAKLY 

SIMILAR TO DPY-19 
PROTEIN - Homo sapiens 
(Human), 758 aa. 


3..100
576..668 


59/98 (60%) 
64/98 (65%) 


1e-22


O94954


KIAA0877 PROTEIN - Homo
sapiens (Human), 580 aa 
(fragment). 


3..60
396..453


36/58 (62%) 
45/58 (77%) 


1e-14


S44629 


F22B7.10 protein - 
Caenorhabditis elegans, 628 aa. 


90..183 
464..551


41/96(42%) 
51/96 (52%) 


1e-09


P34413 


Protein dpy-19 - Caenorhabditis 
elegans, 683 aa. 


90..183
519..606 


41/96 (42%) 
51/96 (52%) 


1e-09


Q9VWR8 


CG6659 PROTEIN - Drosophila 
melanogaster (Fruit fly), 872 aa. 


121..158
721..758


17/38(44%) 
22/38 (57%) 


0.077 



PFam analysis predicts that the NOV35a protein contains the domains shown in
Table 35E.



Table 35E. Domain Analysis of NOV35a 






Pfam Domain


NOV35a Match Region


Identities/
Similarities
for the Matched Region


Expect Value


No Significant Matches Found 



Example 36. 



The NOV36 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 36A. 
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Table 36A. NOV36 Sequence Analysis 



SEQ ID NO: 115



3500 bp 



NOV36a, 
CG95814-01 DNA Sequence



rGCAC-TTCTCCCGGCTATGC ATGGGCTGCGGCTCCGGCTCTAGCACAGGCACCA 



GCCCAGCCCGCAGCCCGCGCTCCACGGAGCGCTGGAGACCACCGTGGGGGGCCCCTTC 
ICTCGAGAGAAGCGGTCTTGGAGGTATGGATTTAGGTGGTTGGATTTTTTCCGTG 
GATCTATCAATTCACAATTCGAATTTGGAAGAAAGAAGGAAAACATGACGTCTCCAGC 
CAAATTCAAAAAGGATAAGGAGATCATAGCAGAGTACGATACTCAGGTCAAAGAGATC 

AACTGTTGCAGGACCTCCAGGACTTCTTCCGAAAGAAGGCAGAGATTGAGATGGACTA 
CTCCCGCAACCTGGAGAAGCTGGCAGAACGCTTCCTGGCCAAGACACGCAGCACCAAG 
GACCAGCAATTCAAGAAGGATCAGAATGTTCTCTCTCCAGTCAACTGCTGGAATCTCC 
TCTTAAACCAGGTGAAGCGGGAAAGCAGGGACCATACCACCCTGAGTGACATCTACCT 
GAATAATATCATTCCTCGATTTGTACAAGTCAGCGAGGACTCAGGAAGACTCTTTAAA 
AAGAGTAAAGAAGTCGGCCAGCAGCTCCAAGATGATTTGATGAAGGTCCTGAACGAGC 
TCTACTCGGTCATGAAGACATATCACATGTACAATGCCGACAGCATCAGTGCTCAGAG 
CAAACTAAAGGAGGCGGAGAAGCAGGAGGAGAAGCAAATTGGTAAATCGGTAAAGCAG 
GAGGACCGGCAGACCCCACGCTCCCCTGACTCCACGGCCAACGTTCGCATTGAGGAGA 
AACATGTCCGGAGGAGCTCAGTGAAGAAGATTGAGAAGATGAAGGAGAAGCACCAAGC 
CAAGTACACGGAGAATAAGCTGAAGGCCATCAAAGCCCGGAATGAGTACTTGCTGGCT 
TTGGAGGCAACCAATGCATCTGTCTTCAAGTACTACATCCATGACCTATCTGACCTTA 
TTGATCAGTGTTGTGACTTAGGCTACCATGCAAGTCTGAACCGGGCTCTACGCACCTT 
CCTCTCTGCTGAGTTAAACCTGGAACAGTCGAAGCATGAGGGTCTGGATGCCATCGAG 
AATGCAGTAGAAAACCTGGATGCCACCAGTGACAAGCAGCGCCTCATGGAGATGTACA 

TTCCCAGCTCTGTGCCCAGCAGCCTGTCCAGAGTGAGCTGGTACAGAGATGCCAACAA 
CTGCAGTCTCGCTTATCCACTCTAAAGATTGAAAACGAAGAGGTAAAGAAGACAATGG 
AGGCCACCCTGCAAACCATCCAGGACATTGTGACTGTCGAGGACTTTGATGTGTCTGA 
CTGCTTCCAGTACAGCAACTCCATGGAGTCCGTCAAGTCCACGGTCTCTGAAACCTTC 
ATGGGCAAGCCCAGCATTGCTAAGAGGAGAGCCAACCAGCAAGAGACAGAGCAGTTTT 
ATTTCACAAAAATGAAAGAGTACCTGGAGGGCAGGAACCTCATCACCAAGTTACAAGC 
CAAGCATGACCTTCTGCAGAAAACCCTGGGAGAAAGTCAGCGGACAGATTGCAGTCTA 
GCCAGGCGCAGCTCAACTGTGAGGAAACAGGACTCCAGCCAGGCAATTCCTCTAGTGG 
TGGAAAGCTGTATCCGGTTTATCAGCAGACACGGACTACAGCATGAAGGAATTTTCCG 
GGTGTCAGGATCCCAGGTGGAAGTGAATGACATCAAAAATGCCTTTGAGAGAGGAGAG 



ATCATTTATCACAGTTCAGTGAAGAGAACATGATGGACCCCTACAACCTCGCCATCTG 
CTTCGGGCCCTCGCTAATGTCAGTGCCAGAGGGCCACGACCAGGTGTCCTGCCAAGCC 
CACGTGAATGAGCTGATCAAAACCATCATCATCCAGCATGAGAACATCTTCCCAAGCC 
CCAGGGAGCTGGAGGGCCCTGTCTACAGCAGAGGAGGAAGCATGGAGGATTACTGTGA 



CATCAGTACATCGTGGTCCAAGACACCGAGGACGGTGTCGTGGAGAGGTCCAGCCCCA 
AGTCTGAGATTGAGGTCATTTCTGAGCCACCTGAAGAAAAGGTGACAGCCAGAGCGGG 
GGCCAGCTGTCCCAGTGGGGGTCATGTAGCCGATATTTATCTTGCAAACATCAACAAG 
CAAAGGAAGCGTCCAGAATCTGGGAGCATCCGGAAAACTTTTCGGAGTGACAGCCATG 



ATCCTCCCAGCCCATCATGAGCCAGAGCCTCCCCAAAGAAGGGCCAGATAAGTGTTCC 
ATCAGTGGGCACGGGAGCCTCAACTCCATCAGCCGCCACTCATCCCTGAAGAATCGGC 
TGGATAGTCCACAGATCCGGAAGACTGCCACAGCGGGAAGGTCAAAAAGCTTCAATAA 
CCATCGGCCCATGGACCCTGAGGTCATTGCTCAGGATATTGAGGCAACAATGAACTCG 



TGGTTCTGGACACCTTGGAGCCCCTCAAAACCTCCCCAGTGGTGGCCCCCACGTCAGA 
GCCCTCCAGCCCTCTGCACACCCAGCTCCTCAAGGACCCCGAGCCCGCCTTCCAGCGC 
AGCGCCAGTACTGCTGGGGACATCGCCTGCGCCTTCCGGCCTGTCAAGTCTGTCAAGA 
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ORF Start: ATG at 25 


ORF Stop: TGA at 3490 




SEQ ID NO: 116 


1155 aa | MW at 130304.7kD


NOV36a, 
CG95814-01 Protein Sequence


MGCGSGSSTGTSRRRTRPQRPPSACARRSRLPSPQPALHGALETTVGGPFCPREKRSW 
RYGFRWLDFFRGSINSQFEFGRKKENMTSPAKFKKDKEIIAEYDTQVKEIRAQLTEQM 

NVLSPVNCWNLLLNQVKRESRDHTTLSDIYLNNIIPRFVQVSEDSGRLFKKSKEVGQQ
LQDDLMKVLNELYSVMKTYHMYNADSISAQSKLKEAEKQEEKQIGKSVKQEDRQTPRS 
PDSTANVRIEEKHVRRSSVKKIEKMKEKHQAKYTENKLKAIKARNEYLLALEATNASV 
FKYYIHDLSDLIDQCCDLGYHASLNRALRTFLSAELNLEQSKHEGLDAIENAVENLDA 
TSDKQRLMEMYNNVFCPPMKFEFQPHMGDMASQLCAQQPVQSELVQRCQQLQSRLSTL 
KIENEEVKKTMEATLQTIQDIVTVEDFDVSDCFQYSNSMESVKSTVSETFMGKPSIAK 
RRANQQETEQFYFTKMKEYLEGRNLITKLQAKHDLLQKTLGESQRTDCSLARRSSTVR 
KQDSSQAIPLVVESCIRFISRHGLQHEGIFRVSGSQVEVNDIKNAFERGEDPLAGDQN
DHDMDSIAGVLKLYFRGLEHPLFPKDIFHDLMACVTMDNLQERALHIRKVLLVLPKTT 
LIIMRYLFAFLNHLSQFSEENMMDPYNLAICFGPSLMSVPEGHDQVSCQAHVNELIKT 
IIIQHENIFPSPRELEGPVYSRGGSMEDYCDSPHGETTSVEDSTQDVTAEHHTSDDEC 
EPIEAIAKFDYVGRTARELSFKKGASLLLYQRASDDWWEGRHNGIDGLIPHQYIVVQD
TEDGVVERSSPKSEIEVISEPPEEKVTARAGASCPSGGHVADIYLANINKQRKRPESG
SIRKTFRSDSHGLSSSLTDSSSPGVGASCRPSSQPIMSQSLPKEGPDKCSISGHGSLN
SISRHSSLKNRLDSPQIRKTATAGRSKSFNNHRPMDPEVIAQDIEATMNSALNELREL
ERQSSVKHTPDVVLDTLEPLKTSPVVAPTSEPSSPLHTQLLKDPEPAFQRSASTAGDI
ACAFRPVKSVKMAAPVKPPATRPKPTVFPKTNATSPGVNSSTSPQSTDKSCTV 



Further analysis of the NOV36a protein yielded the following properties shown in 
Table 36B. 



Table 36B. Protein Sequence Properties NOV36a 


PSort analysis: 


0.9400 probability located in nucleus; 0.4936 probability located in mitochondrial 
matrix space; 0.3000 probability located in microbody (peroxisome); 0.2087 
probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV36a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 36C.
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Table 36C. Geneseq Results for NOV36a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV36a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM79887 


Human protein SEQ ID NO 
3533 - Homo sapiens, 1094 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


62..1155
1..1094 


1088/1094 
(99%) 

1090/1094 
(99%) 


0.0 


ABG18053


Novel human diagnostic 
protein #18044 - Homo
sapiens, 1095 aa.
[WO200175067-A2, 11-OCT- 
2001] 


63..1155
3..1095


1089/1093 
(99%) 

1089/1093 
(99%) 


0.0 


ABG05128 


Novel human diagnostic 
protein #5119 - Homo sapiens,
1095 aa. [WO200175067-A2,
11-OCT-2001]


3..1095


1089/1093 
(99%) 

1089/1093 
(99%) 




ABG18053


Novel human diagnostic 
protein #18044 - Homo
sapiens, 1095 aa.
[WO200175067-A2, 11-OCT-2001]


63..1155 
3..1095 


1089/1093 
(99%) 

1089/1093 
(99%) 


0.0 


ABG05128 


Novel human diagnostic 
protein #5119 - Homo sapiens,
1095 aa. [WO200175067-A2,
11-OCT-2001]


63..1155
3..1095


1089/1093 
(99%) 

1089/1093 
(99%) 


0.0 



In a BLAST search of public sequence databases, the NOV36a protein was found to
have homology to the proteins shown in the BLASTP data in Table 36D.
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Table 36D. Public BLASTP Results for NOV36a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV36a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect
Value


O75044


KIAA0456 PROTEIN - Homo 
sapiens (Human), 1095 aa 
(fragment). 


63..1155
3..1095


1089/1093 (99%) 
1089/1093 (99%) 


0.0 


Q9P2P2 


KIAA1304 PROTEIN - Homo 
sapiens (Human), 1051 aa 
(fragment). 


107..1155 
12..1051


719/1063 (67%) 
850/1063 (79%) 


0.0 


CAC22407 


SEQUENCE 47 FROM 
PATENT WO0075320 - 
Homo sapiens (Human), 1075 
aa. 


85..1151
1..1062


642/1079 (59%) 
819/1079 (75%) 


0.0 


Q92512 


RHO GTPASE- 
ACTIVATING PROTEIN - 
Mus musculus (Mouse), 987 
aa. 


204..1151
8..974 


554/983 (56%) 
713/983 (72%) 


0.0 


Q91Z69 


GAP1 - Mus musculus 
(Mouse), 714 aa (fragment). 


447..1155
3..714


439/721 (60%) 
545/721 (74%) 


0.0 



PFam analysis predicts that the NOV36a protein contains the domains shown in 
Table 36E.



Table 36E. Domain Analysis of NOV36a 


Pfam Domain 


NOV36a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


FCH 


106..204 


27/109 (25%) 
81/109 (74%) 


2.2e-19 


RhoGAP 


589..741 


54/170 (32%) 
128/170 (75%) 


4.4e-53 


SH3 


815..869 


20/58 (34%) 
41/58(71%) 


1.7e-11



Example 37. 



The NOV37 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 37A. 
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Table 37A. NOV37 Sequence Analysis 



SEQ ID NO: 117 



NOV37a, 
CG95824-01 DNA Sequence



CAAACAGCGATGTCCCAGACGCAGGACTACGAGTGCAGGAGCCATAATGTCGACCTGC 
CGGAGTCGAGGATTCCAGGGTCGAACACTCGGTTGGAGTGGGTGGAGATCATCGAACC 
GCGCACCCGCGAGCGCATGTACGCCAACCTGGTCACCGGTGAGTGCGTGTGGGACCCG 
CCGGCCGGCGTCCGCATCAAGCGCACCAGCGAGAACCAGTGGTGGGAGCTGTTCGACC 
CCAACACGTCCCGCTTCTACTACTACAATGCCAGCACGCAGCGCACGGTGTGGCACCG 
GCCGCAGGGCTGCGACATCATCCCGCTGGCCAAGCTGCAGACGCTGAAGCAGAACACG 
GAGTCCCCGCGCGCCTCGGCGGAGAGCAGCCCCGGGCGCGGCAGCAGCGTCAGCCGTG 
AGGGCAGCACCAGCTCCTCCCTGGAGCCCGAGCCCGACACTGAGAAAGCGCAGGAGTT 
.GCGAGGGCCGGGCGGCCCGCGGCGTTTGGGACAGTGAAGGAGGACAGCGGCAGC 
TCTTCACCACCAGGAGTGTTCCTTGAGAAGGACTATGAGATTTACCGGGATTACAGTG 
CGGACGGCCAGCTTCTTCACTACAGGACCTCCTCGCTGCGGTGGAACTCGGGCGCCAA 
AGAGCGCATGCTCATCAAGGTCGCTGATCGGGAGCCCAGCTTCCTCGCCGCCCAGGGC 
AATGGCTACGCCCCAGACGGCCCACCTGGGGTCCGCTCCCGCAGACCCTCCGGCAGCC 

agcactcacccagcctgcagaccttcgccccggaggctgacggcaccatcttcttccc 
agagaggaggccgtcacccttcctgaagagggccgagctcccagggagcagctccccg' 
ctgctggcccagccccgaaagccctccggggactcgcagccctcctccccgcgctatg 
gctatgaacccccgctctacgaggagcccccagtggagtaccaggcccccatctacga 
tgagccccccatggacgtgcaattcgaggctggcgggggctaccaggccggctctccc 
cagcggtcgccgggccgtaagccccggccgttcctccagcccaacaagcagggccccc 
cctcgccctgccagcagctggtgctcaccaagcagaagtgtcccgagcgcttcctgag 
cctggagtacagtcccgccggcaaggagtacgtgcggcagctggtctacgtggagcag 
gcgggctccagccccaagctgcgcgccggcccgcggcacaagtacgcgcccaaccccg 

cgtcaagtccggagactacagcaccatggagggacctgagctgcggcacagccagccg 
cccacgccgctgccacaggcccaggaggatgccatgtcctggtccagccagcaggaca 

CCCTGTCCTCCACAGGCTACTCCCCGGGCACGCGCAAGCGGAAGAGCAGAAAGCCCTC 
TTTGTGCCAAGCCACCAGCGCCACCCCCACTGAGGGCCCCGGGGACCTGCTTGTGGAG 
CAGCCCCTGGCCGAGGAACAGCCCCCGTGCGGGACCAGCCTCGCCCCCGTGAAGCGAG 
CGGAAGGTGAGGCCGAAGGGGCGCGGGGCGCGGCCGAGCCCTTCCTGGCGCAGGCTCG 
GCTGGCCTGGGAGGCGCAGCAGGCCCACTTCCACATGAAGCAGAGGAGCAGCTGGGAC 
TCCCAGCAGGACGGCTCTGGCTACGAGAGCGACGGCGCCCTGCCACTGCCCATGCCCG 

CTGGAGGAGGGGCACCTTCGAGAAGCTAGGCTTCCCCCAGATCCTGCTGGAGAAGAGC 
GTCTCCGTGCAGACCAACCTGGCCTCACCAGAGCCCTACCTCCACCCCTCACAGTCTG 
AGGACCTCGCTGCCTGTGCCCAGTTCGAGAGCAGCCGGCAGAGCCGCAGCGGCGTTCC 
CAGCTCCAGCTGCGTCTTCCCCACTTTCACGCTGCGCAAGCCCTCCTCGGAGACGGAC 
ATCGAGAACTGGGCCTCCAAGCACTTCAACAAGCACACGCAGGGCCTCTTCCGGCGGA 
AGGTGTCCATCGCCAACATGCTGGCCTGGAGCAGCGAGTCCATCAAGAAGCCCATGAT 
CGTGACAAGCGACCGGCACGTGAAGAAGGAGGCCTGCGAGCTCTTCAAGCTGATCCAG 

CCACCAAGGGCTGGAGCGTGCAGGGCCTGCGGGACGAGCTCTACATCCAGCTGTGCCG 



•TTTCCCGCCCACCCCCAAGTTCCACTCCTACCTGGAAGGCTACA 
TCTACCGGCACATGGACCCCGTCAATGACACTAAAGGGGTGGCGATAAGCACGTATGC 
CAAGTACTGTTACCACAAGCTACAGAAGGCAGCCCTGACCGGGGCCAAGAAGGGGCTG 
AAGAAGCCCAACGTGGAGGAGATCCGGCATGCCAAGAACGCCGTGTTCAGCCCGTCCA 
TGTTCGGCAGCGCACTGCAGGAGGTCATGGGCATGCAGAGAGAGCGCTACCCCGAGCG 
CCAGCTGCCCTGGGTGCAGACACGGCTCTCTGAGGAGGTGCTGGCGCTCAACGGTGAC 
CAGACAGAGGGCATCTTCAGGGTCCCTGGGGACATTGACGAGGTGAATGCCCTGAAGC 
TGCAGGTGGACCAGTGGAAGGTGCCCACAGGCCTGGAAGACCCCCACGTCCCTGCGTC 
CCTGCTGAAGCTGTGGTACCGGGAGCTGGAGGAGCCCCTGATCCCGCACGAGTTCTAC 
GAGCAGTGCATCGCGCACTACGACAGCCCCGAGGCGGCGGTGGCCGTGGTGCACGCGC 
TGCCCCGCATCAACCGCATGGTGCTGTGCTACCTCATCCGCTTCCTGCAGGTCTTCGT 
GCAGCCGGCCAACGTCGCGGTCACCAAGATGGATGTCAGCAACCTGGCCATGGTGATG 
GCGCCCAACTGCTTGCGCTGCCAGTCCGACGACCCGCGCGTCATCTTCGAGAACACCC 
GCAAGGAGATGTCCTTCCTGCGGGTGCTCATCCAGCACCTGGACACCAGCTTCATGGA 
GGGTGTGCTGGAGCGGGGGCGCCCGGGGACAGGAGGGATGTCCTGCCGCCCCCAGCCA 
GGCCGAACTCCGCACTCGCTCTCCCGGCAGAGGGGCCAGAATCGCCCGGCCCAGCCCT 
GGAGCCCCCTCCACTCCCCCAGGCCCCTGGCCCCGGCGCTCCCCACGTCTTCTGCCTG 
GTCTGAGGGTGCAGCCAGGGCACAGCAGCGGCGGGGAGGGCGCCTCTGGCCCCCCACC 
TCACGGCCAGTTCCCGCGGGCACCGCCTCGCCCTCCGCTGGCCGCGGGTCAGCTCCGA 



TCCCTTTCTCCTGTCCTCGTCCTGGCCTGCAGCTCTTCCCAGCCCCGAGAGAGCTTCC 
CGACCTGTCCCCGCCTCCTCTCCCTCCCTCGGCCCGTGGTCCCCAGCTGGTG ACTGCT 
CAGGAGTTTGGGGGCTCCAG 
ORF Start: ATG at 10


ORF Stop: TGA at 3763 




SEQ ID NO: 118


1251 aa | MW at 138786.4kD 


NOV37a, 
CG95824-01 Protein Sequence 


MSQTQDYECRSHNVDLPESRIPGSNTRLEWVEIIEPRTRERMYANLVTGECVWDPPAG 
VRIKRTSENQWWELFDPNTSRFYYYNASTQRTVWHRPQGCDIIPLAKLQTLKQNTESP
RASAESSPGRGSSVSREGSTSSSLEPEPDTEKAQELPARAGRPAAFGTVKEDSGSSSP 
PGVFLEKDYEIYRDYSADGQLLHYRTSSLRWNSGAKERMLIKVADREPSFLAAQGNGY 
APDGPPGVRSRRPSGSQHSPSLQTFAPEADGTIFFPERRPSPFLKRAELPGSSSPLLA 
QPRKPSGDSQPSSPRYGYEPPLYEEPPVEYQAPIYDEPPMDVQFEAGGGYQAGSPQRS 
PGRKPRPFLQPNKQGPPSPCQQLVLTKQKCPERFLSLEYSPAGKEYVRQLVYVEQAGS 
SPKLRAGPRHKYAPNPGGGSYSLQPSPCLLRDQRLGVKSGDYSTMEGPELRHSQPPTP 
LPQAQEDAMSWSSQQDTLSSTGYSPGTRKRKSRKPSLCQATSATPTEGPGDLLVEQPL 
AEEQPPCGTSIAPVKRAEGEAEGARGAAEPFLAQARLAWEAQQAHFHMKQRSSWDSQQ 
DGSGYESDGALPLPMPGPWRAFSEDEALAQQENRHWRRGTFEKLGFPQILLEKSVSV 
QTNLASPEPYLHPSQSEDLAACAQFESSRQSRSGVPSSSCVFPTFTLRKPSSETDIEN 
WASKHFNKHTQGLFRRKVSIANMLAWSSESIKKPMIVTSDRHVKKEACELFKLIQMYM 
GDRRAKADPLHVALEVATKGWSVQGLRDELYIQLCRQTTENFRLESLARGWELMAICL 
AFFPPTPKFHSYLEGYIYRHMDPVNDTKGVAISTYAKYCYHKLQKAALTGAKKGLKKP 
NVEEIRHAKNAVFSPSMFGSALQEVMGMQRERYPERQLPWQTRLSEEVLALNGDQTE 
GIFRVPGDIDEVNALKLQVDQWKVPTGLEDPHVPASLLKLWYRELEEPLIPHEFYEQC 
IAHYDSPEAAVAVVHALPRINRWLCTLIRFLQVFVQPAOTAVTKMDVSNLAMVMAPN 
CLRCQSDDPRVIFENTRKEMSFLRVLIQHLDTSFMEGVLERGRPGTGGMSCRPQPGRT 
PHSLSRQRGQNRPAQPWSPLHSPRPLAPALPTSSAWSEGAARAQQRRGGRLWPPTSRP 
VPAGTASPSAGRGSAPRKCLLCPGAERRCLLGAGLPPCGSCAPWPGPCPAAPVSPSLS 
PVLVLACSSSQPRESFPTCPRLLSLPRPWPSW 
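
As a quick consistency check, the ORF coordinates and the reported protein length of NOV37a can be compared. The sketch below assumes 1-based positions with the "ORF Stop" value pointing at the first base of the stop codon; this convention is inferred from how the numbers line up and is not stated in the text. The function name is illustrative only.

```python
def orf_protein_length(orf_start: int, orf_stop: int) -> int:
    """Number of amino acids encoded between the ATG and the stop codon.

    Assumes 1-based coordinates, with orf_start at the A of the ATG and
    orf_stop at the first base of the stop codon (an inferred convention).
    """
    coding_bases = orf_stop - orf_start
    if coding_bases <= 0 or coding_bases % 3 != 0:
        raise ValueError("ORF coordinates are not in frame")
    return coding_bases // 3


# NOV37a: ORF Start at 10, ORF Stop at 3763, reported length 1251 aa
print(orf_protein_length(10, 3763))
```

Under the same assumption, the coordinates given for the other clones in this section can be checked the same way.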



Further analysis of the NOV37a protein yielded the following properties shown in 
Table 37B. 



Table 37B. Protein Sequence Properties NOV37a 


PSort analysis: 


0.7000 probability located in nucleus; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV37a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 37C.
Table 37C. Geneseq Results for NOV37a


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV37a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


ABB59882 


Drosophila melanogaster 
polypeptide SEQ ID NO 6438 - 
Drosophila melanogaster, 1309 
aa. [WO200171042-A2, 27- 
SEP-2001] 


681..1083


213/418 (50%)
282/418 (66%)


e-114 




Human preoptic regulatory 
factor-2 (hPORF-2) protein #1 - 

Homo sapiens, 75 aa. 
[WO200142464-A2, 14-JUN- 
2001] 


1009..1083
1..75 


75/75 (100%) 
75/75 (100%) 


7e-36 


ABG06669 


Novel human diagnostic 
protein #6660 - Homo sapiens, 
322 aa. [WO200175067-A2, 
11-OCT-2001]


872..1091
28..253 


76/230 (33%) 
111/230 (48%) 


2e-17 


AAB42926 


Human ORFX ORF2690 
polypeptide sequence SEQ ID 
NO:5380 - Homo sapiens, 903 
aa. [WO200058473-A2, 05- 
OCT-2000] 


928..1250
84..447


100/377 (26%) 
142/377 (37%) 


2e-13 
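
The expect values in these tables use a shorthand in which a bare exponent such as e-114 stands for 1e-114. A minimal sketch of converting the printed strings to numbers so that hits can be ranked follows; the helper name `parse_expect` is hypothetical and not part of any BLAST tool.

```python
def parse_expect(text: str) -> float:
    """Convert a BLAST-style expect string to a float.

    A bare exponent such as "e-114" is read as 1e-114.
    """
    text = text.strip().lower()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


# Expect values as printed in Table 37C
values = ["e-114", "7e-36", "2e-17", "2e-13"]
ranked = sorted(values, key=parse_expect)  # smallest (most significant) first
print(ranked)
```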



In a BLAST search of public sequence databases, the NOV37a protein was found to
have homology to the proteins shown in the BLASTP data in Table 37D.
Table 37D. Public BLASTP Results for NOV37a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV37a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect
Value


Q9C0H5 


KIAA1688 PROTEIN -
Homo sapiens (Human),
1094 aa (fragment).


1..1083
12..1094


1083/1083 (100%) 
1083/1083 (100%) 


0.0 


Q9VDE9 


CG3421 PROTEIN - 
Drosophila melanogaster 
(Fruit fly), 1309 aa. 


681..1083
897..1309


213/418 (50%)
282/418 (66%) 


e-113 


P18890


Putative preoptic regulatory 
factor-2 precursor (PORF-2) 
- Rattus norvegicus (Rat), 75 
aa. 


1009..1083
1..75 


74/75 (98%) 
75/75 (99%) 




Q9BKW0 


HYPOTHETICAL 36.4
KDA PROTEIN -
Caenorhabditis elegans, 317
aa.


798..932 
179..307 


50/135 (37%) 
80/135 (59%) 


3e-20 


AAH23344 


SIMILAR TO 
HYPOTHETICAL 
PROTEIN 
DKFZP564B1162 - Mus
musculus (Mouse), 654 aa. 


872..1091 
22..247 


74/230 (32%) 
109/230 (47%) 


4e-17 



Pfam analysis predicts that the NOV37a protein contains the domains shown in
Table 37E.



Table 37E. Domain Analysis of NOV37a 


Pfam Domain 


NOV37a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


WW 


65..95 


12/31 (39%) 
19/31 (61%) 


0.0054 


MyTH4 


761..879


34/123 (28%) 
85/123 (69%) 


1.4e-19 


RhoGAP 


909..1056


52/171 (30%) 
105/171 (61%) 


7.3e-19 
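
The match regions in the domain table are inclusive residue ranges written as start..end. The sketch below shows how such a range could be turned into a residue count; the helper name is illustrative. The alignment length in the identities column (for example 123 for MyTH4) can differ from the span, presumably because alignment columns also count gaps.

```python
def span_length(region: str) -> int:
    """Number of residues covered by an inclusive 'start..end' range."""
    start, end = (int(part) for part in region.split(".."))
    return end - start + 1


# Match regions as listed for NOV37a in Table 37E
domains = {"WW": "65..95", "MyTH4": "761..879", "RhoGAP": "909..1056"}
spans = {name: span_length(region) for name, region in domains.items()}
print(spans)
```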



Example 38. 



The NOV38 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 38A.
Table 38A. NOV38 Sequence Analysis




SEQ ID NO: 119 


1269 bp 


NOV38a, 
CG96198-01 DNA Sequence


TTGAAGCAGAAGCTCCCACCTTCTCGGCAGCAGCCAAGGTCCGAGTGGGAGTGACCAT 




CGGGAACCCAGCCAGCTCCGCCCCTCTCCGGTCAGGAGACTCTTCATCCATTTAGCAG 
CCGCCGACTTACTAGTCACTTTTGTGGTTATGCCCCTAGATGCCACCTGGAATATCAC 
TGTTCAATGGCTGGCTGTGGACATCGCATGTCGGACACTGATGTTCCTGAAACTAATG 
GCCACGTATTCTGCAGCTTTCCTGCCTGTGGTCATTGGATTGGACCGCCAGGCAGCAG
TACTCAACCCGCTTGGATCCCGTTCAGGTGTAAGGAAACTTCTGGGGGCAGCCTGGGG 
ACTTAGTTTCCTGCTTGCCTTCCCCCAGGTGGCTGGCCCAGTCCCTTTCACTCAGTGT 
GTCACCAAAGGCAGCTTCAAGGCTCAATGGCAAGAGACCACCTATAACCTCTTCACCT 
TCTGCTGCCTCTTTCTGCTGCCACTGACTGCCATGGCCATCTGCTATAGCCGCATTGT




TGACCTTCATCCTCTGCTGGACACCTTATTACCTACTGGGTATGTGGTACTGGTTCTC 
CCCCACCATGCTAACTGAAGTCCCTCCCAGCCTGAGCCACATCCTTTTCCTCTTGGGC 




GCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCAGCCAGCTTTGACTA 
TTTTTATACTCCTCACAATTCTCAAGCACTTCTTAGGCAGAGCACAAAAACAAAAAAA 
AAAAAAAAACTTAGAAATATATGAATGCAAGAGATGGGAAGAGGAAAGACCATTTCCC 
AACTAGCTCAGAGTAGCCTGATCGTAACTCCAAACAGTTCACAAAATTCTCTTAGGCT 
GAAACTATTCATCTTTCCTAAATGTCCCTCATGTCTGCTAAAATCACTTTC 




ORF Start: ATG at 5 


ORF Stop: TGA at 1124




SEQ ID NO: 120


373 aa | MW at 41692.8kD


NOV38a, 
CG96198-01 Protein Sequence


MFHIX3EESPVCPRPTPVLTKPPGQHPFLATLQQAPPLVEAEAPTFSAAAKVRVGVTIV 
LFVSSAGGNIAVLWSVTRREPSQLRPSPVRRLFIHLAAADLLVTFVVMPLDATWNITV 
QWLAVDIACRTLMFLKLMATYSAAFLPWIGLDRQAAVLNPLGSRSGVRKLLGAAWGL 
SFLLAFPQVAGPVPFTQCVTKGSFKAQWQETTYNLFTFCCLFLLPLTAMAICYSRIVL 
SVSRPQTRKAGEFALPRSFDNCPRVRLRALRLALLILLTFILCWTPYYLLGMWYWFSP
TMLTEVPPSLSHILFLLGLLNAPLDPLLYGAFTSARLSLPKCWDYRREPPRPASFDYF 
YTPHNSQALLRQSTKTKKKKKLRNI 




SEQ ID NO: 121 


1551 bp 


NOV38b, 
CG96198-02 DNA Sequence


TTTCTTTTTTTTTGAGACGGTATTTCACTGTTGTTGCCCAGGCTAGAGTACAATGGCA 

CTCCCGAGTAGCTGGGATTACAGGCATGCGCCACCACACCCGGCTAATTTTGTATTTT 
TAGTAGAGACGGGGTTTCTCCTGCCCACCTTCTCGGCAGCAGCCAAGGTCCGAGTGGG 
AGTGACCATTGTGCTGTTTGTTTCTTCGGCTGGAGGGAACCTGGCAGTCCTGTGGTCA 




ATTTAGCAGCCGCCGACTTACTAGTCACTTTTGTGGTTATGCCCCTAGATGCCACCTG 
GAATATCACTGTTCAATGGCTGGCTGTGGACATCGCATGTCGGACACTGATGTTCCTG 
AAACTAATGGCCACGTATTCTGCAGCTTTCCTGCCTGTGGTCATTGGATTGGACCGCC 
AGGCAGCAGTACTCAACCCGCTTGGATCCCGTTCAGGTAAAAGAAAAAAGAAAGCAAT 
TATGGTTCCTGGCTGGGCGCTGAGGTTCATGCCTGTAATCCCAGCACTTTGGAAGGCT 
GAGGTGAGCAGATCAGGCCCAGTCCCTTTCACTCAGTGTGTCACCAAAGGCAGCTTCA 
AGGCTCAATGGCAAGAGACCACCTATAACCTCTTCACCTTCTGCTCCCTCTTTCTGCT 
GCCACTGACTGCCATGGCCATCTGCTATAGCCGCATTGTCCTCAGTGTGTCCAGGGAA 
TTTGCCCTCCCCCGCTCCTTTGACAATTGTCCCCGTGTTCGTCTCCGGGCCCTGAGAC 
TGGCCCTGCTTATCTTGCTGACCTTCATCCTCTGCTGGACACCTTATTACCTACTGGG 




CATGGCCAGCCGGGGTGGGGGCCGGGGTCGTGGCCGGGGCCAGTTGACCTTCAACGTG 
GAGGCCGTGGGCATTGGGCTCCACAGTAGGTACTGTGATCATTTTAAAAATCAGTTGG 
TTACATACCTACTAAAAATCCTTGTTGTTGTTGTTTTGAGACGTAGTCTCGCTCTATC 
ACCCAGGCTAAGAGTGCAGTGGTGCAGTCTCGGCTCACTGCATCCTCTGCCTCCCGGG 


AGCCTCCTGAGTACCTGGGACTACAGGCGCCCCCCCCCACACACACACACACACCCGG 
CTAATTTTTGTATTTTTAGTAGAGACGGGATTTCACCATGTTG 
ORF Start: ATG at 1


ORF Stop: TAA at 1510 




SEQ ID NO: 122 


503 aa | MW at 56199.8kD 


NOV38b, 
CG96198-02 Protein Sequence


MFIS5HYKVSSRGAEALFVFSFFLRRYFTWAQARVQWHDLGSPQTLPPGFKRFSCLN 
LPSSWDYRHAPPHPMIFVFLVETGFLLPTFSAAAKVRVGVTIVLFVSSAGGNLAVLWS 
VTRREPSQLRPSPWRLFIHIAAADLLVTFVVMPLDATWNITVQWLAVDIACRTLMFL 
KLMATYSAAFLPWIGLDRQAAVIjMPLGSRSGKRKKKAIMVPGWALRFMPVIPALWKA 
EVSRSGPVPFTQCVTKGSFKAOWQETTYNLFTFCCLFLLPLTAMAICYSRIVLSVSRE 
FALPRSFDNCPRVRLRALRLALLILLTFILCWTPYYLLGMWYWFSPTMLTEVPPSLSH 
ILFLLGLLMAPLDPLLYGAFTLGCRRLFTGSLNTQAPSTMASRGGGRGRGRGQLTFNV 
EAVGIGLHSRYCDHFICNQLVTYLLKILWWLRRSLALSPRLRVQWCSLGSLHPLPPG 
FKQFSCLKPPEYLGLQAPSKPPEYLGLQAPPPTHTHTPG 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 38B. 



Table 38B. Comparison of NOV38a against NOV38b. 



Protein Sequence 


NOV38a Residues/
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV38b 


43..323 
86..369 


214/291 (73%) 
219/291 (74%) 
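
The identity percentage in the comparison table can be reproduced from the raw counts. The sketch below assumes that the percentage is truncated to a whole number rather than rounded; this is an inference from the printed identity values (214/291 is about 73.5%, printed as 73%), not something the text states, and it is only checked here against identity figures.

```python
def percent_truncated(matches: int, aligned: int) -> int:
    """Whole-number percentage, truncated toward zero (assumed convention)."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return (100 * matches) // aligned


# NOV38a vs NOV38b identities: 214 matches over 291 aligned positions
print(percent_truncated(214, 291))
```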



Further analysis of the NOV38a protein yielded the following properties shown in 
Table 38C. 



Table 38C. Protein Sequence Properties NOV38a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.3000 
probability located in microbody (peroxisome) 


SignalP analysis: 


Cleavage site between residues 70 and 71



A search of the NOV38a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 38D.
Table 38D. Geneseq Results for NOV38a


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date]


NOV38a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU10819 


Human Type II GnRH-R 
splice variant 1 protein #2 - 
Homo sapiens, 377 aa. 
[WO200178796-A1, 25-
OCT-2001] 


40..323 
26..322 


278/297 (93%) 
280/297 (93%) 


e-156 


AAU10816 


Human Type II GnRH-R 
splice variant 3 protein - 
Homo sapiens, 366 aa. 
[WO200178796-A1, 25-
OCT-2001]


40..323
15..311


278/297 (93%) 
280/297 (93%) 


e-156 


AAU10815 


Human Type II GnRH-R

Homo sapiens, 376 aa. 
[WO200178796-A1, 25-
OCT-2001] 


40..323 


278/297 (93%) 
280/297 (93%)


e-156 


AAU10814 


Human Type II GnRH-R 
splice variant 1 protein #1 - 
Homo sapiens, 377 aa. 
[WO200178796-A1, 25-
OCT-2001]


40..323
26..322


278/297 (93%)
280/297 (93%) 


e-156 


AAU10813


Human Type II GnRH-R - 
Homo sapiens, 379 aa. 
[WO200178796-A1, 25-
OCT-2001] 


40..323 
28..324 


278/297 (93%) 
280/297 (93%) 


e-156 



In a BLAST search of public sequence databases, the NOV38a protein was found to
have homology to the proteins shown in the BLASTP data in Table 38E.
Table 38E. Public BLASTP Results for NOV38a


Protein
Accession
Number


Protein/Organism/Length


NOV38a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96P88 


TYPE II GONADOTROPIN-
RELEASING HORMONE 
RECEPTOR - Homo sapiens 
(Human), 379 aa (fragment). 


40..323 
28..324


281/297 (94%) 
283/297 (94%) 


e-158 


CAD12280 


SEQUENCE 9 FROM 
PATENT WO0178796 - Homo
sapiens (Human), 366 aa.


40..323
15..311


278/297 (93%) 
280/297 (93%) 


e-156 


CAD12279 


SEQUENCE 7 FROM 
PATENT WO0178796 - Homo
sapiens (Human), 376 aa. 


40..323 
25..321 


278/297 (93%) 
280/297 (93%) 


e-156 


CAD12278 


SEQUENCE 5 FROM 
PATENT WO0178796 - Homo
sapiens (Human), 377 aa.


40..323
26..322


278/297 (93%) 
280/297 (93%) 


e-156 


CAD12277


SEQUENCE 3 FROM
PATENT WO0178796 - Homo
sapiens (Human), 379 aa.


40..323
28..324


278/297 (93%) 
280/297 (93%) 


e-156 



Pfam analysis predicts that the NOV38a protein contains the domains shown in
Table 38F.



Table 38F. Domain Analysis of NOV38a 


Pfam Domain 


NOV38a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_1


66..319


60/287 (21%) 
191/287 (67%) 


9.7e-41 



Example 39. 



The NOV39 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 39A. 
Table 39A. NOV39 Sequence Analysis




SEQ ID NO: 123


1047 bp 


NOV39a, 
CG96231-01 DNA Sequence 


ATGTTTGGCCCCGCTAAAGGTCGCCATTTTGGAGTCCACCCGGCGCCTGGTTTCCCCG
GCGGCGTCTCCCAACAGGCTGCCGGGACCAAAGCTGGCCCCGCGGGTGCCTGGCCTGT 
GGGCAGCCGGACCGACACGATGTGGCGGCTCCGCTGCAAGGCCAAGGACGGCACCCAT 
GTTTTGCAGGGGCTGTCCAGCCGGACCCGGGTGCGGGAACTCCAGGGCCAAATTGCCG 
CCATCACCGGGATCGCCCCCGGCGGTCAGCGAATCCTCGTCGGATACCCTCCCGAGTG 
CCTGGATCTCAGCAATGGGGATACCATTCTGGAAGACTTGCCCATCCAATCTGGTGAC 
ATGCTGATCATTGAAGAAGACCAAACCAGGCCCAGAAGTTCACCTGCATTTACTAAAC 
GTGGTGCTTCTAGTTACGTCAGGGAAACTTTGCCTGTGCTTACCAGAACCGTGGTCCC 
AGCAGACAACTCTTGCCTCTTTACTAGTGTGTACTATGTCGTCGAAGGAGGAGTCTTG 
AATCCAGCTTGTGCCCCTGAGATGAGACGCCTCATAGCACAAATTGTAGCAAGCGATC 
CAGACTTCTATAGTGAGGCAATACTGGGAAAAACAAATCAAGAGTACTGTGACTGGAT
CAAAAGGGATGACACTTGGGGAGGAGCAATAGAGATATCGATTTTGTCCAAGTTTTAC 
CAATGTGAAATATGTGTAGTGGATACACAGACAGTAAGAATTGATCGTTTTGGGGAAG 
ATGCAGGATATACCAAAAGGGTTCTGCTTATTTATGATGGCATCCACTATGATCCACT 
TCAGCGTAACTTCCCTGATCCAGATACACCTCCTCTGACCATTTTCTCCTCTAATGAT 
GATATTGTTCTTGTACAAGCACTGGAATTAGCAGATGAAGCTAGAAGAAGGAGACAGT 
TTACTGATGTCAACCGCTTCACCCTGAGATGCATGGTATGTCAGAAAGGATTAACTGG 
ACAAGCAGAAGCAAGGGAACATGCCAAGGAGACAGGCCATACCAACTTTGGAGAAGTG 
TGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 1045 




SEQ ID NO: 124 


348 aa | MW at 38322.0kD


NOV39a,
CG96231-01 Protein Sequence 


MFGPAKGRHFGVKPAPGFPGGVSQQAAGTKAGPAGAWPVGSRTDTMWRLRCKAKDGTH 
VLQGLSSRTRWELQGQIAAITGIAPGGQRILVGYPPECLDLSNGDTILEDLPIQSGD 
MLIIREDQTRPRSSPAFTKRGASSYVRETLPVLTRTWPADNSCLFTSVYYWEGGVL 
NPACAPEMRRLIAQIVASDPDFYSEAILGKTNQEYCDWIKRDDTWGGAIEISILSKFY
QCEICWDTQTVRIDRFGEDAGYTKRVLLIYDGIHYDPLQRNFPDPDTPPLTIFSSND
DIVLVQALELADEARRRRQFTDVNRFTLRCHVCQKGLTGQAEAREHAKETGHTMFQEV 




SEQ ID NO: 125 


1047 bp 


NOV39b, 
CG96231-02 DNA Sequence 


ATGTTTGGCCCCGCTAAAGGTCGCCATTTTGGAGTCCACCCGGCGCCTGGTTTCCCCG 
GCGGCGTCTCCCAACAGGCTGCCGGGACCAAAGCTGGCCCCGCGGGTGCCTGGCCTGT 
GGGCAGCCGGACCGACACGATGTGGCGGCTCCGCTGCAAGGCCAAGGACGGCACCCAT 
GTTTTGCAGGGGCTGTCCAGCCGGACCCGGGTGCGGGAACTCCAGGGCCAAATTGCCG 
CCATCACCGGGATCGCCCCCGGCGGTCAGCGAATCCTCGTCGGATACCCTCCCGAGTG 
CCTGGATCTCAGCAATGGGGATACCATTCTGGAAGACTTGCCCATCCAATCTGGTGAC 
ATGCTGATCATTGAAGAAGACCAAACCAGGCCCAGAAGTTCACCTGCATTTACTAAAC 
GTGGTGCTTCTAGTTACGTCAGGGAAACTTTGCCTGTGCTTACCAGAACCGTGGTCCC 
AGCAGACAACTCTTGCCTCTTTACTAGTGTGTACTATGTCGTCGAAGGAGGAGTCTTG 
AATCCAGCTTGTGCCCCTGAGATGAGACGCCTCATAGCACAAATTGTAGCAAGCGATC 
CAGACTTCTATAGTGAGGCAATACTGGGAAAAACAAATCAAGAGTACTGTGACTGGAT 
CAAAAGGGATGACACTTGGGGAGGAGCAATAGAGATATCGATTTTGTCCAAGTTTTAC 
CAATGTGAAATATGTGTAGTGGATACACAGACAGTAAGAATTGATCGTTTTGGGGAAG 
ATGCAGGATATACCAAAAGGGTTCTGCTTATTTATGATGGCATCCACTATGATCCACT 
TCAGCGTAACTTCCCTGATCCAGATACACCTCCTCTGACCATTTTCTCCTCTAATGAT 
GATATTGTTCTTGTACAAGCACTGGAATTAGCAGATGAAGCTAGAAGAAGGAGACAGT 
TTACTGATGTCAACCGCTTCACCCTGAGATGCATGGTATGTCAGAAAGGATTAACTGG 
ACAAGCAGAAGCAAGGGAACATGCCAAGGAGACAGGCCATACCAACTTTGGAGAAGTG 
TGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 1045 




SEQ ID NO: 126 


348 aa | MW at 38322.0kD 


NOV39b, 
CG96231-02 Protein Sequence


MFGPAKGRHFGVHPAPGFPGGVSQQAAGTKAGPAGAWPVGSRTDTMWRLRCKAKDGTH 
VLQGLSSRTRVRELQGQIAAITGIAPGGQRILVGYPPECLDLSNGDTILEDLPIQSGD 
MLIIEEDQTRPRSSPAFTKRGASSYVRETLPVLTRTWPADNSCLFTSVYYWEGGVL 
NPACAPEMRRLIAQIVASDPDFYSEAILGKTNQEYCDWIKRDDTWGGAIEISILSKFY 
QCEICWDTQTVRIDRFGEDAGYTKRVLLIYDGIHYDPLQRNFPDPDTPPLTIFSSND 
DIVLVQALELADEARRRRQFTDVNRFTLRCMVCQKGLTGQAEAREHAKETGHTNFGEV 
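
The relationship between a listed DNA sequence and its protein sequence can be illustrated by translating the start of the NOV39b open reading frame. The sketch below uses the standard genetic code and assumes the ORF begins at position 1, as stated for this clone; the function and variable names are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First printed line of the NOV39b DNA sequence (58 bases, ORF starts at 1)
first_line = "ATGTTTGGCCCCGCTAAAGGTCGCCATTTTGGAGTCCACCCGGCGCCTGGTTTCCCCG"
print(translate(first_line))
```

The 19 residues produced from this line agree with the beginning of the NOV39b protein sequence listed above.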



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 39B. 
Table 39B. Comparison of NOV39a against NOV39b.


Protein Sequence 


NOV39a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV39b 


1..348 
1..348 


344/348 (98%) 
344/348 (98%) 



Further analysis of the NOV39a protein yielded the following properties shown in 
Table 39C. 



Table 39C. Protein Sequence Properties NOV39a 


PSort analysis: 


0.3000 probability located in nucleus; 0.2271 probability located in lysosome 
(lumen); 0.1109 probability located in microbody (peroxisome); 0.1000 probability
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV39a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 39D.
Table 39D. Geneseq Results for NOV39a


Geneseq 
Identifier 


Protein/Organism/Length
[Patent #, Date]


NOV39a Residues/ 
Match Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


ABB60807


Drosophila melanogaster 
polypeptide SEQ ID NO 9213 
- Drosophila melanogaster, 
347 aa. [WO200171042-A2, 
27-SEP-2001] 


49..348 
7..347


144/343 
(41%) 

196/343 
(56%) 


2e-64 


AAG53575


Arabidopsis thaliana protein 
fragment SEQ ID NO: 68223 - 
Arabidopsis thaliana, 146 aa. 
[EP1033405-A2, 06-SEP-
2000]


148..271
4..124


63/124 
(50%) 
87/124 
(69%) 


8e-31 


AAG20460


Arabidopsis thaliana protein 
fragment SEQ ID NO: 22659 - 
Arabidopsis thaliana, 146 aa. 
[EP1033405-A2, 06-SEP- 
2000] 


148..271 
4..124


63/124
(50%)
87/124
(69%)


8e-31 


AAG53576 



Arabidopsis thaliana protein 
fragment SEQ ID NO: 68224 - 
Arabidopsis thaliana, 122 aa. 
[EP1033405-A2, 06-SEP- 
2000] 


179..271 
8..100 


52/93 
(55%) 
67/93 
(71%) 


1e-24


AAG20461 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 22660 -
Arabidopsis thaliana, 122 aa. 
[EP1033405-A2, 06-SEP- 
2000] 


179..271 
8..100 


52/93 
(55%) 
67/93 
(71%) 


1e-24



In a BLAST search of public sequence databases, the NOV39a protein was found to
have homology to the proteins shown in the BLASTP data in Table 39E.






WO 03/010327 



PCT/US02/14199 



Table 39E. Public BLASTP Results for NOV39a 




Protein 
Accession 
Number 


Protein/Organism/Length 


NOV39a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAL39440 


GM14814P-Drosophila 
melanogaster (Fruit fly), 
347 aa. 


49..348 
7..347


144/343 (41%) 
196/343 (56%) 


5e-64 


Q9VRJ9 


CG4603 PROTEIN - 
Drosophila melanogaster 
(Fruit fly), 347 aa. 


49..348 
7..347 


144/343 (41%) 
196/343 (56%) 


5e-64 


Q9LPT6 



F11F12.1 PROTEIN 
(HYPOTHETICAL 23.4 
KDA PROTEIN) - 

Arabidopsis thaliana 
(Mouse-ear cress), 208 aa. 


148..347 
4..206 


96/206 (46%) 
131/206 (62%) 


1e-44


O13974


HYPOTHETICAL 35.7 
KDA PROTEIN 
C24C9.14 IN
CHROMOSOME I - 
Schizosaccharomyces 
pombe (Fission yeast), 
329 aa. 


48..347
5..328


101/333 (30%) 
161/333 (48%) 


4e-36 


P43558 


Hypothetical 33.5 kDa 
protein in SEC53-FET5 

Saccharomyces cerevisiae 
(Baker's yeast), 301 aa. 


92..347
39..299 


90/278 (32%) 
144/278 (51%) 


7e-33 



Pfam analysis predicts that the NOV39a protein contains the domains shown in
Table 39F.



Table 39F. Domain Analysis of NOV39a 


Pfam Domain 


NOV39a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


OTU 


154..269 


40/135 (30%) 
87/135 (64%) 


1.1e-18



Example 40. 



The NOV40 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 40A. 
Table 40A. NOV40 Sequence Analysis




SEQ ID NO: 127 


591 bp 


NOV40a, 
CG96260-01 DNA Sequence 



AACAGAGGTTCAGCTGGGAGACCCCTCACACACAGGTGAAGGCGTGGCTGTAGAACCT 
CAATGGACCCCCTGGTCTCCTCAGGAATGAAGGTCATTGCCATCCTCACCCTCCTCCT 
CTTCTGCTCCCCCACCCACAGCAGTTTCTGGCAGTTTCAGAGGAGGGTCAAACACATC 
ACGGGGCGAAGTGCCTTCTTCTCATATTACGGATATGGCTGCTACTGTGGGCTTGGGG 
ATAAAGGGATCCCCGTGGATGACACTGACAGGCACAGCCCCTCATCTCCCTCTCCCTA 
CGAGAAGCTGAAGGAGTTCAGCTGCCAGCCTGTGTTGAACAGCTACCAGTTCCACATC 
GTCAATGGCGCAGTGGTAGGTGGATGCACCCTTGGTCCTGGTGCCAGCTGCCACTGCA 
GGCTGAAGGCCTGTGAGTGTGACAAGCAATCCGTGCACTGCTTCAAAGAGAGCCTGCC 
CACCTATGAGAAAAACTTCAAGCAGTTCTCCAGCCAGCCCAGGTGTGGCAGACATAAG 
CCCTGGTGCTAGGGACACCACAGGGTCCCTCTCATCATCCAGCATCCGCTCTAGTGTT 
GCTCTTCCAGG 




ORF Start: ATG at 61


ORF Stop: TAG at 532 




SEQ ID NO: 128 


157 aa | MW at 17585.0kD


NOV40a,
CG96260-01 Protein Sequence


MDPLVSSGMKVIAILTLLLFCSPTHSSFWQFQRRVKHITGRSAFFSYYGYGCYCGLGD 
KGIPVDDTDRHSPSSPSPYEKLKEFSCQPVLNSYQFHIVNGAVVGGCTLGPGASCHCR
LKACECDKQSVHCFKESLPTYEKNFKQFSSQPRCGRHKPWC 
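
The molecular weight reported for NOV40a can be sanity-checked against its length. The sketch below uses the common rule of thumb of roughly 110 daltons per amino acid residue, which is an approximation and not the method used to produce the listed value. On that basis the listed figure of 17585.0 reads naturally as a mass in daltons, that is, about 17.6 kDa.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average; an approximation


def estimate_mass_da(n_residues: int) -> float:
    """Rough protein mass in daltons from the residue count."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA


reported = 17585.0  # value listed for NOV40a (157 aa)
estimate = estimate_mass_da(157)
relative_difference = abs(estimate - reported) / reported
print(estimate, round(relative_difference, 3))
```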



Further analysis of the NOV40a protein yielded the following properties shown in 
Table 40B. 



Table 40B. Protein Sequence Properties NOV40a 


PSort analysis: 


0.8200 probability located in endoplasmic reticulum (membrane); 0.2222 probability 
located in microbody (peroxisome); 0.1900 probability located in plasma membrane; 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 27 and 28 



A search of the NOV40a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 40C.
Table 40C. Geneseq Results for NOV40a


Geneseq 
Identifier


Protein/Organism/Length 
[Patent #, Date] 


NOV40a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG63223


Amino acid sequence of a 
human lipid metabolism 
enzyme - Homo sapiens, 152 
aa. [WO200153468-A2, 26- 
JUL-2001] 


11..157


139/151 (92%) 
i /in/ 1 > i /rno/\ 


1e-82


AAR63044 


RPLA2-8 - Rattus sp, 158 aa. 
[WO9502328-A, 26-JAN- 
1995] 


1..158


106/160 (66%)

121/160 (75%) 




AAB12536


Mouse secretory
phospholipase A2 protein
sequence SEQ ID NO:14 -
Mus musculus, 144 aa.
[WO200034486-A1, 15-JUN-
2000] 


17..157 
10..144


48/141 (34%) 
70/141 (49%) 


1e-17


AAB11994 


Mouse secreted phospholipase 
A2 - Mus musculus, 144 aa. 
[JP2000166544-A, 20-JUN-
2000]


17..157
10..144 


48/141 (34%) 
70/141 (49%) 


1e-17


AAB56432 


Human prostate cancer antigen 

protein sequence SEQ ID 
NO: 1010 - Homo sapiens, 164
aa. [WO200055174-A1, 21-
SEP-2000] 


9..150 
21..164 


48/150 (32%)
72/150 (48%) 


2e-16 



In a BLAST search of public sequence databases, the NOV40a protein was found to
have homology to the proteins shown in the BLASTP data in Table 40D.






Table 40D. Public BLASTP Results for NOV40a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV40a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


B54762 


phospholipase A2 (EC 3.1.1.4)
RPLA2-8 precursor - rat, 158 aa. 


1..157
1..158


106/160 (66%)
121/160 (75%) 


2e-55 


P39878 


Group IIC secretory phospholipase 
A2 precursor (EC 3.1.1.4) 
(Phosphatidylcholine 2- 
acylhydrolase GIIC) (GIIC
sPLA2) (PLA2-8) (14 kDa 
phospholipase A2) - Rattus 
norvegicus (Rat), 150 aa. 


9..157 
1..150


99/152 (65%)
114/152 (74%) 


8e-52 


P48076 


Group IIC secretory phospholipase 
A2 precursor (EC 3.1.1.4) 
(Phosphatidylcholine 2- 
acylhydrolase GIIC) (GIIC 
sPLA2) (PLA2-8) (14 kDa
phospholipase A2) - Mus 
musculus (Mouse), 150 aa. 


9..157 
1..150 


98/152 (64%) 
114/152 (74%) 


9e-51 


A54762 


phospholipase A2 (EC 3.1.1.4) 
MPL2-8 - mouse, 130 aa
(fragment).


27..157
1..130


90/132 (68%) 
103/132 (77%) 


5e-48 


A60718 


phospholipase A2 homolog, non- 
pancreatic - human, 50 aa 
(fragment). 


22..68
4..50 


46/47 (97%) 
47/47 (99%) 


6e-23 



Pfam analysis predicts that the NOV40a protein contains the domains shown in
Table 40E.



Table 40E. Domain Analysis of NOV40a 


Pfam Domain 


NOV40a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


phoslip 


27..68 


19/43 (44%) 
31/43 (72%)


1.7e-13


phoslip 


77..150


20/82 (24%)
52/82 (63%) 


4.2e-06 



Example 41. 



The NOV41 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 41A.
Table 41A. NOV41 Sequence Analysis




SEQ ID NO: 129 


952 bp 


NOV41a, 
CG96364-01 DNA Sequence


AGGACTTCCTGGCAGGTGGAGTGGCAGCAGCCATCTCCAAGATGGGGGTAGCGCCCAT 
CGAGAGGGTCAAGCTCCTGCTGGAGGTACAGCATGCCAGCAAGCAGATCACCGCAGAT 
ATGCAATACAAGGGCATTATAGACTGCGTGGTCCATATTCTCAAGGAGCAGGGAGTCC 
TGTCCTTCTGGCACCGTAACCTGGTCAATGTCCTCAGATACTTCCCCACCCAGGCTCT 
CAACTTGGCCTTCAAAGATAAACACAAGCAGATCTTCCTGGGTGGTGTGGACAAGAGG 
ACCCAGTTTGGGAGCTACTTTGCAGGGAATCCGGCATCAGGTGGTGCCGCTGGGGCCA 
CATCCTTGTGTTTTGTGTACCCTCTTGATTTTGCCCATACCTGTCTAGCAGCTGACGT 
GGGTAAAGCTGGAGCTGAAAGGGAATTCCGAGGCCTCGGTAACTGCCTGGTTAAGATC 
TACGAAACTGATGGGATTAAGGGCCTGTACCAAGGCTTTAACGTGTCTGTGCAGGGTA 
TTATCATCTACCGAGCTGCCTACTTTGGTATCTATGACACTGCAAAGGGAATGCTTCC 
GGATCCCAGGAACACTCACATCATCATCAGCTGGATGATCACTCAGACTGTCACTGCC 
ATTGCTGGGTTGACTTCCCATCCATTTGACACCGTTCGATGCCGCATGATGATGCAGT 
CAGGGCGCAAAGGAATTGACGTCATGTACACAGGCACGCTTGACTGCTGGAGGAAGAT 
TGTTGGTGATGAAGGAGGCAAAGATTTTTTCAAGGGTGCATGGTCCAGTGTTCTCAGA 




GGCATGGATGGTGCTTTTGTGCTTGTCTTGTATGGTGAAATCAAGAAGTACACATAAG 
TTATTTCCTGGGATTTCCCCCACT 




ORF Start: ATG at 31 


ORF Stop: TAA at 925 




SEQ ID NO: 130 


298 aa | MW at 32668.6kD


NOV41a, 
CG96364-01 Protein Sequence 


MTDAAVSFTKDFLAGGVAAAISKMGVAPIERVKLLLEVQHASKQITADMQYKGIIDCV 
VHILKEQGVLSFWHRNLVNVLRYFPTQALNLAFKDKHKQIFLGGVDKRTQFGSYFAGN 
PASGGAAGATSLCFVYPLDFAHTCLAADVGKAGAEREFRGLGNCLVKIYETDGIKGLY
QGFNVSVQGIIIYRAAYFGIYDTAKGMLPDPRNTHIIISWMITQTVTAIAGLTSHPFD 
TVRCRMMMQSGRKGIDVMYTGTLDCWRKIVGDEGGKDFFKGAWSSVLRGMDGAFVLVL 
YGEIKKYT 




SEQ ID NO: 131 


745 bp 


NOV41b,
CG96364-03 DNA Sequence 


CTTCTTTCAACATGACAGATGCCGCATTGTCCTTCGCCAAGGACTTCCTGGCAGGTGG 


AGTGGCCGCAGCCATCTCCAAGACGGCGGTAGCGCCCATCGAGCGGGTCAAGCTGCTG 
CTGCAGGTGCAGCATGCCAGCAAGCAGATCACTGCAGATAAGCAATACAAAGGCATTA 




AAATACAAGCAGATCTTCCTGGGTGGTGTGGACAAGAGAACCCAGTTTTGGCGCTACT 
TTGCAGGGAATCTGGCATCGGGTGGTGCCGCAGGGGCCACATCCCTGTGTTTTGTGTA 
CCCTCTTGATTTTGCCCGTACCCGTCTAGCAGCTGATGTGGGTAAAGCTGGAGCTGAA 
AGGGAATTCCGAGGCCTCGGTGACTGCCTGGTTAAGATCTACAAATCTGATGGGATTA 
AGGGCCTGTACCAAGGCCTTGACTGCTGGCGGAAGATTGCTCGTGATGAAGGAGGCAA 
AGCTTTTTTCAAGGGTGCATGGTCCAATGTTCTCAGAGGCATGGGTGGTGCTTTTGTG 
CTTGTCTTGTATGATGAAATCAAGAAGTACACATAAGTTATTTCCTAGG 




ORF Start: ATG at 70 


ORF Stop: TAA at 730 




SEQ ID NO: 132 


220 aa | MW at 24272.9kD 


NOV41b,
CG96364-03 Protein Sequence 


MTDAALSFAKDFLAGGVAAAISKTAVAPIERVKLLLQVQHASKQITADKQYKGIIDCV 
VRIPKEQEVLSFWRGNLANVIRYFPTQALNFAFKDKYKQIFLGGVDKRTQFWRYFAGN 
LASGGAAGATSLCFVYPLDFARTRLAADVGKAGAEREFRGLGDCLVKIYKSDGIKGLY 
QGLDCWRKIARDEGGKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYT 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 41B. 



Table 41B. Comparison of NOV41a against NOV41b. 


Protein Sequence 


NOV41a Residues/
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV41b 


1..178
1..178


142/178 (79%) 
150/178 (83%) 
Further analysis of the NOV41a protein yielded the following properties shown in
Table 41C.



Table 41C. Protein Sequence Properties NOV41a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.3600 probability located in 
mitochondrial matrix space; 0.3000 probability located in mitochondrial 
intermembrane space; 0.2883 probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV41a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 41D.



Table 41D. Geneseq Results for NOV41a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV41a Residues/
Match Residues


Identities/
Similarities for
the Matched
Region


Expect 
Value 


AAU10379 


Human adenine nucleotide 
translocator 2 (ANT2) - Homo 

sapiens, 298 aa. 
[WO200185944-A2, 15-NOV-
2001]


1..298
1..298 


261/298 (87%) 
274/298 (91%) 


e-150 


AAU01199


Human adenine nucleotide 
translocator-2 (ANT-2) protein - 

Homo sapiens, 298 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


1..298
1..298 


261/298 (87%) 
274/298 (91%) 


e-150 


AAY71032


Human adenine nucleotide
translocator ANT2 - Homo
sapiens, 298 aa.
[WO200026370-A2, 11-MAY-
2000] 


1..298 
1..298 


261/298 (87%) 
274/298 (91%) 


e-150 


AAU10380


Human adenine nucleotide 
translocator 3 (ANT3) - Homo 

sapiens, 298 aa. 
[WO200185944-A2, 15-NOV-
2001] 


1..296 
1..296 


242/296 (81%) 
267/296 (89%) 


e-143 


AAU01200 


Human adenine nucleotide 
translocator-3 (ANT-3) protein - 

Homo sapiens, 298 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


1..296
1..296 


242/296 (81%) 
267/296 (89%) 


e-143 
In a BLAST search of public sequence databases, the NOV41a protein was found to
have homology to the proteins shown in the BLASTP data in Table 41E.



Table 41E. Public BLASTP Results for NOV41a 



Protein
Accession
Number


Protein/Organism/Length 


NOV41a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


P05141


ADP,ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator 2) 
(ANT 2) - Homo sapiens 
(Human), 298 aa. 


1..298 
1..298 


263/298 (88%) 
275/298 (92%) 


e-151 


Q09073 


ADP,ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator 2) 
(ANT 2) - Rattus norvegicus 
(Rat), 298 aa. 


1..298 
1..298 


261/298 (87%) 
274/298 (91%) 


e-150 


A29132


ADP,ATP carrier protein T2 - 
human, 298 aa. 


1..298 
1..298


261/298 (87%) 
274/298 (91%) 


e-150 


P51881


ADP,ATP carrier protein,
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator 2) 
(ANT 2) - Mus musculus 
(Mouse), 298 aa. 


1..298 
1..298 


260/298 (87%) 
273/298 (91%) 


e-149 


BAB84673 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 2 - Bos
taurus (Bovine), 298 aa. 


1..298 
1..298




259/298 (86%) 
273/298 (90%) 


e-149 



Pfam analysis predicts that the NOV41a protein contains the domains shown in
Table 41F.



Table 41F. Domain Analysis of NOV41a 


Pfam Domain 


NOV41a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mito_carr


7..105


34/125 (27%) 
85/125 (68%) 


4.6e-22 


mito_carr 


112..208 


35/125 (28%) 
82/125 (66%) 


7.4e-22 


mito_carr


209..298 


27/125 (22%) 
66/125 (53%) 


3.2e-09 
Example 42.

The NOV42 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 42A. 



Table 42A. NOV42 Sequence Analysis 




SEQ ID NO: 133


936 bp 


NOV42a, 
CG96422-01 DNA Sequence 


GCCACCATGACGGAACAAGCCATCTCCTTCGCCAAGGACTTCCTAGCTGGAGGCATCA 
CCGCCGCTATCTCCAAGACAGCCGTGGCATCGATCAAGCGGGTCCAGCTGCTGCTGCA 
GATGCAGCACGCCAGCATGCCGATGGCCGCCGCCAAGCAGTGCAAGGGCATCGTGGAC 
TGCATCGTCCGCATCCCCAAAGATCAGGGCGTGCTGTCCTTCTGGAGGGGCAACCTGG 
CCAATGTCATCCGCTACTCCCCCACGCAAGCCCTCAATTTCGCCTTCAAGGATAAGTA 
CAAGCAGATCTTCCTGGCGGGCGTGGACAAGCACACGCAGTTCTGCAGGTACTTTGCG 
GGCAACCTGGCCTCTGGCGGCACGGCCGTCGTGTATCCCCTGGATTTCACCAGAACCC 
GCCTGGCAGCCGACGTGGGAAAGTCAGGCACGGAGCGCGAGTTCCGAGGCCTGGGAGA 
CTGCCTAGTGAAGATCAGCAAGTCCGACGGCATCCGGGGCCTTTACCAGGGCTTCAGC 
GTCTCCGTGCAGGCCATCATCATCTACCAGGCAGCCTACTTCAGGGTGTACGATACGG 
CCAATGGCATGTTCCCCGACCCCAAGAACACACACATCCTGGTGAGCTGGATGACCGC 




GTCCAACGCCCTCAAAGGCATGGGGGTGGGGGCGGGCTTCGTGCTGGTCCTGTACGAC 
GAGCTGAAGGTCAGCTAAGGGCCTTGGCATCCTCCACACACACACCAAAGGAACCAAG 
AGAACCAC 




ORF Start: ATG at 7 


ORF Stop: TAA at 886 




SEQ ID NO: 134


293 aa | MW at 32354.3kD 


NOV42a, 
CG96422-01 Protein Sequence 


MTEQAISFAKDFLAGGITAAISKTAVASIKRVQLLLQMQHASMPMAAAKQCKGIVDCI 
VRIPKDQGVLSFWRGNLANVIRYSPTQALNFAFKDKYKQIFLAGVDKHTQFCRYFAGN 
LASGGTAVVYPLDFTRTRLAADVGKSGTEREFRGLGDCLVKISKSDGIRGLYQGFSVS 
VQAIIIYQAAYFRVYDTANGMFPDPKNTHILVSWMTAQTVTAVAGVLSKPFDTVRRRT 
MMQSRRKGADIMYTGTVDCWRKIFRDERGEAFFKGVWSNALKGMGVGAGFVLVLYDEL 
KVS 
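
The ORF coordinates and the protein length reported in Table 42A can be cross-checked with simple arithmetic. The following is a minimal Python sketch and not part of the original disclosure; the function name `orf_protein_length` is hypothetical, and it assumes that both positions are 1-based and refer to the first base of the start codon and of the stop codon.

```python
def orf_protein_length(orf_start: int, stop_codon_start: int) -> int:
    """Number of residues encoded between a start codon and a stop codon.

    Both arguments are assumed to be 1-based positions of the first base
    of the respective codon, as read from the "ORF Start" and "ORF Stop"
    rows of the sequence tables.
    """
    coding_bases = stop_codon_start - orf_start  # the stop codon is excluded
    if coding_bases <= 0 or coding_bases % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return coding_bases // 3


# NOV42a: ORF Start ATG at 7, ORF Stop TAA at 886, reported length 293 aa.
print(orf_protein_length(7, 886))  # 293
```

Under the same assumptions, the NOV43a coordinates (ATG at 1, TAA at 856) give 285 residues, in agreement with the 285 aa reported for that protein.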



Further analysis of the NOV42a protein yielded the following properties shown in 
Table 42B. 



Table 42B. Protein Sequence Properties NOV42a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located 
in cytoplasm; 0.2809 probability located in lysosome (lumen); 0.1000 probability 
located in mitochondrial matrix space 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV42a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 42C. 



Table 42C. Geneseq Results for NOV42a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV42a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAU10380 


Human adenine nucleotide 
translocator 3 (ANT3) - 
Homo sapiens, 298 aa. 
[WO200185944-A2, 15- 
NOV-2001] 


1..291 


253/297 (85%) 
264/297 (88%) 


e-140 


AAUU 1ZUU 


Human adenine nucleotide 

translocator-3 (ANT-3) 
protein - Homo sapiens, 298 
aa. [WO200132876-A2, 10- 
MAY-2001] 


1..295 


253/297 (85%) 
264/297 (88%) 


e-140 


AAM41427 


NO 6358 - Homo sapiens, 
323 aa. [WO200153312-A1, 
26-JUL-2001] 


26..320 


253/297 (85%) 

264/297 (88%) 




AAM3964I 


Human polypeptide SEQ ID 
NO 2786 - Homo sapiens, 
298 aa. [WO200153312-A1, 
26-JUL-2001] 


1..291 
1..295 


253/297 (85%) 
264/297 (88%) 


e-140 


AAY71033 


Human adenine nucleotide 
translocator ANT3 - Homo 
sapiens, 298 aa. 
[WO200026370-A2, 11- 
MAY-2000] 


1..291 
1..295 


253/297 (85%) 
264/297 (88%) 


e-140 



In a BLAST search of public sequence databases, the NOV42a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 42D. 



Table 42D. Public BLASTP Results for NOV42a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV42a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P12236 


ADP,ATP carrier protein, liver 
isoform T2 (ADP/ATP 
translocase 3) (Adenine 
nucleotide translocator 3) (ANT 
3) - Homo sapiens (Human), 
298 aa. 


1..291 
1..295 


253/297 (85%) 
264/297 (88%) 


e-140 


P32007 


ADP,ATP carrier protein, 
isoform T2 (ADP/ATP 
translocase 3) (Adenine 
nucleotide translocator 3) (ANT 
3) - Bos taurus (Bovine), 298 
aa. 


1..291 
1..295 


246/297 (82%) 
261/297 (87%) 


e-136 


P05141 


ADP,ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator 2) (ANT 
2) - Homo sapiens (Human), 
298 aa. 


1..291 
1..295 


234/297 (78%) 
256/297 (85%) 


e-132 


BAB84673 


ADENINE NUCLEOTIDE 
TRANSLOCATOR 2 - Bos 
taurus (Bovine), 298 aa. 


1..291 
1..295 


233/297 (78%) 
254/297 (85%) 


e-131 


Q09073 


ADP,ATP carrier protein, 
fibroblast isoform (ADP/ATP 
translocase 2) (Adenine 
nucleotide translocator 2) (ANT 
2) - Rattus norvegicus (Rat), 
298 aa. 


1..291 
1..295 


232/297 (78%) 
255/297 (85%) 


e-131 



PFam analysis predicts that the NOV42a protein contains the domains shown in 
the Table 42E. 



Table 42E. Domain Analysis of NOV42a 


Pfam Domain 


NOV42a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mito_carr 


7..105 


33/125 (26%) 
83/125 (66%) 


9.2e-23 


mito_carr 


112..202 


38/125 (30%) 
78/125 (62%) 


1.9e-18 


mito_carr 


203..293 


25/125 (20%) 
69/125 (55%) 


2.9e-05 



Example 43. 

The NOV43 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 43A. 



Table 43A. NOV43 Sequence Analysis 




SEQ ID NO: 135 


858 bp 


NOV43a, 
CG96442-01 DNA Sequence 


ATGTTATGGGTATCTTTGATCTCTGAATTTGAAGGAGGGCTTACTGAGTGTGTCTGGC 
TCATTGGCACTCCTGAGGCAGAACACTTTTCAGGCAAGGGAGTACTACCTGTGCTGTT 

TGGTATCGGAACTCATCATCATCCTTTCCAACCTCCATAGGGCACCTTCCAGAACAAC 
AGAAACTGGAAAGAAAAATGCTTGCTCTAGGGCTCCTTGGGCAGTGGCTCAGTGGCAT 
CTGCCTCAGTGGCTCCCCTTTTCCATACTCCTGGGAAAAGTTTGTCCTTGGAAATGAT 




GGGTATTTAGAAGTGGCCTACTTTCATGTCAAGGAAGAAAGTAAGAGAGAGAGAGAGA 
GAGAGGAGAACGGAGAACATTCTCCCCATCAAAATGTCCCTTCCCAAATTCTATCAGT 
GCCTCTTCTAGGCTTTGAGAGCATGTACTGTGTACACGCTGCTATCACTGCATTTCAT 
CCCCTATGGCTGTCCCCTTCACTGCACGGGGCAGCCCTGTGCTTTCCGTTTTTCAAGA 
TCGAATGTAGCAAACAAGTTATCTATGCAGTTAATGATGATTTATTGAGGGCCTACAA 






ATGAGCTCAAAGGAGATTGTGCCAATTACTGCACCTGTCAGTGTTCAACAACAGATAA 
AATTAATTCGGAAACCCAAACAGTCTGCAGAAAACCGAACACAATTATCTAAGAAATT 
AAGTTATACCGTTGCCATGACCTTTAATCAATCATCCAACCGTTAA 




ORF Start: ATG at 1 


ORF Stop: TAA at 856 




SEQ ID NO: 136 


285 aa | MW at 31776.3kD 


NOV43a, 
CG96442-01 Protein Sequence 


MLWVSLISEFEGGLTECVWLIGTPEAEHFSGKGVLPVLFKLRLVGGQLPHLNAGLWV 
WYRNSSSSFPTSIGHLPEQQKLERKMLALGLLGQWLSGICLSGSPFPYSWEKFVLGND 
GYLEVAYFHVKEESKREREREENGEHSPHQNVPSQILSVPLLGFESMYCVHAAITAFH 
PLWLSPSLHGAALCFPFFKIECSKQVIYAVNDDLLRAYKTGTVATASGKKRGOGAREE 
MSSKEIVPITAPVSVQQQIKLIRKPKQSAENRTQLSKKLSYTVAMTFNQSSNR 



Further analysis of the NOV43a protein yielded the following properties shown in 
Table 43B. 



Table 43B. Protein Sequence Properties NOV43a 


PSort analysis: 


0.4010 probability located in microbody (peroxisome); 0.3000 probability located in 
nucleus; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


Cleavage site between residues 16 and 17 



A search of the NOV43a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 43C. 



Table 43C. Geneseq Results for NOV43a 




Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV43a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched 
Region 


Expect 
Value 


ABG01918 


Novel human diagnostic 
protein #1909 - Homo sapiens, 
434 aa. [WO200175067-A2, 
11-OCT-2001] 


59..140 
219..314 


30/100 (30%) 
41/100 (41%) 


5.8 


ABG01918 


Novel human diagnostic 
protein #1909 - Homo sapiens, 
434 aa. [WO200175067-A2, 
11-OCT-2001] 


59..140 
219..314 


30/100 (30%) 
41/100 (41%) 


5.8 


ABG26879 


Novel human diagnostic 
protein #26870 - Homo sapiens, 
800 aa. [WO200175067-A2, 
11-OCT-2001] 


91..156 
328..393 


20/68 (29%) 
31/68 (45%) 


7.6 


ABG26879 


Novel human diagnostic 
protein #26870 - Homo sapiens, 
800 aa. [WO200175067-A2, 
11-OCT-2001] 


91..156 
328..393 


20/68 (29%) 
31/68 (45%) 


7.6 


AAY08899 


S. peregrina cysteine 
protease-like protein 1 - 
Sarcophaga peregrina, 550 aa. 
[JP11146789-A, 02-JUN-1999] 


66..111 
286..327 


17/46 (36%) 
25/46 (53%) 


9.9 



In a BLAST search of public sequence databases, the NOV43a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 43D. 



Table 43D. Public BLASTP Results for NOV43a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV43a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9X935 


CONSERVED 
HYPOTHETICAL 
PROTEIN - Streptomyces 
coelicolor, 309 aa. 


131..192 
192..248 


19/62 (30%) 
27/62 (42%) 


8.4 


Q9RNK7 


HYPOTHETICAL 
PROTEIN - Zymomonas 
mobilis, 415 aa. 


141..178 
221..258 


12/38 (31%) 
24/38 (62%) 


8.4 



PFam analysis predicts that the NOV43a protein contains the domains shown in 
the Table 43E. 



Table 43E. Domain Analysis of NOV43a 



Pfam Domain 


NOV43a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


No Significant Matches Found 



Example 44. 

The NOV44 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 44A. 



Table 44A. NOV44 Sequence Analysis 

SEQ ID NO: 137 | 999 bp 


NOV44a, 
CG96501-01 DNA Sequence 



GGTTTTAATGCTTCCGGGTTGCCACTGCAGTGGCATTTCTGRCTGTGGGAGCCTCAGT 
TTCCCAGTCATTGATGAGCCTGAGCAGCAGCRAAGTACCACTGTAAGCCATGAGATGT 
CTCGTCTGAACTGGAAACCCTTTGTATATGACGCCCTTGCCTCTATCACTGCTGAGTT 
TGGAACTTTCCCCATGGACCTTGCCAAAACACGACTTCAGGTACAAGGCCAAAGCATT 
GATGTCCGTTTCAAAGAAACAAAATATAGACGGATGTTTCATGCTTTGTTTTGGATCT 
ATAAAGCGGAGGGGGGATTGGCTCTGTATTCAGGAATTGCTCCTGTTTTGCAAAGACA 
AGCATCATATGGCACCATTAAAATTGGGATTTACCAAAGCTTGAAGCAATTATCTGTA 
GAACGTTTAGAAGATGAAACTCTTTTAATCAACATGATCTGTGGGGTAGTGTCAGGGG 
TGATATTTTCCACTATAGCCAATCCCACCGATGTTCTAAAGATTCGAATGCAGGCTCA 
AGGAAGTTTGTTCCAAGGGAGCATGATTGGCAGCTTCATCGATATATACCAACAGGAA 
GGCACCAGGGGTCTGTGGAGGAGTGTGGTTCCAACTGCTCAGCATGCTGCCATCGTTG 
TGGGAGTAGAGCTACCAGTCTATGATTTTACTAAGAAGCACTTAATATTGTCAGGAAT 
GATGGAAGACACAACTTTAACTCACTTTGTTTCCAGCTTTACATATGGTTTGGCTGGG 
GCTCTTGCCTCTAATCCAGGTGATGTGGCAGGCACTCACGTGATGAACCAGAGGGCAA 
TCGTGGGACATGTGGATCTCTATAAGGGCACTTTGGATGGTATTTTAAAAATGTGGAA 
ACATGAGGGCTTTTTTTTGTATTCTAAAGGATTTTGGCCAAACTGGCTTTGGCGTGGA 
CCCTGGAACATCATTCTTAAAATTACATATGAGAAGCTCAAGAGGCTTTAAATCTAAG 
GACTGAATTATAT 





ORF Start: ATG at 8 


ORF Stop: TAA at 977 




SEQ ID NO: 138 


323 aa | MW at 36200.6kD 


NOV44a, 
CG96501-01 Protein Sequence 


MLPGCHCSGISDCGSLSFPVIDEPEQQQSTTVSHEMSRLNWKPFVYDALASITAEFGT 
FPMDLAKTRLQVQGQSIDVRFKETKYRRMFHALFWIYKAEGGLALYSGIAPVLQRQAS 
YGTIKIGIYQSLKQLSVERLEDETLLINMICGVVSGVIFSTIANPTDVLKIRMQAQGS 
LFQGSMIGSFIDIYQQEGTRGLWRSVVPTAQHAAIVVGVELPVYDFTKKHLILSGMME 
DTTLTHFVSSFTYGLAGALASNPGDVAGTHVMNQRAIVGHVDLYKGTLDGILKMWKHE 
GFFLYSKGFWPNWLWRGPWNIILKITYEKLKRL 



Further analysis of the NOV44a protein yielded the following properties shown in 
Table 44B. 



Table 44B. Protein Sequence Properties NOV44a 


PSort analysis: 


0.7480 probability located in microbody (peroxisome); 0.7000 probability located in 
plasma membrane; 0.2000 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV44a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 44C. 



Table 44C. Geneseq Results for NOV44a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV44a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAY94669 


Murine uncoupling protein 
isoform mUCP5S amino 
acid sequence - Mus 
musculus, 325 aa. 
[WO200032624-A2, 08- 
JUN-2000] 


27..323 
27..323 


257/297 (86%) 
266/297 (89%) 


e-144 


AAY94668 


Murine uncoupling protein 
isoform mUCP5L amino 
acid sequence - Mus 
musculus, 322 aa. 
[WO200032624-A2, 08- 
JUN-2000] 


27..323 
24..320 


257/297 (86%) 
266/297 (89%) 


e-144 


AAE06056 


Human gene 16 encoded 
secreted protein HMIAP86, 
SEQ ID NO:118 - Homo 
sapiens, 334 aa. 
[WO200151504-A1, 19- 
JUL-2001] 


27..323 
36..332 


258/297 (86%) 
264/297 (88%) 


e-144 


AAY87079 


Human secreted protein 
sequence SEQ ID NO: 118 
- Homo sapiens, 335 aa. 
[WO200004140-A1, 27- 
JAN-2000] 


27..323 
36..332 


258/297 (86%) 
264/297 (88%) 


e-144 


AAY94666 


Human uncoupling protein 
isoform hUCP5S amino 
acid sequence - Homo 

sapiens, 322 aa. 
[WO200032624-A2, 08- 
JUN-2000] 


27..323 
24..320 


258/297 (86%) 
264/297 (88%) 


e-144 



In a BLAST search of public sequence databases, the NOV44a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 44D. 



Table 44D. Public BLASTP Results for NOV44a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV44a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9Z2B2 


Brain mitochondrial carrier 
protein-1 (BMCP-1) 
(Mitochondrial uncoupling 
protein 5) (UCP 5) (Solute 
carrier family 25, member 

325 aa. 


27..323 
27..323 


257/297 (86%) 
266/297 (89%) 


e-144 


O95258 


Brain mitochondrial carrier 
protein-1 (BMCP-1) 
(Mitochondrial uncoupling 
protein 5) (UCP 5) (Solute 
carrier family 25, member 

(Human), 325 aa. 


27..323 
27..323 


258/297 (86%) 
264/297 (88%) 


e-144 


Q9JMH0 


BRAIN MITOCHONDRIAL 

CARRIER PROTEIN-1 - 
Rattus norvegicus (Rat), 322 
aa. 


27..323 
24..320 


255/297 (85%) 
265/297 (88%) 


e-143 


Q9EP88 


BRAIN MITOCHONDRIAL 
CARRIER PROTEIN 

BMCP1 (BRAIN 
MITOCHONDRIAL 
CARRIER PROTEIN-1) - 
Rattus norvegicus (Rat), 325 
aa. 


27..323 
27..323 


255/297 (85%) 
265/297 (88%) 


e-143 


Q9CR58 


4933433D23RIK PROTEIN 
- Mus musculus (Mouse), 
291 aa. 


36..323 
1..289 


205/289 (70%) 
231/289 (78%) 


e-113 



PFam analysis predicts that the NOV44a protein contains the domains shown in 
the Table 44E. 



Table 44E. Domain Analysis of NOV44a 


Pfam Domain 


NOV44a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


mito_carr 


39..138 


36/125 (29%) 
75/125 (60%) 


1.5e-17 


mito_carr 


140..231 


27/125 (22%) 
74/125 (59%) 


2.8e-21 


mito_carr 


234..323 


29/125 (23%) 
70/125 (56%) 


0.00021 



Example 45. 

The NOV45 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 45A. 



Table 45A. NOV45 Sequence Analysis 




SEQ ID NO: 139 | 1065 bp 


NOV45a, 
CG96557-01 DNA Sequence 


CTACGGCGAAGACCGGGTCCGTGATGAGCTGCAAGAAATGGCTGTGGAGCTGCTRGGG 

ACGACTGCAGGTGCCTCGGCTGCGTGTCCACCAAGAAGCAGCATTGCAGCAGCTTCTC 
CAGCAGGCGATGCTCTGTGGAGGGGCCTCCAGGCGCAGAGGGCTGGAGTGCACTCCCA 
CTCCCTCCCAGAGCTGCCCCACTGCACTCTGGACTGGCTGAGCTGGAGAGGATGATCA 
CGCGGGGCCTCCAGAGTCCCTACCACCAGGTCTGCGAGCTCATTTGCCTGGAGAACTC 
ACACAGCATCTCGGGGGTCCGGATCCTCCCCATCAACTACCTACGCCAGGTGCACCTC 
CTGGCCCAGACCTATGGGGCCTGGGTCCACCTGGACAGGGCCCAGCTGATGAACGTGG 
TGCTGGCCCTGCATGTCCCGCCCACCTACATTGTGGAGCACTGTGACTCTGTGTCTTT 
CTGCCTCTCCAAGGGCCTGGGCGCCCCAGTGGGGTCCCTGGCTGGGAGGCCCAAGGAC 




CTTCTGGATCCTGGGACTCAGGGTGGCCATTGAGTCACTAGGGCGGGACCCAGGGCAC 




ACGGCCAGGATTGTGATGCTCTGCAGAGTCTGGCAGAGCCTCTAACCTGGGTCTTCCA 
TCTGCTCACTGAGTCCTCCAACAATCCCATGATGTCCCTACTTCACAATGAGGAAACA 
3? ;gc c gagaggtcaatag 




ORF Start: ATG at 1 


ORF Stop: TAG at 1063 




SEQ ID NO: 140 


354 aa | MW at 38439.8kD 


NOV45a, 
CG96557-01 Protein Sequence 


MLRRLARAAAVVGLRSDKATRPGPAMRCVMAEAVVGDDDYGEDRVRDELQEMAVELLG 
LGRTLFVPSNTMANLISGSDDCRCLGCVSTKKQHCSSFSSRRCSVEGPPGAEGWSALP 
LPPRAAPLHSGLAELERMITRGLQSPYHQVCELICLENSHSISGVRILPINYLRQVHL 
LAQTYGAWVHLDRAQLMNVVLALHVPPTYIVEHCDSVSFCLSKGLGAPVGSLAGRPKD 
FIEEAWRLQKALGGSMHQVGMLAAVALLCRRGSPSRPAGFWILGLRVAIESLGRDPGH 
QLLQQTLLSAESSLGKPSVDGQDCDALQSLAEPLTWVFHLLTESSNNPMMSLLHNEET 
EAQRGQ 



Further analysis of the NOV45a protein yielded the following properties shown in 
Table 45B. 



Table 45B. Protein Sequence Properties NOV45a 


PSort analysis: 


0.8673 probability located in mitochondrial matrix space; 0.5542 probability located 
in mitochondrial inner membrane; 0.5542 probability located in mitochondrial 
intermembrane space; 0.5542 probability located in mitochondrial outer membrane 


SignalP analysis: 


Cleavage site between residues 17 and 18 



A search of the NOV45a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 45C. 



Table 45C. Geneseq Results for NOV45a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV45a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAG42377 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 52844 - 

Arabidopsis thaliana, 356 aa. 
[EP1033405-A2, 06-SEP-2000] 


12..260 
4..248 


98/252 (38%) 
129/252 (50%) 


4e-36 


AAG42376 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 52843 - 

Arabidopsis thaliana, 358 aa. 
[EP1033405-A2, 06-SEP-2000] 


12..260 
6..250 


98/252 (38%) 
129/252 (50%) 


4e-36 


AAG42378 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 52845 - 

Arabidopsis thaliana, 339 aa. 
[EP1033405-A2, 06-SEP-2000] 


26..260 
1..231 


88/238 (36%) 
119/238 (49%) 


5e-31 


AAG10781 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 9239 - 
Arabidopsis thaliana, 332 aa. 
[EP1033405-A2, 06-SEP-2000] 


26..260 
1..231 


94/241 (39%) 
125/241 (51%) 


5e-31 


AAG73780 


Human colon cancer antigen 
protein SEQ ID NO:4544 - 
Homo sapiens, 272 aa. 
[WO200122920-A2, 05-APR- 
2001] 


10..268 
3..251 


91/261 (34%) 
135/261 (50%) 


2e-30 



In a BLAST search of public sequence databases, the NOV45a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 45D. 



Table 45D. Public BLASTP Results for NOV45a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV45a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


T00716 


L-allo-threonine aldolase 
homolog F22013.11 - 
Arabidopsis thaliana, 484 aa. 


12..260 
90..334 


98/252 (38%) 
129/252 (50%) 


1e-35 


AAL86346 


HYPOTHETICAL 38.9 KDA 
PROTEIN - Arabidopsis thaliana 
(Mouse-ear cress), 358 aa. 


12..260 
6..250 


98/252 (38%) 
129/252 (50%) 


1e-35 


Q9FRS2 


F22013.11 - Arabidopsis 
thaliana (Mouse-ear cress), 352 
aa. 


12..260 
6..250 


98/252 (38%) 
129/252 (50%) 


1e-35 


O07051 


L-allo-threonine aldolase (EC 
4.1.2.-) (L-allo-TA) (L-allo- 
threonine acetaldehyde-lyase) - 
Aeromonas jandaei, 338 aa. 


12..259 
4..241 


100/249 (40%) 
131/249 (52%) 


3e-34 


Q8ZGE4 


L-ALLO-THREONINE 
ALDOLASE (EC 4.1.2.-) - 
Yersinia pestis, 339 aa. 


11..268 
2..249 


93/260 (35%) 
136/260 (51%) 


1e-31 



PFam analysis predicts that the NOV45a protein contains the domains shown in 
the Table 45E. 



Table 45E. Domain Analysis of NOV45a 



Pfam Domain 


NOV45a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


No Significant Matches Found 



Example 46. 

The NOV46 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 46A. 



Table 46A. NOV46 Sequence Analysis 




SEQ ID NO: 141 | 945 bp 


NOV46a, 
CG96581-01 DNA Sequence 


TCTGCCCAGTTAAAGTGCAGGCGTTAGGCTAAAGCTGGGACAAACCTGGAGCGGCTTC 
ATTGGCTTTCCTACTTCTTTTGCATCCCAGAGATGCATAAGCTTAAATCGTCTCAGAA 
GGACAAGGTCCGCCAGTTTATGGCGTGCACTCAGGCTGGCGAGAGAACTGCTATCTAC 
TGCTTAACGCAGAATGAGTGGAGACTAGACGAGGCCACGGACAGCTTCTTCCAAAACC 
CAGACTCGCTCCACAGGGAGTCCATGCGGAACGCTGTGGACAAGAAGAAGCTGGAGCG 
GCTGTACGGCAGGTACAAAGATCCACAAGATGAAAACAAAATTGGAGTCGATGGGATT 
CAACAGTTTTGTGATGATCTGAGCCTGGATCCTGCCAGTATCAGTGTATTGGTCATAG 
CGTGGAAGTTCAGGGCAGCAACTCAGTGTGAATTTAGCAGAAAGGAATTTCTAGATGG 
CATGACAGAACTTGGGTGTGACAGCATGGAGAAGCTAAAGGCTCTTCTGCCAAGACTG 
GAGCAGGAGCTGAAGGACACAGCCAAGTTTAAAGATTTTTATCAGTTTACCTTCACCT 
TCGCTAAGAACCCAGGGCAGAAAGGTTTAGACTTAGAAATGGCTGTTGCGTATTGGAA 
ATTAGTGTTATCTGGAAGGTTTAAATTTTTAGATCTCTGGAACACATTCTTAATGGAA 
CATCACAAAAGATCAATTCCAAGGGACACCTGGAACCTCCTGCTGGACTTTGGAAACA 
TGATTGCGGATGATATGTCTAACTACGATGAAGAAGGAGCTTGGCCCGTTCTTATAGA 
TGATTTTGTAGAATATGCACGGCCAGTAGTCACAGGTGGAAAACGCAGCCTTTTCTAG 
GCAGCAAGTTAAGCAGGAGTAAGATTATGAAATGATTTGTATCCTGCAAGGAGATTGC 
AGTCAGTTCCTGGGTGC 




ORF Start: ATG at 91 


ORF Stop: TAG at 868 




SEQ ID NO: 142 


259 aa | MW at 30179.1kD 


NOV46a, 
CG96581-01 Protein Sequence 


MHKLKSSQKDKVRQFMACTQAGERTAIYCLTQNEWRLDEATDSFFQNPDSLHRESMRN 
AVDKKKLERLYGRYKDPQDENKIGVDGIQQFCDDLSLDPASISVLVIAWKFRAATQCE 
FSRKEFLDGMTELGCDSMEKLKALLPRLEQELKDTAKFKDFYQFTFTFAKNPGQKGLD 
LEMAVAYWKLVLSGRFKFLDLWNTFLMEHHKRSIPRDTWNLLLDFGNMIADDMSNYDE 
EGAWPVLIDDFVEYARPVVTGGKRSLF 




SEQ ID NO: 143 | 808 bp 


NOV46b, 
CG96581-02 DNA Sequence 


GAGATGCATAAGCTTAAATCGTCTCAGAAGGACAAGGTCCGCCAGTTTATGGCGTGCA 
CTCAGGCTGGCGAGAGAACTGCTATCTACTGCTTAACGCAGAATGAGTGGAGACTAGA 
CGAGGCCACGGACAGCTTCTTCCAAAACCCAGACTCGCTCCACAGGGAGTCCATGCGG 
AACGCTGTGGACAAGAAGAAGCTGGAGCGGCTGTACGGCAGGTACAAAGATCCACAAG 
ATGAAAACAAAATTGGAGTCGATGGGATTCAACAGTTTTGTGATGATCTGAGCCTGGA 
TCCTGCCAGTATCAGTGTATTGGTCATAGCGTGGAAGTTCAGGGCAGCAACTCAGTGT 
GAATTTAGCAGAAAGGAATTTCTAGATGGCATGACAGAACTTGGGTGTGACAGCATGG 
AGAAGCTAAAGGCTCTTCTGCCAAGACTGGAGCAGGAGCTGAAGGACACAGCCAAGTT 
TAAAGATTTTTATCAGTTTACCTTCACCTTCGCTAAGAACCCAGGGCAGAAAGGTTTA 
GACTTAGAAATGGCTGTTGCGTATTGGAAATTAGTGTTATCTGGAAGGTTTAAATTTT 
TAGATCTCTGGAACACATTCTTAATGGAACATCACAAAAGATCAATTCCAAGGGACAC 
CCGGAACCTCCTGCTGGACTTTGGAAACATGATTGCGGATGATATGTCTAACTACGAT 
GAAGAAGGAGCTTGGCCCGTTCTTATAGATGATTTTGTAGAATATGCACGGCCAGTAG 
TCACAGGTGGAAAACGCAGCCTTTTCTAGGCAGCAGTTAAGCAGGAGTAAGATT 




ORF Start: ATG at 4 


ORF Stop: TAG at 781 




SEQ ID NO: 144 


259 aa | MW at 30149.1kD 


NOV46b, 
CG96581-02 Protein Sequence 


MHKLKSSQKDKVRQFMACTQAGERTAIYCLTQNEWRLDEATDSFFQNPDSLHRESMRN 
AVDKKKLERLYGRYKDPQDENKIGVDGIQQFCDDLSLDPASISVLVIAWKFRAATQCE 
FSRKEFLDGMTELGCDSMEKLKALLPRLEQELKDTAKFKDFYQFTFTFAKNPGQKGLD 
LEMAVAYWKLVLSGRFKFLDLWNTFLMEHHKRSIPRDTRNLLLDFGNMIADDMSNYDE 
EGAWPVLIDDFVEYARPVVTGGKRSLF 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 46B. 






WO 03/010327 



PCT/US02/14199 



Table 46B. Comparison of NOV46a against NOV46b. 


Protein Sequence 


NOV46a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV46b 


1..259 
1..259 


258/259 (99%) 
258/259 (99%) 
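
The identity figures in Table 46B follow from a position-by-position comparison of two sequences of equal length. A minimal Python sketch is given here as an illustration only; the function name `identity` is hypothetical, the percentage is assumed to be truncated to an integer (which reproduces 258/259 as 99%), and the two fragments are the residues around a codon at which the NOV46a and NOV46b coding sequences of Table 46A differ (TGG versus CGG).

```python
def identity(seq_a: str, seq_b: str) -> tuple:
    """Return (matches, length, truncated percent) for two aligned sequences.

    The sequences are assumed to be already aligned without gaps, so they
    must have the same, non-zero length.
    """
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return matches, len(seq_a), 100 * matches // len(seq_a)


# Fragments taken from the NOV46a and NOV46b protein sequences; the choice
# of window is an assumption made for illustration.
nov46a_fragment = "SIPRDTWNLLLDFG"
nov46b_fragment = "SIPRDTRNLLLDFG"
print(identity(nov46a_fragment, nov46b_fragment))  # (13, 14, 92)

# Truncated percentage for the full-length figure in Table 46B.
print(100 * 258 // 259)  # 99
```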



Further analysis of the NOV46a protein yielded the following properties shown in 
Table 46C. 



Table 46C. Protein Sequence Properties NOV46a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV46a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 46D. 



Table 46D. Geneseq Results for NOV46a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV46a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAM40345 


NO 3490 - Homo sapiens, 244 
aa. [WO200153312-A1, 26- 
JUL-2001] 


1..244 


244/244 (100%) 
244/244 (100%) 


e-144 


AAM42131 


Human polypeptide SEQ ID 
NO 7062 - Homo sapiens, 224 
aa. [WO200153312-A1, 26- 
JUL-2001] 


22..224 


200/203 (98%) 
200/203 (98%) 


e-115 


AAY94962 


Human secreted protein clone 
mtl24 3 protein sequence SEQ 
ID NO:130 - Homo sapiens, 
244 aa. [WO200009552-A1, 
24-FEB-2000] 


16..255 


185/240 (77%) 
218/240 (90%) 


e-115 


AAB93082 


Human protein sequence SEQ 
ID NO: 11917 - Homo sapiens, 
186 aa. [EP1074617-A2, 07- 
FEB-2001] 


1..173 
1..173 


173/173 (100%) 
173/173 (100%) 


1e-98 


ABB63270 


Drosophila melanogaster 
polypeptide SEQ ID NO 16602 
- Drosophila melanogaster, 291 
aa. [WO200171042-A2, 27- 
SEP-2001] 


2..248 
5..247 


152/248 (61%) 
192/248 (77%) 


1e-87 



In a BLAST search of public sequence databases, the NOV46a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 46E. 



Table 46E. Public BLASTP Results for NOV46a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV46a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q96GG9 


RP42 HOMOLOG - Homo 
sapiens (Human), 259 aa. 


1..255 
1..255 


201/255 (78%) 
234/255 (90%) 


e-123 


Q9QZ73 


RP42 - Mus musculus 
(Mouse), 259 aa. 


1..255 
1..255 


201/255 (78%) 
235/255 (91%) 


e-123 


Q9HCT3 


RP42 PROTEIN - Homo 
sapiens (Human), 259 aa. 


1..255 
1..255 


200/255 (78%) 
234/255 (91%) 


e-123 


Q99NE7 


PUTATIVE LEUCINE- 
ZIPPER PROTEIN - Mus 
musculus domesticus (western 
European house mouse), 259 
aa. 


1..255 
1..255 


200/255 (78%) 
233/255 (90%) 


e-122 


AAL78673 


LEUCINE ZIPPER PROTEIN 
- Homo sapiens (Human), 259 
aa. 


1..255 
1..255 


199/255 (78%) 
232/255 (90%) 


e-122 



PFam analysis predicts that the NOV46a protein contains the domains shown in 
the Table 46F. 



Table 46F. Domain Analysis of NOV46a 






Pfam Domain 


NOV46a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


No Significant Matches Found 



Example 47. 



The NOV47 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 47A. 



Table 47A. NOV47 Sequence Analysis 




SEQ ID NO: 145 


1072 bp 


NOV47a, 
CG96624-01 DNA Sequence 


CCCTCCTTGTCTGTCTCCGGGATTCAGGCCTCCCTCCCTGACATGGAGAGTAACCTGT 


CTGGCCTGGTGCCTGCTGCCGGGCTGGTGCCTGCGCTGCCACCTGCTGTGACCCTGGG 


CTCTGGCTGGTGCTTCTGTATGGGCACAAGCGTCTCAGCTATCAGACGGTGTTCCTGG 
CCCTCTGTCTGCTCTGGGCCGCCTTGCGTACCACCCTCTTCTCCTTCTACTTCCGAGA 

GTCTGCCTGCAGTTCTTCACCTTGACGCTTATGAACCTCTACTTTGCCCAGGTAAGGC 
TCGCTGTCCGAGGGGCCTTTGTGGGGGCCTCGCTGCTCTTTCTGCTGGTGAACGTGCT 
GTGTGCTGTGCTCTCCCATCGGCGCCGGGCACAGCCCTGGGCCCTGCTGCTTGTCCGC 






GCCTCGTCGCCAGGCGGGCGCCCTCCACTAGCATCTACCTGGAGGCCAAGGGGACCAG 
TGTGTGCCAGGCGGCCGCGATGGGTGGCGCCATGGTCCTGCTCTATGCCAGCCGGGCC 
TGCTACAACCTGACAGCACTGGCCTTGGCCCCCCAGAGCCGGCTGGACACCTTCGATT 
ACGACTGGTACAATGTGTCTGACCAGGCGGACCTGGTGAATGACCTGGGGAACAAAGG 
CTACCTGGTATTTGGCCTCATCCTCTTCGTGTGGGAGCTACTGCCCACCACCCTGCTG 




TCTTTGACCGGGCTGGGCACTGTGAAGATGAGGGCTGCTCCTGGGAGCACAGCCGGGG 
TGAGAGCACCAGGTAGGAGCCGTGGCACTGCCTCAGTACCCCTGCCCTACCCGCCCAC 
CCCGCTGGCTCCATCAAGCTATGGGGGA 




ORF Start: ATG at 43 


ORF Stop: TAG at 1000 




SEQ ID NO: 146 


319 aa | MW at 35534.2kD 


NOV47a, 
CG96624-01 Protein Sequence 


MESNLSGLVPAAGLVPALPPAVTLGLTAAYTTLYALLFFSVYAQLWLVLLYGHKRLSY 
QTVFLALCLLWAALRTTLFSFYFRDTPRANRLGPLPPWLLYCCPVCLQFFTLTLMNLY 
FAQVRLAVRGAFVGASLLFLLVNVLCAVLSHRRRAQPWALLLVRVLVSDSLFVICALS 
LAACLCLVARRAPSTSIYLEAKGTSVCQAAAMGGAMVLLYASRACYNLTALALAPQSR 
LDTFDYDWYNVSDQADLVNDLGNKGYLVFGLILFVWELLPTTLLVGFFRVHRPPQDLF 
ASRSYFFDRAGHCEDEGCSWEHSRGESTR 




SEQ ID NO: 147 


1200 bp 


NOV47b, 
CG96624-02 DNA Sequence 


GATAAGGCCTCCCTCCCTGACATGGAGAGTAACCTGTCTGGCCTGGTGCCTGCTGCCG 


CCTGTATGCCCTGCTCTTCTTCTCCGTCTATGCCCAGCTCTGGCTGGTGCTTCTGTAT 
GGGCACAAGCGTCTCAGCTATCAGACGGTGTTCCTGGCCCTCTGTCTGCTCTGGGCCG 
CCTTGCGTACCACCCTCTTCTCCTTCTACTTCCGAGATACTCCCCGCGCCAACCGCCT 
GGGGCCCTTGCCCTTCTGGCTTCTCTACTGCTGCCCCGTCTGCCTGCAGTTCTTCACC 
TTGACGCTTATGAACCTCTACTTTGCCCAGGTAAGGCTCGCTGTCCGAGGGGCCTTTG 
TGGGGGCCTCGCTGCTCTTTCTGCTGGTGAACGTGCTGTGTGCTGTGCTCTCCCATCG 

TTCGTCATCTGCGCGCTGTCTCTTGCTGCCTGCCTCTGCCTCGTCGCCAGGCGGGCGC 
CCTCCACTAGCATCTACCTGGAGGCCAAGGGGACCAGTGTGTGCCAGGCGGCCGCGAT 
GGGTGGCGCCATGGTCCTGCTCTATGCCAGCCGGGCCTGCTACAACCTGACAGCACTG 
GCCTTGGCCCCCCAGAGCCGGCTGGACACCTTCGATTACGACTGGTACAATGTGTCTG 
ACCAGGTGGGCATACGCATGTCTGCCACCCAGGCGGACCTGGTGAATGACCTGGGGAA 
CAAAGGCTACCTGGTATTTGGCCTCATCCTCTTCGTGTGGGAGCTACTGCCCACCACC 




CCTACTTCTTTGACCGGGCTGGGCACTGTGAAGATGAGGGCTGCTCCTGGGAGCACAG 
CCGGGGTGAGAGCACCAGTATGTCGGGCAGTCTAGGCTCTGGGAGCTGGTATGGTGCC 
ATCGGGCGTGAGCCGGGCTGGTATGGGGGCAGCCAGACGAAGACCACTCCTCTGCTCT 
TCTCCCAGGTGCCAGGACCAGGCGGCCACCACCACAGTCTCTACTCCACCCCACAGAC 
GTGATCCCCGTCGCCTGCTGC F 1 "^GGCCCTG 




ORF Start: ATG at 22 


ORF Stop: TGA at 1162 




SEQ ID NO: 148 


380 aa | MW at 41810.1kD 


NOV47b, 
CG96624-02 Protein Sequence 


MESNLSGLVPAAGLVPALPPAVTLGLTAAYTTLYALLFFSVYAQLWLVLLYGHKRLSY 
QTVFLALCLLWAALRTTLFSFYFRDTPRANRLGPLPFWLLYCCPVCLQFFTLTLMNLY 
FAQVRLAVRGAFVGASLLFLLVNVLCAVLSHRRRAQPWALLLVRVLVSDSLFVICALS 
LAACLCLVARRAPSTSIYLEAKGTSVCQAAAMGGAMVLLYASRACYNLTALALAPQSR 
LDTFDYDWYNVSDQVGIRMSATQADLVNDLGNKGYLVFGLILFVWELLPTTLLVGFFR 
VHRPPQDLFASRSYFFDRAGHCEDEGCSWEHSRGESTSMSGSLGSGSWYGAIGREPGW 
YGGSQTKTTPLLFSQVPGPGGHHHSLYSTPQT 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 47B. 



Table 47B. Comparison of NOV47a against NOV47b. 


Protein Sequence 


NOV47a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV47b 


1..314 
1..323 


256/323 (79%) 
256/323 (79%) 



Further analysis of the NOV47a protein yielded the following properties shown in 
Table 47C. 



Table 47C. Protein Sequence Properties NOV47a 


PSort analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi body; 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 44 and 45 



A search of the NOV47a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 47D. 



Table 47D. Geneseq Results for NOV47a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV47a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE09061 


Human seven-transmembrane 
protein 19459 - Homo sapiens, 
396 aa. [WO200159116-A2, 
16-AUG-2001] 


1..318 
1..343 


317/343 (92%) 
317/343 (92%) 


e-180 


AAU04571 


Human G-protein coupled 
receptor like protein, GPCR #8 

- Homo sapiens, 1314 aa. 
[WO200153454-A2, 26-JUL- 
2001] 


1..318 
1..343 


317/343 (92%) 
317/343 (92%) 


e-180 


AAY69992 


Human receptor-associated 
protein from Incyte clone 
786873 - Homo sapiens, 346 
aa. [WO200008155-A2, 17- 
FEB-2000] 


1..318 
1..293 


267/343 (77%) 
267/343 (77%) 


e-142 


AAU04572 


Human G-protein coupled 
receptor model sequence #4 - 

Homo sapiens, 178 aa. 
[WO200153454-A2, 26-JUL- 
2001] 


56..218 
1..178 


162/178 (91%) 
162/178 (91%) 


2e-85 


AAY60207 


Human endometrium tumour 
EST encoded protein 267 - 

Homo sapiens, 296 aa. 
[DE19817948-A1, 21-OCT- 
1999] 


26..289 
1..278 


159/279 (56%) 
197/279 (69%) 


2e-85 



In a BLAST search of public sequence databases, the NOV47a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 47E. 



Table 47E. Public BLASTP Results for NOV47a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NOV47a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


CAC69286 


SEQUENCE 1 FROM PATENT 
WO0159116 - Homo sapiens 
(Human), 396 aa. 


1..318 


317/343 (92%) 
317/343 (92%) 


e-180 


Q96NI9 


CDNA FLJ31532 FIS, CLONE 
NT2RJ2000597 - Homo sapiens 
(Human), 417 aa. 


1..319 
1..344 


317/344 (92%) 
317/344 (92%) 


e-179 


Q9NQC5 


HYPOTHETICAL 23.3 KDA 
PROTEIN PRECURSOR - 
Homo sapiens (Human), 210 aa. 


1..187 
1..201 


182/202 (90%) 
182/202 (90%) 


4e-96 


Q9JHD9 


PUTATIVE SEVEN PASS 

TRANSMEMBRANE 
PROTEIN - Mus musculus 
(Mouse), 385 aa. 


14..310 
18..339 


180/323 (55%) 
221/323 (67%) 


2e-93 


O60478 


PUTATIVE SEVEN PASS 

TRANSMEMBRANE 
PROTEIN (SIMILAR TO 
TRANSMEMBRANE 7 
SUPERFAMILY MEMBER 1) 
(UPREGULATED IN 
KIDNEY) - Homo sapiens 
(Human), 399 aa. 


14..310 
33..354 


177/323 (54%) 
220/323 (67%) 


4e-93 



PFam analysis predicts that the NOV47a protein contains the domains shown in 
the Table 47F. 



Table 47F. Domain Analysis of NOV47a 



Pfam Domain 


NOV47a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


No Significant Matches Found 



Example 48. 

The NOV48 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 48A. 



Table 48A. NOV48 Sequence Analysis 




SEQ ID NO: 149 | 834 bp 


NOV48a, 
CG96747-01 DNA Sequence 

1 


ATGGCCCGGCTGGGCGCGCTGCTCCTGGCCGCCGCCCTGGGTGCACTGCTCAGCTTCG 
CGCTCCTGGCCGCCGCGGTCGCCAGCGACTACTGGTACATCCTGGAGGTGGCGGACGC 
CGGCAATGGCAGCGCCTGGCCCGGGCGCGCAGAGCTGCTCTCCTCGCACTCGGGGCTC 
TGGCGCATCTGCGGAAGACCTCCTGACATGCACCTGTGTGCACGTGTGCACACGCGAG 
GCCTATGTTTCCCAGGGCAGAACGGCTGCATCCCGCTGGTCGACCCTTTTGCCAGTGA 
GAGCCTGGACGTCTCCACCTCGGTGCAGCACCTCATCTTGCTGCACCGTGCAGTCATT 

GCTCCCTGGCCCAGAGCGTGTCTCTGCTGCTTTTCACCGGCTGCTACTTCCTGCTGGG 
GGGTGTCCTGACACTGGCGGGGGTCAGCATCTACATCAGCTACTCGCACCTGGCCTTT 
GCGGAGACGGTGCAGCAGTATGGCCCGCAGCACATGCAGGGCGTCCGCGTCAGCTTCG 
GCTGGTCCATGGCCCTGGCCTGGGGCTCCTGTGCCTTGGAGGCATTCAGCGGAACCCT 
CCTGCTCTCAGCTGCCTGGACCCTCAGCCTGAGCCCCCCAATCTGTGGTCATCTGAGT 
CCCCAGCAGGGCAGGCTTGGTGGAGAAGAGGCTGATGAGAGGGCCCGAGAGCCCCTCC 
GATTTGCAGGGGTGGGGGGCAAGGAGCTGAGCGATCCAGATGTACCCCTCTGCCCCCT 
CCCTTGTTCTCAAGCCTGCTAG 




ORF Start: ATG at 1 | ORF Stop: TAG at 832 




SEQ ID NO: 150 | 277 aa | MW at 29226.6kD 


NOV48a, 
CG96747-01 Protein Sequence 


MARLGALLLAAALGALLSFALLAAAVASDYWYILEVADAGNGSAWPGRAELLSSHSGL 
WRICGRPPDMHLCARVHTRGLCFPGQNGCIPLVDPFASESLDVSTSVQHLILLHRAVI 
WLPLSLVLLVCGWICGLLSSLAQSVSLLLFTGCYFLLGGVLTLAGVSIYISYSHLAF 
AETVQQYGPQHMQGVRVSFGWSMALAWGSCALEAFSGTLLLSAAWTLSLSPPICGHLS 
PQQGRLGGEEADERAREPLRFAGVGGKELSDPDVPLCPLPCSQAC 
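
The relationship between the ORF coordinates and the protein length in Table 48A can be checked mechanically. The sketch below, in Python, translates a DNA string with the standard genetic code and counts the codons between the stated start and stop positions. The helper names translate and orf_codons are hypothetical, and only the first 30 nucleotides of the NOV48a sequence are used as input.

```python
# Standard genetic code, indexed by codon letters in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, start: int = 1) -> str:
    """Translate from a 1-based start position until a stop codon or the end."""
    protein = []
    for pos in range(start - 1, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


def orf_codons(start: int, stop: int) -> int:
    """Number of coding codons between a 1-based start and the stop codon."""
    return (stop - start) // 3


prefix = "ATGGCCCGGCTGGGCGCGCTGCTCCTGGCC"  # first 30 nt of the NOV48a DNA
print(translate(prefix))    # MARLGALLLA
print(orf_codons(1, 832))   # 277
```

With the start codon at position 1 and the stop codon at position 832, the ORF holds 277 coding codons, which agrees with the 277 aa length given for SEQ ID NO: 150.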



Further analysis of the NOV48a protein yielded the following properties shown in 
Table 48B. 



Table 48B. Protein Sequence Properties NOV48a 


PSort analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 29 and 30 



A search of the NOV48a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 48C. 
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Table 48C. Geneseq Results for NOV48a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV48a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAM23670 


Human EST encoded protein SEQ 
ID NO: 1195 - Homo sapiens, 57 
aa. [WO200154477-A2, 02-AUG- 
2001] 


186..241 
1..57 


51/57 (89%) 
53/57 (92%) 




AAB73979 


Human stargazin-like protein 
CACNG8 - Homo sapiens, 327 
aa. [WO200121791-A2, 29- 
MAR-2001] 


7..161 


1 12. (ZOYO ) 

81/172(46%) 




AAY84372 


A human voltage-gated calcium 

channel designated 
CACNGLIKE3 - Homo sapiens, 
327 aa. [WO200014224-A1, 16- 
MAR-2000] 


5..168 
7..161 


49/172(28%) 
81/172(46%) 


2e-09 


AAY70462 


Human membrane channel 
protein-12 (MECHP-12) - Homo 
sapiens, 323 aa. [WO200012711- 
A2, 09-MAR-2000] 


5..168 
7..158 


42/169(24%) 
79/169(45%) 


3e-09 


ABB11805 


Human voltage gated Ca channel 

subunit homologue, SEQ ID 
NO:2175 - Homo sapiens, 325 aa. 
[WO200157188-A2, 09-AUG- 
2001] 


5..168 
9..160 


42/169(24%) 
78/169(45%) 


7e-09 



In a BLAST search of public sequence databases, the NOV48a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 48D. 
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Table 48D. Public BLASTP Results for NOV48a 


Accession 
Number 


Protein/Organism/Length 


NOV48a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9D563 


4930511J11RIK PROTEIN - Mus 
musculus (Mouse), 222 aa. 


3..224 
4..215 


98/231 (42%) 
133/231 (57%) 


2e-38 


Q9JJV4 


Voltage-dependent calcium channel 
gamma-4 subunit (Neuronal 
voltage-gated calcium channel 
gamma-4 subunit) - Mus musculus 
(Mouse), 327 aa. 


5..168 
7..161 


50/172 (29%) 
81/172 (47%) 


3e-09 


Q98UH4 


IPR328-LIKE PROTEIN - Gallus 
gallus (Chicken), 314 aa. 


1..168 
3..157 


49/171 (28%) 
78/171 (44%) 


4e-09 


Q8WXS5 


VOLTAGE-DEPENDENT 
CALCIUM CHANNEL GAMMA- 
8 SUBUNIT - Homo sapiens 
(Human), 426 aa. 


5..216 
18..226 


59/232 (25%) 
98/232 (41%) 


5e-09 


CAC36506 


SEQUENCE 1 FROM PATENT 
WO0121791 - Homo sapiens 
(Human), 327 aa. 


5..168 
7..161 


49/172 (28%) 
81/172 (46%) 


6e-09 
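
The Expect Value column mixes several notations: ordinary scientific notation such as 2e-38, a shortened form such as e-180 with no leading digit, and 0.0. The Python sketch below normalizes these to floats so rows can be compared numerically. It assumes that a bare e-NNN stands for 1e-NNN, and the function name parse_expect is hypothetical.

```python
def parse_expect(cell: str) -> float:
    """Convert an Expect Value cell ('2e-38', 'e-180', '0.0') to a float."""
    text = cell.strip().replace(" ", "")
    if text.startswith("e"):
        # Assumption: a bare exponent such as 'e-180' abbreviates '1e-180'.
        text = "1" + text
    return float(text)


# Expect values of the five hits listed in Table 48D, keyed by accession.
hits = {
    "Q9D563": "2e-38",
    "Q9JJV4": "3e-09",
    "Q98UH4": "4e-09",
    "Q8WXS5": "5e-09",
    "CAC36506": "6e-09",
}
best = min(hits, key=lambda acc: parse_expect(hits[acc]))
print(best)  # Q9D563
```

A smaller Expect Value indicates a match that is less likely to arise by chance, so the minimum identifies the strongest hit in the table.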



PFam analysis predicts that the NOV48a protein contains the domains shown in 
the Table 48E. 



Table 48E. Domain Analysis of NOV48a 


Pfam Domain 


NOV48a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PMP22_Claudin 


4..214 


48/232 (21%) 
139/232 (60%) 


0.023 



Example 49. 



The NOV49 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 49A. 
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Table 49 A. NOV49 Sequence Analysis 




SEQ ID NO: 151 


537 bp 


NOV49a, 
CG96789-01 DNA Sequence 


CGCCTCCCGGCGCGGGTGGCCX3AGGCGTAGCGCTGCGACCCCCGCACCCCTCTGCGAA 
CATGGCGCTGCGAGTGGTGCGGAGCGTGCGGGCCCTGCTCTGCACCCTGCGCGCGGTC 
CCGTCACCCGCCGCGCCCTGCCCGCCGAGGCCCTGGCAGCTGGGGGTGGGCGCCGTCC 
GTACGCTGCGCACTGGACCCGCTCTGCTCTCGGTGCGTAAATTTACAGAGAAACACGA 
ATGGGTAACAACAGAAAATGGCATTGGAACAGTGGGAATCAGCAATTTTGCACAGGAA 
GCGTTGGGAGATGTTGTTTATTGTAGTCTCCCTGAAGTTGGGACAAAATTGAACAAAC 
AAGGTTGGCTGATCA^GATGACACTGAGTAACCCTTCAGAACTAGATGAACTTATGAG 
Tfi a a a AOr AT ATO AGAAAT ACAT AAAATCTATTGAGGAGTGAAAATGGAACTCCTAA 
ATAAACTAGTATGAAATAACGCAAGCCAGCAGAGTTGTCTTAAATTAGTGGTGGATAG 
AAGACTTAGAATAGC 




ORF Start: ATG at 60 


ORF Stop: TGA at 447 




SEQ ID NO: 152 


129 aa | MW at 14212.3kD 


NOV49a, 
CG96789-01 Protein Sequence 


MALRVVRSVRALLCTLRAVPSPAAPCPPRPWQLGVGAVRTLRTGPALLSVRKFTEKHE 
WVTTENGIGTVGISNFAQEALGDVVYCSLPEVGTKLNKQGWLIKMTLSNPSELDELMS 
EEAYEKYIKSIEE 



Further analysis of the NOV49a protein yielded the following properties shown in 
Table 49B. 



Table 49B. Protein Sequence Properties NOV49a 


PSort analysis: 


0.9200 probability located in mitochondrial matrix space; 0.6000 probability located 

in mitochondrial inner membrane; 0.6000 probability located in mitochondrial 
intermembrane space; 0.6000 probability located in mitochondrial outer membrane 


SignalP analysis: 


Cleavage site between residues 19 and 20 



A search of the NOV49a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 49C. 
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Table 49C. Geneseq Results for NOV49a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV49a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABG11725 


Novel human diagnostic protein 
#11716 - Homo sapiens, 185 aa. 
[WO200175067-A2, 11-OCT-2001] 


1..129 
10..185 


124/176 (70%) 
126/176 (71%) 


6e-61 


ABG11725 


Novel human diagnostic protein 
#11716 - Homo sapiens, 185 aa. 
[WO200175067-A2, 11-OCT-2001] 


1..129 
10..185 


124/176 (70%) 
126/176 (71%) 


6e-61 


ABG18844 


Novel human diagnostic protein 
#18835 - Homo sapiens, 184 aa. 
[WO200175067-A2, 11-OCT-2001] 


1..129 
9..184 


103/176 (58%) 
111/176 (62%) 


2e-46 


ABG18844 


Novel human diagnostic protein 
#18835 - Homo sapiens, 184 aa. 
[WO200175067-A2, 11-OCT-2001] 


1..129 
9..184 


103/176 (58%) 
111/176 (62%) 


2e-46 


AAB58488 


Lung cancer associated polypeptide 

sequence SEQ ID 826 - Homo 
sapiens, 102 aa. [WO200055180-A2, 
21-SEP-2000] 


7..60 
1..54 


52/54 (96%) 
52/54 (96%) 


1e-24 



In a BLAST search of public sequence databases, the NOV49a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 49D. 
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Table 49D. Public BLASTP Results for NOV49a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV49a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P23434 


Glycine cleavage system H protein, 
mitochondrial precursor - Homo 
sapiens (Human), 173 aa. 


1..129 
1..173 


128/173 (73%) 
128/173 (73%) 


5e-63 


Q9N121 


Glycine cleavage system H protein, 

mitochondrial precursor - 
Oryctolagus cuniculus (Rabbit), 173 
aa. 


1..129 
1..173 


114/173 (65%) 
119/173 (67%) 


1e-54 


P20821 


Glycine cleavage system H protein, 
mitochondrial precursor - Bos taurus 
(Bovine), 173 aa. 


1..129 
1..173 


113/173 (65%) 
118/173 (67%) 


1e-53 


Q96GY5 


GLYCINE CLEAVAGE SYSTEM 
PROTEIN H (AMINOMETHYL 
CARRIER) - Homo sapiens 
(Human), 98 aa. 


1..97 
1..97 


96/97 (98%) 
96/97 (98%) 


2e-50 


Q9QYU8 


H PROTEIN - Rattus norvegicus 
(Rat), 169 aa. 


1..126 
1..168 


102/170 (60%) 
112/170 (65%) 


3e-46 
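
The residue ranges and the alignment length in a row of Table 49D can be combined to estimate how many gap columns an alignment contains. The Python sketch below does this for the P23434 row. The helper names parse_range and span are hypothetical, and the gap estimate assumes that the denominator of the Identities cell is the total number of alignment columns.

```python
def parse_range(cell: str) -> tuple[int, int]:
    """Parse a residue range such as '1..129' into (start, end)."""
    start, end = cell.replace(" ", "").split("..")
    return int(start), int(end)


def span(cell: str) -> int:
    """Number of residues covered by an inclusive range."""
    start, end = parse_range(cell)
    return end - start + 1


# P23434 row: NOV49a residues 1..129, match residues 1..173, 128/173 identities.
query_residues = span("1..129")
match_residues = span("1..173")
alignment_columns = 173  # denominator of the Identities cell

# Columns not occupied by a query residue must be gaps on the query side.
query_gaps = alignment_columns - query_residues
print(query_residues, match_residues, query_gaps)  # 129 173 44
```

Under that assumption, the 129-residue NOV49a sequence aligns across all 173 residues of the glycine cleavage system H protein precursor with about 44 gap positions on the NOV49a side.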



PFam analysis predicts that the NOV49a protein contains the domains shown in 
the Table 49E. 



Table 49E. Domain Analysis of NOV49a 


Pfam Domain 


NOV49a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


GCV_H 


51..96 


25/51 (49%) 
46/51 (90%) 


2.3e-22 


GCV_H 


97..127 


14/35 (40%) 
31/35 (89%) 


2.4e-11 



Example 50. 



The NOV50 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 50A. 
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Table 50A. NOV50 Sequence Analysis 




SEQ ID NO: 153 


440 bp 


NOV50a, 
CG97253-01 DNA Sequence 


GAGTGCCTTCGAGTGCGAGGCGAGGTGGCTCCTGACGCTAAGAGCTTCGTGCTGAACC 
TGGGCAAAGACAGCAACAACCTGTGCCTGCACTTCAACCCTCGCTTCAACGCCCACGG 
CGACGCCAACACCATCGTGTGCAACAGCAAGGACGGCGGGGCCTGGGGGACCGAGCAG 
CGGGAGGCTGTCTTTCCCTTCCAGCCTGGAAGTGTTGCAGAGGTGTGCATCACCTTCG 
ACCAGGCCAACCTGACGGTGACTTCAAGATCAAATGTGTGGCCTTTGACTGAAATCAG 
CCAGCCCATGGCCCCCAATAAAGGCAGCTGCCTCTGAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 




ORF Start: ATG at 14 


ORF Stop: TGA at 383 




SEQ ID NO: 154 


123 aa | MW at 13177.8kD 


NOV50a, 
CG97253-01 Protein Sequence 


MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLHFNPRFNAHGDANT 
IVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVTSRSNVWPLTEISQPMA 
PNKGSCL 



Further analysis of the NOV50a protein yielded the following properties shown in 
Table 50B. 



Table 50B. Protein Sequence Properties NOV50a 


PSort analysis: 


0.6500 probability located in cytoplasm; 0.1586 probability located in lysosome 
(lumen); 0.1000 probability located in mitochondrial matrix space; 0.0000 
probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV50a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 50C. 
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Table 50C. Geneseq Results for NOV50a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV50a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY80675 


Human galectin-1 fusion protein, SEQ ID 
NO:9 - Homo sapiens, 142 aa. 
[WO200006724-A1, 10-FEB-2000] 


1..99 
8..106 


99/99(100%) 
99/99(100%) 


1e-54 


AAW55959 


Human HL-60 lectin - Homo sapiens, 
135 aa. [WO9808535-A2, 05-MAR- 
1998] 


1..99 
1..99 


99/99 (100%) 
99/99 (100%) 


1e-54 


AAR08153 


Human lectin gene product - Homo 
sapiens, 135 aa. [JP02262597-A, 25- 
OCT-1990] 


1..99 
1..99 


99/99 (100%) 
99/99(100%) 


1e-54 


AAR52745 


Amino acid sequence of both HL-60 and 
placenta lectins - Homo sapiens, 135 aa. 
[WO9411497-A, 26-MAY-1994] 


1..99 
1..99 


98/99 (98%) 
99/99 (99%) 


3e-54 


AAP91964 


HL-60 and placenta lectins - Homo 
sapiens, 135 aa. [EP337799-A, 18-OCT- 
1989] 


1..99 
1..99 


98/99 (98%) 
99/99 (99%) 


3e-54 



In a BLAST search of public sequence databases, the NOV50a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 50D. 
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Table 50D. Public BLASTP Results for NOV50a 


Accession 
Number 


Protein/Organism/Length 


NOV50a 
Residues/ 
Match 


Identities/ 
Similarities for 
the Matched 


Expect 
Value 


Q9DOX0 


18 DAYS EMBRYO CDNA, RIKEN 
FULL-LENGTH ENRICHED 
LIBRARY, CLONE:1110058J24, FULL 
INSERT SEQUENCE - Mus musculus 
(Mouse), 152 aa. 


1..123 
1..149 


103/149 (69%) 
112/149 (75%) 


2e-54 


LNHUGB 


galectin 1 [validated] - human, 135 aa. 


1..99 
1..99 


99/99(100%) 
99/99(100%) 


3e-54 


E981607 


AMINO ACID SEQUENCE FOR GBP - 
vectors, 135 aa. 


1..99 
1..99 


99/99(100%) 
99/99(100%) 


3e-54 


CAA01896 


BETA GALACTOSIDE BINDING 
PROTEIN - synthetic construct, 135 aa. 


1..99 
1..99 


98/99 (98%) 
98/99 (98%) 


1e-53 


P09382 


Galectin-1 (Beta-galactoside-binding 
lectin L-14-I) (Lactose-binding lectin 1) 
(S-Lac lectin 1) (Galaptin) (14 kDa 
lectin) (HPL) (HBL) - Homo sapiens 
(Human), 134 aa. 


2..99 
1..98 


98/98 (100%) 
98/98 (100%) 


1e-53 



PFam analysis predicts that the NOV50a protein contains the domains shown in 
the Table 50E. 



Table 50E. Domain Analysis of NOV50a 


Pfam Domain 


NOV50a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Gal-bind_lectin 


3..113 


65/139 (47%) 
100/139 (72%) 


4.6e-45 



Example 51. 



The NOV51 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 51A. 






Table 51A. NOV51 Sequence Analysis 



SEQ ID NO: 155 



NOV51a, 
CG97400-01 DNA Sequence 



GCAGGGCCCCTGAAGi 
TCCATCGCAAGGAGACGTCAGCCGGCAGCCGGTCTCACAGGCGTACCATACGGCTACA 
GAAACAGGGCGGTGACAGCGACAGAGAGCAAGCGGGAACTCCCTCCACCAGCCAGCGC 

AGGAGACGGCGCTCACCGGCTGCACCTGCGCCGTTGACGCCACCGGGGCCGGCAGACA 
GACCCGCGGCGCTGGCTGGTGGAGGGAGTTCCCGCTTGCTCTCTGTCGCTGTCACCGC 
CCTGTTTCTGTAGCCGTATGGTACGCCTGTGAGACCGGCTGCCGGGTGACGTCTCCTT 
GCGATGGAGCATATCCGGACGACCAAGGTCGAACAAGTAAAATTACTTGACCGATTCA 
GTACCAGCAACAAGTCATTAACAGGAACACTGTATCTTACGGCTACACATCTATTATT 
TATCGACTCTCATCAAAAAGAAACCTGGATATTACACCACCATATTGCCTCAGTAGAG 
AAACTTGCTTTGACTACTTCTGGATGCCCCCTTGTGATACAGTGCAAGAACTTCAGAA 
CTGTGCATTTCATTGTTCCCAGAGAAAGAGATTGCCATGATATTTACAACTCTTTGCT 
ACAACTGTCAAAACAAGCAAAATATGAAGATCTCTATGCATTTTCTTATAATCCCAAA 
CAAAATGATTCAGAACGACTACAAGGCTGGCAGCTCATTGATCTCGCTGAGGAATATA 
AGAGGATGGGAGTGCCAAACTCACACTGGCAGTTGTCTGATGCCAACCGGGACTACAA 
GATTTGTGAAACTTACCCCAGAGAACTTTATGTTCCCCGGATAGCAAGCAAACCAATA 
ATTGTTGGTAGTTCCAAGTTCCGGAGCAAGGGAAGATTCCCAGTTCTTTCCTACTATC 
ATCAAGATAAGGAGGCTGCCATTTGTCGATGTAGTCAGCCACTCTCTGGATTCAGTGC 
CAGGTGCCTGGAGGATGAACATTTGCTTCAAGCCATTAGTAAAGCCAATCCAGTCAAT 
CGCTATATGTACGTCATGGATACCAGGCCAAAACTGAATGCAATGGCCAACAGAGCAG 
CTGGAAAAGGTTATGAAAATGAAGACAACTATTCCAATATTAGATTTCAGT' 
AATTGAAAATATTCATGTCATGAGGTCCAGCCTTCAGAAATTATTGGAAGTCAATGGC 
ACTAAAGGGCTTTCTGTCAATGATTTCTACTCCGGTTTGGAGAGCTCGGGATGGCTTC 
GCCATATCAAAGCTGTTATGGATGCTGCAGTCTTCTTGGCCAAAGCAATAACAGTTGA 
ITAGGACTTCCC AG GT TT GT 
TCCCTGGGTTCTCTTTTATTGGATTCCTACTACAGGACAATCAAAGGATTCATGGTTT 
TAATAGAAAAGGATTGGATCTCTTTTGGACATAAATTTTCAGAGAGGTGTGGCCAGTT 
'CTTGGAATGTGTGTGG 
CATTTGACCGAACAGTTTCCACAAGCCTTTGAATTCAGTGAAGCATTTCTTCTTCAGA 
TCCATGAGCATATTCATTCATGCCAGTTTGGAAAC 
AAGAGAAGAGCTCAAGTTGAAGGAGAAGACTTATTCCCTGTGGCCATTTCTTTTGGAA 
GACCAAAAGAAGTACTTAAATCCTCTCTACAGTTCCGAATCTCACAGATTTACAGTTT 
TGGAGCCAAATACAGTATCTTTCAATTTTAAGTTTTGGAGGAACATGTACCATCAGTT 
TGATCGAACACTGCATCCTAGGCAGTCTGTATTTAATATAATTATGAATATGAATGAG 
CAAAATAAACAATTAGAGAAAGATATTAAAGACCTAGAATCTAAAATTAAACAACGCA 
AAAATAAGCAAACAGATGGCATCCTCACCAAGGAATTGTTACATTCAGTTCATCCTGA 
ATCACCTAACCTCAAAACTTCCCTGTGTTTTAAAGAGCAGACTCTGCTACCCGTAAAT 
GATGCTCTTCGAACTATAGAGGGCAGCAGCCCGGCAGATAATCGTTATAGTGAATATG 
CAGAAGAGTTTTCTAAATCAGAACCTGCTGTGGTCAGCTTAGAGTATGGTGTGGCAAG 
AATGACTTGTTAGACTCATAGAGTTTTTTCTGCAATGATTGCAGTACAAGAAAAGGAT 
TATTGTGAGGATGGTCTGTAAGCATAACCAAAAGGAATTTGTCTAATAACAATTTTAG 
GGTTTAACAGTAGGCTAATAGTTGAAGGAAGGATAATAACTACCCTTGTGAGAGAAAT 
ATGTCATTTTAATTGCATTTCCAGCAAGGAATGACATTCAGTTCTGTAAGAAATGAGT 
GGTATTTGATGTATTTACTCAAAACACAATTTGCACTGTACACTAGTGAATTGACGTT 
TATGATTTATGTTAATTCAGCCAAACATAAATAACCTTCCTTAAGTACAATTTAACTT 
CAAGAAAACAAAATTTGACAACATAGTTTCTTAATAAATGATATGGCATGTACTTTCA 
ATTATGTAGCTTTGTAACTATGAATATTTACATATTTTGCCTTTTAGTGATATTTAAT 
GTTAAAGTGCCATGAAAAATATTTCTAAGAAAGCCTTAAATTCCCAGTGGATTCTTTJ 
ICTTAAGTTTTACAGCCTACAACAAGAT 
•GTTTTGTTTGTAAAGAATTGTGCTCTC 
TTCTATATAAACACTTGTAGAAGTACACTGTTCACGTTTAGCCTGCCCCACTTTTGTA 
TTCAAAATTAATGAAACTGAAGGTTTATTCTGATCATAATTTGTTTAGTGCTACATTT 
GATAATTTATTATTTACAGCTTAGAATATTGATTTCTTGAATACGTATAAGCACATTT 
GACTGTCTTTTATATATGGATTACTGCATTCCATTGATTCTTATTTTGTGGTTGGCTT 
TATTTCTTTCACAACTGTGTAAGTTTAAAGAGCTAAAGCTCTAAAACTGTTCTC 
ACAATGAATAGTACATGTATATGTATATTTTTAAACTGCCTTATTGCTCAATG? 
CTGTTCCTGGAGTATCTAACTTTCAGTTTACTCAAAAACTTTTGGGTAAATAAAAAAG 
GGAAGTACAGATAGTTTAGGTTGTGCATGGTTGCATGAATTTTGAAGTCATTTCTATG 
TGGAGCATTATTTTTCTTCTTGTTAAATATAGAAAAAAAAAGAGATTCTAGTACAAAG 
TTACTGTTTAACAAAAGCAACATAAACTCTGGGAAAGATTTCATTTTGCCATGTTATA 
TTTACTGTTTATTCTGTGTACTAGTACATATCTTTAAATTACCAAAAAACAAGAAACA 
AAACATAAAAACCCCAAAACTATCACTTGGAATTAGCAATATCACCCAACTGGCTTTA 
AAATTGAAAATTTAAATAACATGGTGGCATGAAGTACAATTCGAGTATTAGGCAATTT 
GCATAGTGTTCCTCATGCTACTTTCTGTTACACCTCTATTATATTAGTTTTGAATATA 
AACATCTTTTTCAGACCAAAAAAAACTTTATTGTATGAGAGCTTATCTTATCCTGTTT 
ATTTTTCCAATGCTTTTCTGTAATACATTATGTAATTTAAAAAATATTCCTTTTTAAA 
CAGCAACAGAAATGCACTATAAAATATAGTATGTGATTAACCAATCCTGCTTCCATAT 
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TTAAGC^CTGGGAM'GGAAACrTAATCTCTGTGACTACAAAGGGAAGTTTTTGTGCCT 
TGGTGTTCCAGTCACTGATTGTGGTTTTAGAATCTTCTGTGGCTGACTTCTTGTATTC 
AACTCTGGTTTATTATCITrAATCTTaiACTTCAGACATACATTTGTGTGTTTATATGC 
TCACAGGTGGACTGAGAAATCAGTTACATCTTAAGTGACCTACAGGGTATATGTTGGC 
AAAAGCAGACTGTGTATATGTCTTATAAAGTTGAATTTATGTTCAGTGTGTTTGGAAG 
TGTATAGCATGTAAATTATTTCATATATGATTTAAAGGTAATTAAATGTTCACATTTT 
ACTTTGAATGTTTTTCTTCTGGATAAAACAAATGGAATCAAAAAAAAAAAAAAAAAftA 
AAATT 




ORF Start: ATG at 410 


ORF Stop: TAG at 2273 




SEQ ID NO: 156 


621 aa | MW at 71922.1kD 


NOV51a, 
CG97400-01 Protein Sequence 


MEHIRTTKVEQVKLLDRFSTSNKSLTGTLYLTATHLLFIDSHQKETWILHHHIASVEK 
LALTTSGCPLVIQCKNFRTVHFIVPRERDCHDIYNSLLQLSKQAKYEDLYAFSYNPKQ 
NDSERLQGWQLIDLAEEYKRMGVPNSHWQLSDANRDYKICETYPRELYVPRIASKPII 
VGSSKFRSKGRFPVLSYYHQDKEAAICRCSQPLSGFSARCLEDEHLLQAISKANPVNR 
YMYVMDTRPKLNAMANRAAGKGYENEDNYSNIRFQFVGIENIHVMRSSLQKLLEVNGT 
KGLSVNDFYSGLESSGWLRHIKAVMDAAVFLAKAITVENASVLVHCSDGWDRTSQVCS 
LGSLLLDSYYRTIKGFMVLIEKDWISFGHKFSERCGQLDGDPKEVSPVFTQFLECVWH 
LTEQFPQAFEFSEAFLLQIHEHIHSCQFGNFLGNCPKEREELKLKEKTYSLWPFLLED 
QKKYLNPLYSSESHRFTVLEPNTVSFNFKFWRNMYHQFDRTLHPRQSVFNIIMNMNEQ 
NKQLEKDIKDLESKIKQRKNKQTDGILTKELLHSVHPESPNLKTSLCFKEQTLLPVND 
ALRTIEGSSPADNRYSEYAEEFSKSEPAVVSLEYGVARMTC 



Further analysis of the NOV51a protein yielded the following properties shown in 
Table 51B. 



Table 51B. Protein Sequence Properties NOV51a 


PSort analysis: 


0.3600 probability located in mitochondrial matrix space; 0.1723 probability 
located in microbody (peroxisome); 0.1000 probability located in lysosome 
(lumen); 0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV51a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 51C. 
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Table 51C. Geneseq Results for NOV51a 


Geneseq 
Identifier 



Protein/Organism/Length [Patent #, Date] 


NOV51a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB73229 


Human phosphatase MTMR7 h - Homo 
sapiens, 629 aa. [WO200112819-A2, 22- 
FEB-2001] 


8..534 
82..613 


338/533 (63%) 
430/533 (80%) 


0.0 


ABB66818 


Drosophila melanogaster polypeptide 

SEQ ID NO 27246 - Drosophila 
melanogaster, 676 aa. [WO200171042- 
A2, 27-SEP-2001] 


14..542 
1..525 


282/530 (53%) 
386/530 (72%) 


e-176 


ABB59964 


Drosophila melanogaster polypeptide 

SEQ ID NO 6684 - Drosophila 
melanogaster, 676 aa. [WO200171042- 
A2, 27-SEP-2001] 


14..542 
1..525 


282/530 (53%) 
386/530 (72%) 


e-176 


ABB65465 


Drosophila melanogaster polypeptide 

SEQ ID NO 23187 - Drosophila 
melanogaster, 639 aa. [WO200171042- 
A2, 27-SEP-2001] 


52..542 
2..488 


260/491 (52%) 
355/491 (71%) 


e-163 


AAW36451 


Human tyrosine phosphatase related 
protein - Homo sapiens, 621 aa. 
[WO9735015-A2, 25-SEP-1997] 


22..510 
74..567 


197/501 (39%) 
297/501 (58%) 


4e-99 



In a BLAST search of public sequence databases, the NOV51a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 51D. 
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Table 51D. Public BLASTP Results for NOV51a 


Accession 
Number 


Protein/Organism/Length 


NOV51a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96P80 


MYOTUBULARIN RELATED 
PROTEIN 6 - Homo sapiens 
(Human), 621 aa. 


1..621 
1..621 


620/621 (99%) 
620/621 (99%) 


0.0 


Q8VE11 


HYPOTHETICAL 70.9 KDA 
PROTEIN - Mus musculus 
(Mouse), 617 aa. 


1..621 
1..617 


558/621 (89%) 
588/621 (93%) 


0.0 


Q9Y217 


Myotubularin related protein 6 (EC 
3.1.3.48) - Homo sapiens (Human), 
465 aa (fragment). 


157..621 
1..465 


464/465 (99%) 
464/465 (99%) 


0.0 


Q96EF0 



SIMILAR TO HYPOTHETICAL 
PROTEIN FLJ20126 - Homo 
sapiens (Human), 704 aa. 


1..540 
1..541 


332/543 (61%) 
430/543 (79%) 


0.0 


Q9W1Q6 


BCDNA:GH04637 PROTEIN - 
Drosophila melanogaster (Fruit 
fly), 676 aa. 


14..542 
1..525 


282/530 (53%) 
386/530 (72%) 


e-175 



PFam analysis predicts that the NOV51a protein contains the domains shown in 
the Table 51E. 



Table 51E. Domain Analysis of NOV51a 



NOV51a Match Region 



Identities/ 
Similarities 
for the Matched Region 



No Significant Matches Found 



Example 52. 

The NOV52 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 52A. 
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Table 52A. NOV52 Sequence Analysis 




SEQ ID NO: 157 


807 bp 


NOV52a, 
CG97462-01 DNA Sequence 


ATGGCTGCCAAAGTGTTTGAGTTCATCAGCAAGTTTGGCCTGGCCTTAGCTGTTGCAG 
GAGGCCTGAATGAACTCTGCCTTAATGTGGATGCTGGGCACAGAGCTGTCATCTTTGA 
CCTATTCCGTGGAGTACAGGACATTGTGGTAGGGGAAAGGACTCACTTTCTCATTCCA 
TGGGTACAGAAACCAATTATCTTTGACTGCCCTTCTCGACCACGTAATGTGCCAGCCA 
TCACTGGTAGCAAAGATTTACAGAATGTCAACATCACACTGCTCATCCTCTTCTGGCC 
TGTCACTAGCCAGTTTCCTTGCATCTTCACCAGCATCAGAGAGGACTATGATGAGCAG 
GTGCTGCCATCCGTCACGACCAAGATCCTCAAGTCCGTGGTGGCTAGCTTTGATGCTG 
GAGAACTAATCACCCAGAGAGAGCTGGTCTCCAGGGAGCAATTACTTACAGAGCGAGC 
AGCCACCTTTGGGCTCATCCTGGATGACGTGTCCTTGACACATCTGACCTTCGGGAAG 
GAGTTCACAGAAGCGGTGGAAGCCAAACAGGTAGCTCAGCAGGAAGCAAGGGCCAGAT 




CTCCAAGGCAGCTGAGCTGATTGCCAACTCACTGGCCACCGCAAGGGACCGCCTGATG 
GAGCTCTGCAAGCTGGAAGCTGCGGAGGACATCGCGTACCAGCTCTCACGCTCTCGGA 
ACATCACCTATCCGCCGGCTGGGCAGTCCGTGCTCCTCCAGCTGCCCCAGTGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 805 




SEQ ID NO: 158 


268 aa | MW at 29706.9kD 


NOV52a, 
CG97462-01 Protein Sequence 


MAAKVFEFISKFGLALAVAGGLNELCLNVDAGHRAVIFDLFRGVQDIVVGERTHFLIP 
WVQKPIIFDCPSRPRNVPAITGSKDLQNVNITLLILFWPVTSQFPCIFTSIREDYDEQ 
VLPSVTTKILKSVVASFDAGELITQRELVSREQLLTERAATFGLILDDVSLTHLTFGK 
EFTEAVEAKQVAQQEARARFWEKAEQQKKVAIISAEGYSKAAELIANSLATARDRLM 
ELCKLEAAEDIAYQLSRSRNITYPPAGQSVLLQLPQ 



Further analysis of the NOV52a protein yielded the following properties shown in 
Table 52B. 



Table 52B. Protein Sequence Properties NOV52a 


PSort analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.2632 probability 
located in lysosome (lumen); 0.1000 probability located in endoplasmic reticulum 
(lumen); 0.1000 probability located in outside 


SignalP analysis: 


Cleavage site between residues 22 and 23 



A search of the NOV52a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 52C. 
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Table 52C. Geneseq Results for NOV52a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV52a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG73845 


Human colon cancer antigen protein 
SEQ ID NO:4609 - Homo sapiens, 279 
aa. [WO200122920-A2, 05-APR- 
2001] 


1..268 
8..279 


237/272 (87%) 
243/272 (89%) 


e-120 


AAB43874 


Human cancer associated protein 
sequence SEQ ID NO: 1319 - Homo 
sapiens, 279 aa. [WO200055350-A1, 
21-SEP-2000] 


1..268 
8..279 


237/272 (87%) 
243/272 (89%) 


e-120 


AAW54352 


Heat shock 27 kD protein and 
prohibitin (admixture) - Homo sapiens, 
471 aa. [WO9810291-A1, 12-MAR- 
1998] 


1..268 
200..471 


237/272 (87%) 
243/272 (89%) 


e-120 


AAR42215 


Human prohibitin - Homo sapiens, 272 
aa. [JP05271294-A, 19-OCT-1993] 


1..268 
1..272 


237/272 (87%) 
243/272 (89%) 


e-120 


AAR13466 


Prohibitin - Rattus rattus, 272 aa. 
[USN7612674-N, 16-JUL-1991] 


1..268 
1..272 


236/272 (86%) 
243/272 (88%) 


e-120 



In a BLAST search of public sequence databases, the NOV52a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 52D. 



Table 52D. Public BLASTP Results for NOV52a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV52a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P35232 


Prohibitin - Homo sapiens 
(Human), 272 aa. 


1..268 
1..272 


237/272 (87%) 
243/272 (89%) 


e-120 


P24142 


Prohibitin (B-cell receptor 
associated protein 32) (BAP 32) - 
Mus musculus (Mouse), and, 272 


1..268 
1..272 


236/272 (86%) 
243/272 (88%) 


e-120 


Q9VIZ4 


LETHAL (2) 37CC PROTEIN - 
Drosophila melanogaster (Fruit 
fly), 276 aa. 


1..267 
1..271 


178/271 (65%) 
212/271 (77%) 


4e-91 


Q9BKU4 


HYPOTHETICAL 30.0 KDA 
PROTEIN - Caenorhabditis 
elegans, 275 aa. 


2..266 
5..273 


157/269 (58%) 
203/269 (75%) 


2e-78 


O01720 


Prohibitin-Like Molecule 
TC-PRO-1 - Toxocara canis, 274 
aa. 


2..267 
4..273 


155/270 (57%) 
201/270 (74%) 


1e-76 



PFam analysis predicts that the NOV52a protein contains the domains shown in 
the Table 52E. 



Table 52E. Domain Analysis of NOV52a 


Pfam Domain 


NOV52a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Band_7 


12..203 


49/206 (24%) 
169/206(82%) 


4.6e-66 



Example 53. 

The NOV53 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 53A. 



Table 53A. NOV53 Sequence Analysis 




SEQ ID NO: 159 | 1527 bp 


NOV53a, 
CG97472-01 DNA Sequence 


ATTGAGGCCATGGGGACAAAGAAGGTCACCCCATCTCTGATCTTCGCCATC\ :A7 TTG 
CTACAATCGGCTCTTTCCAATTTGGCTACAACACTGGGGTCATCAATGCTCCTGAGAT 
GATCATAAGGGAATTTATCAACACTTTGAAGGACAAGGCAAATACCCCTCCCTCTGAG 
ATGTTGTGCTTGTCCCTTTGGTGCTTGTCTATGGCCATATTCTCCATTGGTTGTATGA 
TTAGCTCCTTTTCTGTTGGACTGTTTGTCAACAGCTTTGACAGGCGTAATTCAATGCT 
TACTGTCAACCTGTTGGCTGCCACTGGTGGCTGCCTTATGGGACTGTGTAAAGTAGCT 




GCACAGGTTTTGTGCCCACGTACATCGGAGACATCTCACCTATTGGCCTGCAAGGTGT 
CTTTGGCACTCTCAACCAGCTGGGCATTGTTGTTGGAATTCTGGTGGCCCAGATCTTT 
GATCTGGAATTCATCCTGGGGTCTGAAGACCTATGGCCTGTGCTATTAGGCTTTCCCA 
TCTCTCCTGCTATGCTACAAAGTGCAGCCCTTCCTTTTTGCCCTGAAAGTCCCAGATT 
CTTGCTCATTAACAGAAAAGAAGAGGAGAATGCTAAGGAGATCCTCCAGTGGTTGTGG 
GGCACCCAGGATGTATCCCAAGACATCCAGGAGATGAAAGATGAGAGTGCAAGGATGG 

CATCATCATTTCCATCATGCTCCAGCTCTCTCAGCAGCTCTCTGTAATCAATACTGTG 
TTCTATTACTCAACAGGAATCTTTAAGGGTGCAGGTGTTCAAGAGCCCATCTGTGTCA 
CCATTGGTGTGGGTGTGGTTAATACTATCTTCACTATAGTTTCCCTATTTCTAGTGGA 
AAGGGCAGTAAGAAGGACTCTACATATGATAGGCCTTGGAGGGATGGCTTTTTGTTCC 
ATCCTCATGTCTGTTTGTTTGTTATTGAAGGATGAGTGTAATGGGATAAGCTTTGTCT 
GTATTGGGGCTATCTTGGTCTTTGTGGTCTTCTTTGAAATTGGGCCAGACCACATTCC 
CTGGTTTATTGTGGCTGAACTCTTCAGCCAGGGCCCTGGCCCAGCTGTGATGGCAGTG 
GCCGGCTGCTCCACCTGGACCTCCAACTTCCTAGTCAGATTGCTTTTCCCTTCTGCTG 
CTTACTATTTAGGAGCCTACGTTTTTATTATCTTCACCGACTTCCTCATTACCTTCTT 
GATCTTTACCTTCTTCAAAGTCCCTGAGATGTGTTACAGGACTTTCAAGGATATCACA 
AAGGCCTTTGAGGGGCAGGCACACGATGCAAATAGATCTGGAAGGGACTGTGTCATGG 
AGATGAACAGTATCCAGCCTGCTAAGGAGACCACCACCAATGTCTAAATCATGCCTCC 




ORF Start: ATG at 10 


ORF Stop: TAA at 1495 




SEQ ID NO: 160 


495 aa | MW at 54631.8kD 


NOV53a, 
CG97472-01 Protein Sequence 


MGTKKVTPSLIFAITIATIGSFQFGYNTGVINAPEMIIREFINTLKDKANTPPSEMLC 
LSLWCLSMAIFSIGCMISSFSVGLFVNSFDRRNSMLTVNLLAATGGCLMGLCKVAELV 
EILILAYLVIGLFCRLCTGFVPTYIGDISPIGLQGVFGTLNQLGIVVGILVAQIFDLE 
FILGSEDLWPVLLGFPISPAMLQSAALPFCPESPRFLLINRKEEENAKEILQWLWGTQ 
DVSQDIQEMKDESARMAQEKQVTVLELFRVSSYRQPIIISIMLQLSQQLSVINTVFYY 
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STGIFKGAGVQEPICVTIGVGWNTIFTIVSLFLVERAVRRTLHMIGLGGMAFCSILM 
SVCLLLKDECNGISFVCIGAILVFVVFFEIGPDHIPWFIVAELFSQGPGPAVMAVAGC 
STWTSNFLVRLLFPSAAYYLGAYVFIIFTDFLITFLIFTFFKVPEMCYRTFKDITKAF 
EGQAHDANRSGRDCVMEMNSIQPAKETTTNV 



Further analysis of the NOV53a protein yielded the following properties shown in 
Table 53B. 



Table 53B. Protein Sequence Properties NOV53a 


PSort analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi body; 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP analysis: 


Cleavage site between residues 22 and 23 



A search of the NOV53a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 53C. 



Table 53C. Geneseq Results for NOV53a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV53a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY27289 


Glucose transporter protein GLUT3 - 
Homo sapiens, 494 aa. [US5942398-A, 
24-AUG-1999] 


1..491 
1..492 


375/492 (76%) 
420/492 (85%) 


0.0 


AAR11360 


Glucose Transporter Protein from CHO 
cells - Cricetulus sp, 492 aa. 
[WO9103554-A, 21-MAR-1991] 


3..474 
5..478 


276/474 (58%) 
349/474 (73%) 


e-154 


ABB57244 


Mouse ischaemic condition related 
protein sequence SEQ ID NO:652 - 

Mus musculus, 492 aa. 
[WO200188188-A2, 22-NOV-2001] 


3..474 
5..478 


276/474 (58%) 
347/474 (72%) 


e-154 


AAW17835 


Human glucose transporter GLUT-1 - 
Homo sapiens, 492 aa. [WO9715668- 
A2, 01-MAY-1997] 


3..474 
5..478 


275/474 (58%) 
347/474 (73%) 


e-153 


AAB30522 


Amino acid sequence of a consensus 
GLUT polypeptide - Synthetic, 493 aa. 
[US6136547-A, 24-OCT-2000] 


6..487 
10..490 


272/482 (56%) 
341/482(70%) 


e-150 



In a BLAST search of public sequence databases, the NOV53a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 53D. 
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Table 53D. Public BLASTP Results for NOV53a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV53a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P11169 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Homo sapiens (Human), 496 aa. 


1..495 
1..496 


418/496(84%) 
444/496 (89%) 


0.0 


AAL89710 


GLUCOSE TRANSPORTER 14 
LONG FORM - Homo sapiens 
(Human), 520 aa. 


3..495 
27..520 


408/494 (82%) 
436/494 (87%) 


0.0 


AAL89709 


GLUCOSE TRANSPORTER 14 
SHORT FORM - Homo sapiens 
(Human), 497 aa. 


4..495 
5..497 


407/493 (82%) 
436/493 (87%) 


0.0 


P47842 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Canis familiaris (Dog), 495 aa. 


1..493 
1..494 


390/494 (78%) 
431/494 (86%) 


0.0 


P47843 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Ovis aries (Sheep), 494 aa. 


1..491 
1..492 


375/492 (76%) 
420/492 (85%) 


0.0 



PFam analysis predicts that the NOV53a protein contains the domains shown in
Table 53E.



Table 53E. Domain Analysis of NOV53a 


Pfam Domain 


NOV53a Match Region 


Identities/
Similarities
for the Matched Region


Expect Value 


sugar_tr 


12..464 


173/489 (35%)
383/489 (78%) 


3.5e-143 



Example 54. 



The NOV54 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 54A.

Table 54A. NOV54 Sequence Analysis




SEQ ID NO: 161


2049 bp 


NOV54a, 
CG97528-01 DNA Sequence 


TACACATTTTCCCTCATCTCTCTCACTGACACACACAAAACTTTTGGCCATCCXGGGG 

AATTCTGTGACTCTATCCTCTAATTGCAAAAGACTGGAAATTCTAATTCATTCCAACT 

CTTCCCTTGCAGACAAGAGAACAATGCCCTGGACATGGCTCCAGAGATCCACATGACA 

GGCCCAATGTGCCTCATTGAGAACACTAATGGGGAACTGGTGGCGAATCCAGAAGCTC 

TGAAAATCCTGTCTGCCATTACACAGCCTGTGGTGGTGGTGGCAATTGTGGGCCTCTA 

CCGCACAGGAAAATCCTACCTGATGAACAAGCTAGCTGGGAAGAATAAGGGCTTCTCT 

CTGGGCTCCACAGTGAAATCTCACACCAAAGGAATCTGGATGTGGTGTGTGCCTCACC 

CCAAAAAGCCAGAACACACCTTAGTCCTGCTTGACACTGAGGGCCTGGGAGATGTAAA 

GAAGGGTGACAACCAGAATGACTCCTGGATCTTCACCCTGGCCGTCCTCCTGAGCAGC 

ACTCTCGTGTACAATAGCATGGGAACCATCAACCAGCAGGCTATGGACCAACTGCAGT 

ATGTGACAGAGCTGACACATCGAATCCGATCAAAATCCTCACCTGATGAGAATGAGAA 

TGAGGATTCAGCTGACTTTGTGAGCTTCTTCCCAGATTTTGTGTGGACACTGAGAGAT 

TTCTCCCTGGACTTGGAAGCAGATGGACAACCCCTCACACCAGATGAGTACCTGGAGT 

ATTCCCTGAAGCTAACGCAAGGTACCAGTCAAAAAGATAAAAATTTTAATCTGCCCCG 

ACTCTGTATCCGGAAGTTCTTCCCAAAGAAAAAATGTTTTGTCTTCGATCTGCCCATT 

CACCGCAGGAAGCTTGCCCAGCTTGAGAAACTACAAGATGAAGAGCTGGACCCTGAAT 

TTGTGCAACAAGTAGCAGACTTCTGTTCCTACATCTTTAGCAATTCCAAAACTAAAAC 

TCTTTCAGGAGGCATCAAGGTCAATGGGCCTCGTCTAGAGAGCCTAGTGCTGACCTAT 

ATCAATGCTATCAGCAGAGGGGATCTGCCCTGCATGGAGAACGCAGTCCTGGCCTTGG 

CCCAGATAGAGAACTCAGCCGCAGTGCAAAAGGCTATTGCCCACTATGACCAGCAGAT 

GGGCCAGAAGGTGCAGCTGCCCGCAGAAACCCTCCAGGAGCTGCTGGACCTGCACAGG 

GTTAGTGAGAGGGAGGCCACTGAAGTCTATATGAAGAACTCTTTCAAGGATGTGGACC 

ATCTGTTTCAAAAGAAATTAAAGGCCCAGCTAGACAAAAAGCGGGATGACTTTTGTAA 

ACAGAATCAAGAAGCATCATCAGATCGTTGCTCAGCTTTACTTCAGGTCATTTTCAGT 

CCTCTAGAAGAAGAAGTGAAGGCGGGAATTTATTCGAAACCAGGGGGCT!\T" 3 T< 

TTATTCAGAAGCTACAAGACCTGGAGAAAAAGTACTATGAGGAACCAAGGAAGGGTCC 

TAAGGCTGAAGAGATTCTGCAGACATACTTGAAATCCAAGGAGTCTGTGA'JCGATGCA 

ATTCTACAGACAGACCAGATTCTCACAGAAAAGGAAAAGGAGATTGAAGGTGTGGAAT 

GTGTAAAAGCTGAATCTGCACAGGCTTCAGCAAAAATGGTGGAGGAAATGCAAATAAA 

GTATCAGCAGATGATGGAAGAGAAAGAGAAGAGTTATCAAGAACATGTGAAACAATTG 

ACTGAGAAGATGGAGAGGGAGAGGGCCCAGTTGCTGGAAGAGCAAGAGAAGACCCTCA 

CTAGTAAACTTCAGGAACAGGCCCGAGTACTAAAGGAGAGATGCCAAGGTGAAAGTAC 

CCAACTTCAAAATGAGATACAAAAGCTACAGAAGACCCTGAAAAAAAAAACCAAGAGA 

TATATGTCGCATAAGCTAAAGATCTAAACAACAGAGCTTTTCTGTCATCCTAACCCAA 

GGCATAACTGAAACAATTTTAGAATTTGGAACAAGTGTCACTATATTTGATAATAATT 

AGATCTTGCATCATAACAC 




ORF Start: ATG at 151 


ORF Stop: TAA at 1939




SEQ ID NO: 162


596 aa | MW at 68176.6kD


NOV54a, 
CG97528-01 Protein Sequence 


MAPEIHMTGPMCLIENTNGELVANPEALKILSAITQPVVVVAIVGLYRTGKSYLMNKL
AGKNKGFSLGSTVKSHTKGIWMWCVPHPKKPEHTLVLLDTEGLGDVKKGDNQNDSWIF
TLAVLLSSTLVYNSMGTINQQAMDQLQYVTELTHRIRSKSSPDENENEDSADFVSFFP
DFVWTLRDFSLDLEADGQPLTPDEYLEYSLKLTQGTSQKDKNFNLPRLCIRKFFPKKK
CFVFDLPIHRRKLAQLEKLQDEELDPEFVQQVADFCSYIFSNSKTKTLSGGIKVNGPR
LESLVLTYINAISRGDLPCMENAVLALAQIENSAAVQKAIAHYDQQMGQKVQLPAETL
QELLDLHRVSEREATEVYMKNSFKDVDHLFQKKLKAQLDKKRDDFCKQNQEASSDRCS
ALLQVIFSPLEEEVKAGIYSKPGGYCLFIQKLQDLEKKYYEEPRKGPKAEEILQTYLK
SKESVTDAILQTDQILTEKEKEIEGVECVKAESAQASAKMVEEMQIKYQQMMEEKEKS
YQEHVKQLTEKMERERAQLLEEQEKTLTSKLQEQARVLKERCQGESTQLQNEIQKLQK
TLKKKTKRYMSHKLKI
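
The ORF coordinates and the protein length reported in Table 54A can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `orf_codon_count` is hypothetical, and it assumes that both reported positions are 1-based and refer to the first base of the start codon and the first base of the stop codon, a convention the table does not state explicitly.

```python
def orf_codon_count(start: int, stop: int) -> int:
    """Return the number of coding codons in an ORF.

    Assumption (not stated in the source): `start` is the 1-based position
    of the first base of the start codon and `stop` is the 1-based position
    of the first base of the stop codon.
    """
    span = stop - start
    if span <= 0 or span % 3 != 0:
        raise ValueError(f"coordinates {start}..{stop} are not in frame")
    return span // 3


# NOV54a values from Table 54A: ATG at 151, TAA at 1939, 596 aa.
nov54a_codons = orf_codon_count(151, 1939)
print(nov54a_codons)  # 596
```

Under that assumption the coordinates agree with the 596 aa length given for SEQ ID NO: 162.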
Further analysis of the NOV54a protein yielded the following properties shown in
Table 54B. 



Table 54B. Protein Sequence Properties NOV54a 


PSort analysis: 


0.8000 probability located in nucleus; 0.6000 probability located in endoplasmic 
reticulum (membrane); 0.3000 probability located in microbody (peroxisome); 
0.1000 probability located in mitochondrial inner membrane


SignalP analysis: 


No Known Signal Sequence Predicted 
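
A PSort line of the kind shown in Table 54B can be turned into a small lookup table for comparison across clones. This is a minimal sketch, not part of the original analysis: the helper name `parse_psort` and the regular expression are assumptions made for illustration, and the input string is the NOV54a PSort text with the OCR spacing normalized.

```python
import re
from typing import Dict

# Matches entries such as "0.8000 probability located in nucleus".
PSORT_ENTRY = re.compile(r"(\d\.\d+)\s+probability located in\s+([^;]+)")


def parse_psort(text: str) -> Dict[str, float]:
    """Map each predicted location to its reported PSort probability."""
    return {loc.strip(): float(prob) for prob, loc in PSORT_ENTRY.findall(text)}


nov54a_psort = (
    "0.8000 probability located in nucleus; "
    "0.6000 probability located in endoplasmic reticulum (membrane); "
    "0.3000 probability located in microbody (peroxisome); "
    "0.1000 probability located in mitochondrial inner membrane"
)

scores = parse_psort(nov54a_psort)
best_location = max(scores, key=scores.get)
print(best_location, scores[best_location])
```

The highest-scoring entry is only the top PSort prediction; the table itself lists all four candidates.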



A search of the NOV54a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 54C. 



Table 54C. Geneseq Results for NOV54a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, Date] 


NOV54a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM41334 


Human polypeptide SEQ ID NO 6265 - 
Homo sapiens, 605 aa. [WO200153312-
A1, 26-JUL-2001]


1..596
11..605


591/596 (99%) 
592/596 (99%) 


0.0 


AAM41333 


Human polypeptide SEQ ID NO 6264 - 
Homo sapiens, 605 aa. [WO200153312-
A1, 26-JUL-2001]


1..596
11..605


591/596 (99%) 
592/596 (99%) 


0.0 


ABB12327 


Human guanylate binding protein 
homologue, SEQ ID NO:2697 - Homo
sapiens, 605 aa. [WO200157188-A2, 09-
AUG-2001]


1..596
11..605


591/596 (99%)
592/596 (99%) 


0.0 


AAU25444 


Human mddt protein from clone 
LG:444850.9:2000MAY19 - Homo 
sapiens, 570 aa. [WO200162922-A2, 30-
AUG-2001] 


1..554
11..563


547/554 (98%) 
549/554 (98%) 


0.0 


AAM39547 


Human polypeptide SEQ ID NO 2692 - 
Homo sapiens, 592 aa. [WO200153312- 
A1, 26-JUL-2001]


1..585 
1..586 


514/587 (87%) 
550/587 (93%) 


0.0 
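
Each "Identities/Similarities" cell in Table 54C pairs a fraction with a percentage. The sketch below shows one way to read such a cell into numbers; the names `MatchCell`, `parse_match_cell` and `fraction` are hypothetical. The reported percentage is kept as given rather than recomputed, because the rounding convention used for the tables is not stated.

```python
import re
from typing import NamedTuple


class MatchCell(NamedTuple):
    matched: int           # numerator, e.g. identical residues
    aligned: int           # denominator, length of the matched region
    reported_percent: int  # percentage exactly as printed in the table


# Tolerates the variable spacing seen in the extracted tables,
# e.g. "591/596 (99%)" and "591/596(99%)".
CELL = re.compile(r"(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)\s*%\s*\)")


def parse_match_cell(cell: str) -> MatchCell:
    """Parse a cell such as '591/596 (99%)' into its three numbers."""
    m = CELL.fullmatch(cell.strip())
    if m is None:
        raise ValueError(f"unrecognized cell: {cell!r}")
    return MatchCell(*(int(g) for g in m.groups()))


def fraction(cell: MatchCell) -> float:
    """Return matched/aligned as a float between 0 and 1."""
    return cell.matched / cell.aligned


top_hit = parse_match_cell("591/596 (99%)")
print(top_hit, fraction(top_hit))
```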



In a BLAST search of public sequence databases, the NOV54a protein was found to
have homology to the proteins shown in the BLASTP data in Table 54D.

Table 54D. Public BLASTP Results for NOV54a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV54a 
Residues/ 
Match Residues 


Identities/
Similarities
for the
Matched
Portion


Expect 
Value 


P32455 


Interferon-induced guanylate-binding
protein 1 (Guanine nucleotide-binding
protein 1) - Homo sapiens (Human), 592
aa.


1..585 
1..586


508/587 

547/587 
(92%) 


0.0 


P32456 


Interferon-induced guanylate-binding 
protein 2 (Guanine nucleotide-binding

aa. 


1..583 
1..582 


441/583 
(75%) 

(86%) 


0.0 


Q9NV33 


CDNA FLJ10961 FIS, CLONE
PLACE1000588, HIGHLY SIMILAR 
TO INTERFERON-INDUCED 
GUANYLATE-BINDING PROTEIN 1 - 
Homo sapiens (Human), 447 aa 
(fragment). 


144..582 
10..447 


434/439 
(98%) 

435/439 
(98%) 


0.0 


Q01514 


Interferon-induced guanylate-binding 
protein 1 (Guanine nucleotide-binding
protein 1) (Interferon-gamma inducible 
protein MAG-1) - Mus musculus 

(Mouse), 589 aa. 


1..579
1..578 


392/579 
(67%) 

481/579 
(82%) 


0.0 


Q96PP8 


GUANYLATE BINDING PROTEIN 5 - 
Homo sapiens (Human), 586 aa. 


1..582 
1..577 


394/582 
(67%) 

469/582 
(79%) 


0.0 



PFam analysis predicts that the NOV54a protein contains the domains shown in
Table 54E.



Table 54E. Domain Analysis of NOV54a 


Pfam Domain 


NOV54a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


GBP 


6..280 


196/284 (69%) 
256/284 (90%) 


1.6e-192 


GBP_C 


282..579 


169/305(55%) 
255/305 (84%) 


2.5e-144 
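
The match regions in Table 54E are inclusive residue ranges, so their lengths and the share of the protein they cover follow directly. The sketch below is illustrative; the helper name `region_length` is hypothetical, and the 596 aa length is taken from Table 54A.

```python
def region_length(region: str) -> int:
    """Length of an inclusive residue range written as 'start..end'."""
    start_text, end_text = region.split("..")
    start, end = int(start_text), int(end_text)
    if end < start:
        raise ValueError(f"range {region!r} ends before it starts")
    return end - start + 1


protein_length = 596  # NOV54a, from Table 54A
pfam_regions = {"GBP": "6..280", "GBP_C": "282..579"}  # from Table 54E

lengths = {name: region_length(r) for name, r in pfam_regions.items()}
for name, n in lengths.items():
    print(f"{name}: {n} residues, {n / protein_length:.1%} of the protein")
```

Note that these are lengths on the NOV54a side only; the denominators in the Identities/Similarities column (284 and 305) are alignment lengths against the Pfam models and need not equal them.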
Example 55.

The NOV55 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 55A. 



Table 55A. NOV55 Sequence Analysis




SEQ ID NO: 163 


1184 bp 


NOV55a, 
CG97629-01 DNA Sequence 


TATCTAGAGATGCTTTAAATTCCTGAGGGGTTGGAGGACTTAGCTTACGTGCTAAGAT 

ATGCAAAACTGGACTTCTTCGGACAGGGACATTTTGCTACAGCCTATAAGGCCAGAGA 

TAAGAACACCAACCAAATCGTCACCATTAAGAAAATCAAACTTGGACACAGGTCAGAA 

GCTAAAGGTGGTATAAATAGAACAGCCTTAAGGGAGATAAAGCTCTTACAGAAACTAA 

GTCATCCGAACACAATTGGTCTCCTTGACACATTTGGACATAAGTCTAACATTAGCCT 

TGTCTTGGATGTTATGGAAACTGACCTAGAGGTTATAATAAAGGATAACAGTCTTGTG 

CTGACACCATCCCACATTCAAGCCTACATGTTGATGGCTTTTCAGGGCTTAGAATATT 

TGCATCAGCATTGGATTCTACACAGGGATCTGAAACCAAAGGACTTCTTGTTAGATAA 

GAATGGAGTTCTGAAACTGGCAGACTTTGTCCTGGTCAAATCATTTGGGAGCCCCAGC 

AGGACTTACACACATCAAGTTGTGACCAGATGGTATTGGGCCCCTGTGGAGTTATTGT 

TTGGAACTAGGATGTATGATGTGGGAATGGACATGTGGGCTGTTTGTTGTTTATTAGC 

AGAATTGCTTCTAAGGGTTCCTTTTTTGCCTGGAGAGTCAGACCTTGATCAGCTAACA 

AGAATATTTGAAACTTTGGGTACACCAACTGAAGAACAGTGGCCTGACATGTGTAGCC 

TTCCAGATTCTGTGACATTTAAGAGTTTCCCTGGGGTCCCACTACAACACATCTTCAC 

TGCAGCTGGGGACGACTTGCTGGATCTCATCCAAGGCTTGTTCTTCTTTAACCCGTGT 

ACTAGGATCACAGCCTCACAGGCATTGAAGACCAAGGACTTCAGAAACTGTCCAGGGC 

CAACACCTGGATGCCAGCTTCCAAGACCAAACTGTTCAGTGGAAGCATTAAAGGAACC 

GTCCAATCCAACCATGGCAACAAAACAGAAAAAGACAGAGGCCTTAGAACW, 

TTGCCCAAGAAGCTATGTTTTTAGTTCCAGTGAACAATGGACAGTTCCACTGCTGAAA 

GAAATGATCCAAAGGCAAATAATA 




ORF Start: ATG at 83 


ORF Stop: TAG at 1124 




SEQ ID NO: 164 


347 aa | MW at 39095.0kD


NOV55a, 
CG97629-01 Protein Sequence 


MAGDVKSQANRYAKLDFFGQGHFATAYKARDKNTNQIVTIKKIKLGHRSEAKGGINRT 
ALREIKLLQKLSHPNTIGLLDTFGHKSNISLVLDVMETDLEVIIKDNSLVLTPSHIQA 
YMLMAFQGLEYLHQHWILHRDLKPKDFLLDKNGVLKLADFVLVKSFGSPSRTYTHQVV
TRWYWAPVELLFGTRMYDVGMDMWAVCCLLAELLLRVPFLPGESDLDQLTRIFETLGT
PTEEQWPDMCSLPDSVTFKSFPGVPLQHIFTAAGDDLLDLIQGLFFFNPCTRITASQA 
LKTKDFRNCPGPTPGCQLPRPNCSVEALKEPSNPTMATKQKKTEALEQGILPKKLCF 



Further analysis of the NOV55a protein yielded the following properties shown in 
Table 55B. 



Table 55B. Protein Sequence Properties NOV55a 


PSort analysis: 


0.7000 probability located in plasma membrane; 0.5582 probability located in 
microbody (peroxisome); 0.2000 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV55a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 55C. 
Table 55C. Geneseq Results for NOV55a


Geneseq
Identifier


Protein/Organism/Length [Patent #,
Date]


NOV55a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB85505 


Human protein kinase SGK149 - 
Homo sapiens, 332 aa. 
[WO200155356-A2, 02-AUG-2001] 


23..347
11..332


268/325 (82%) 
285/325 (87%) 


e-153 


AAB53379 


Human colon cancer antigen protein 
sequence SEQ ID NO:919 - Homo
sapiens, 260 aa. [WO200055351-A1, 
21-SEP-2000] 


94..347 
8..260 


214/254 (84%) 
227/254 (89%) 


e-124 


AAG75087 


Human colon cancer antigen protein 
SEQ ID NO:5851 - Homo sapiens, 260
aa. [WO200122920-A2, 05-APR-2001] 


94..347
8..260 


214/254 (84%) 
227/254 (89%) 


e-124 


ABB59809 


Drosophila melanogaster polypeptide 
SEQ ID NO 6219 - Drosophila
melanogaster, 353 aa. [WO200171042-
A2, 27-SEP-2001] 


1..339 
1..342


201/344 (58%) 
252/344 (72%) 


e-113 


AAG14009


Arabidopsis thaliana protein fragment 

SEQ ID NO: 13712 - Arabidopsis 
thaliana, 348 aa. [EP1033405-A2, 06- 
SEP-2000] 


10..319 
11..316


151/310 (48%) 
209/310 (66%) 


2e-85 
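
The Expect column mixes three notations: `0.0`, ordinary scientific notation such as `2e-85`, and a bare exponent such as `e-153`. The sketch below converts all three to floats so that hits can be ranked. It assumes that a bare exponent means a leading coefficient of 1 (so `e-153` is read as `1e-153`), which is an interpretation rather than something the tables state; the function name `parse_expect` is hypothetical.

```python
def parse_expect(text: str) -> float:
    """Convert an Expect-column entry to a float.

    Assumption: a bare exponent such as 'e-153' stands for 1e-153.
    """
    value = text.strip().lower()
    if value.startswith("e"):
        value = "1" + value
    return float(value)


# Three rows from Table 55C: Geneseq identifier -> Expect value as printed.
table_55c_expect = {
    "AAB85505": "e-153",
    "AAB53379": "e-124",
    "AAG14009": "2e-85",
}

# Smaller Expect values indicate more significant matches.
ranked = sorted(table_55c_expect, key=lambda acc: parse_expect(table_55c_expect[acc]))
print(ranked)
```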



In a BLAST search of public sequence databases, the NOV55a protein was found to
have homology to the proteins shown in the BLASTP data in Table 55D.

Table 55D. Public BLASTP Results for NOV55a




Protein 
Accession 
Number 


Protein/Organism/Length 


NOV55a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q03147 


Cell division protein kinase 7 (EC 
2.7.1.-) (CDK-activating kinase) (CAK)
(TFIIH basal transcription factor
complex kinase subunit) (39 kDa
protein kinase) (P39 Mo15) (Protein-
tyrosine kinase MPK-7) (CR4 protein 
kinase) (CRK4) - Mus musculus 
(Mouse), 346 aa. 


1..347 
1..346 


298/347 (85%) 
313/347 (89%) 


e-173 




Cell division protein kinase 7 (EC 
2.7.1.-) (CDK-activating kinase) (CAK) 
(TFIIH basal transcription factor 
complex kinase subunit) (39 kDa 
protein kinase) (P39 Mo15) (STK1)
(CAK1) - Homo sapiens (Human), 346 
aa. 


1..347 
1..346


292/347 (84%) 
308/347 (88%) 


e-169 


S34652 


cell division cycle-2-related protein 
kinase (EC 2.7.1.-) CRK4 - mouse, 346
aa. 


1..347
1..346 


292/347 (84%) 
309/347 (88%) 


e-169 


P51952 


Cell division protein kinase 7 (EC 
2.7.1.-) (CDK-activating kinase) (CAK) 
(TFIIH basal transcription factor 
complex kinase subunit) (39 kDa protein
kinase) (P39 Mo15) - Rattus norvegicus
(Rat), 329 aa (fragment). 


9..338 
1..329 


281/330 (85%) 
295/330 (89%) 


e-163 


S51085 


CdK-activating kinase Cdk7 - rat, 312 
aa (fragment). 


9..320 
1..311


269/312 (86%) 
281/312 (89%) 


e-156 



PFam analysis predicts that the NOV55a protein contains the domains shown in
Table 55E.



Table 55E. Domain Analysis of NOV55a 


Pfam Domain 


NOV55a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


pkinase 


12..296 


99/301 (33%) 
217/301 (72%) 


1.4e-61 
Example 56.

The NOV56 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 56A. 



Table 56A. NOV56 Sequence Analysis 




SEQ ID NO: 165 


1821 bp


NOV56a, 
CG97648-01 DNA Sequence 


CGGGAAGGGAAAGCAGCCAGCAGCCCTCCAGCCCTCTTGTGCTTTCCCTGGGAGTGCG 
CCCCGTGCTCAGCCATGGTGGACATGGGGGCCCTGGACAACCTGATCG(-CA»<~A<-rGr 
CTACCTGCAGGCCCGGAAGCCCTCGGACTGCGACAGCAAAGAGCTGCAGCGGCGGCGG 
CGTAGCCTGGCCCTGCCCGGGCTGCAGGGCTGCGCGGAGCTCCGCCAGAAGCTGTCCC 
TGAACTTCCACAGCCTGTGTGAGCAGCAGCCCATCGGTCGCCGCCTCTTCCGTGACTT 

TGGGAGCTGGCCGAGGAGGGACCCACCAAAGACAGCGCGCTGCAGGGGCTGGTGGCCA 

CACCAAGTGCCAAGCAGCCACCACTGAGGAAGAGCGAGTGGCTGCAGTGACGCTGGCC 
AAGGCTGAGGCCATGGCTTTCTTGCAAGAGCAGCCCTTTAAGGATTTCGTGACCAGCG 
CCTTCTACGACAAGTTTCTGCAGTGGAAACTCTTCGAGATGCAACCAGTGTCAGACAA 
GTACTTCACTGAGTTCAGAGTGCTGGGGAAAGGTGGTTTTGGGGAGGTATGTGCCGTC 
CAGGTGAAAAACACTGGGAAGATGTATGCCTGTAAGAAACTGGACAAGAAGCGGCTGA 




GTCATGAGCCTGATGAATGGGGGAGACCTCAAGTTCCACATCTACAACGTGGGCACGC 
GTGGCCTGGACATGAGCCGGGTGATCTTTTACTCGGCCCAGATAGCCTGTGGGATGCT 
GCACCTCCATGAACTCGGCATCGTCTATCGGGACATGAAGCCTGAGAATGTGCTTCTG 
GATGACCTCGGCAACTGCAGGTTATCTGACCTGGGGCTGGCCGTGGAGATGAAGGGTG 
GCAAGCCCATCACCCAGAGGGCTGGAACCAATGGTTACATGGCTCCTGAGATCCTAAT 
GGAAAAGGTAAGTTATTCCTATCCTGTGGACTGGTTTGCCATGGGATGCAGCATTTAT 
GPAATGGTTGCTGGACGAACACCATTCAAAGATTACAAGGAAAAGGTCAGTAAAGAGG 
ATCTGAAGCAAAGAACTCTGCAAGACGAGGTCAAATTCCAGCATGATAACTTCACAGA 
GGAAGCAAAAGATATTTGCAGGCTCTTCTTGGCTAAGAAACCAGAGCAACGCTTAGGA 
AGCAGAGAAAAGTCTGATGATCCCAGGAAACATCATTTCTTTAAAACGATCAACTTTC 
CTCGCCTGGAAGCTGGCCTAATTGAACCCCCATTTGTGCCAGACCCTTCAGTGGTTTA
TGCCAAAGACATCGCTGAAATTGATGATTTCTCTGAGGTTCGGGGGGTGGAATTTGAT 
GACAAAGATAAGCAGTTCTTCAAAAACTTTGCGACAGGTGCTGTTCCTATAGCATGGC 
AGGAAGAAATTATAGAAACGGGACTGTTTGAGGAACTGAATGACCCCAACAGACCTAC 
GGGTTGTGAGGAGGGTAATTCATCCAAGTCTGGCGTGTGTTTGTTATTGTAAATTGCT 
CTCTTTACCAGACAGGCAGCAGGAGTCTCGGCTGACATAATCCTCGAATGTTCCACAC 
GTGGAAATCTGTGGAATGAGGGC 




ORF Start: ATG at 73 


ORF Stop: TAA at 1732 




SEQ ID NO: 166 


553 aa | MW at 62210.8kD


NOV56a, 
CG97648-01 Protein Sequence 


MVDMGALDNLIANTAYLQARKPSDCDSKELQRRRRSLALPGLQGCAELRQKLSLNFHS
LCEQQPIGRRLFRDFLATVPTFRKAATFLEDVQNWELAEEGPTKDSALQGLVATCASA
PAPGNPQPFLSQAVATKCQAATTEEERVAAVTLAKAEAMAFLQEQPFKDFVTSAFYDK
FLQWKLFEMQPVSDKYFTEFRVLGKGGFGEVCAVQVKNTGKMYACKKLDKKRLKKKGG
EKMALLEKEILEKVSSPFIVSLAYAFESKTHLCLVMSLMNGGDLKFHIYNVGTRGLDM
SRVIFYSAQIACGMLHLHELGIVYRDMKPENVLLDDLGNCRLSDLGLAVEMKGGKPIT
QRAGTNGYMAPEILMEKVSYSYPVDWFAMGCSIYEMVAGRTPFKDYKEKVSKEDLKQR
TLQDEVKFQHDNFTEEAKDICRLFLAKKPEQRLGSREKSDDPRKHHFFKTINFPRLEA
GLIEPPFVPDPSVVYAKDIAEIDDFSEVRGVEFDDKDKQFFKNFATGAVPIAWQEEII
ETGLFEELNDPNRPTGCEEGNSSKSGVCLLL 



Further analysis of the NOV56a protein yielded the following properties shown in
Table 56B.

Table 56B. Protein Sequence Properties NOV56a


PSort analysis: 


0.9685 probability located in nucleus; 0.1101 probability located in microbody
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV56a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 56C. 



Table 56C. Geneseq Results for NOV56a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV56a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAE16273


Human kinase PKIN-19 protein - 
Homo sapiens, 553 aa. 
[WO200196547-A2, 20-DEC-
2001]


1..553 
1..553 


553/553 (100%) 
553/553 (100%) 


0.0 


AAG77815 


Human G-protein coupled receptor 
kinase 1 protein - Homo sapiens, 
553 aa. [WO200168869-A2, 20- 
SEP-2001] 


1..553 
1..553 


553/553 (100%) 
553/553 (100%) 


0.0 


AAU03502 


Human protein kinase #2 - Homo 
sapiens, 553 aa. [WO200138503- 
A2, 31-MAY-2001]


1..553 
1..553 


551/553 (99%) 
551/553 (99%) 


0.0 


AAG77816 


Human G-protein coupled receptor 
kinase 2 protein - Homo sapiens, 
353 aa. [WO200168869-A2, 20- 
SEP-2001] 


1..350 
1..350 


350/350 (100%) 
350/350 (100%) 


0.0 


AAY24423 


GRK4 polymorphism GRK4-alpha 
protein sequence - Homo sapiens, 
578 aa. [WO9935279-A1, 15-JUL-
1999] 


7..531 

3..526 


253/528 (47%) 
348/528 (64%) 


e-140 



In a BLAST search of public sequence databases, the NOV56a protein was found to
have homology to the proteins shown in the BLASTP data in Table 56D.

Table 56D. Public BLASTP Results for NOV56a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV56a Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q8WTQ7 


G-PROTEIN-COUPLED 
RECEPTOR KINASE 7 (G 
PROTEIN-COUPLED 
RECEPTOR KINASE 7) - Homo 
sapiens (Human), 553 aa. 


1..553 
1..553 


553/553 (100%) 
553/553 (100%) 


0.0 


Q8WMV0 


RETINA G PROTEIN- 
COUPLED RECEPTOR 
KINASE 7 - Bos taurus (Bovine),
552 aa. 


1..553 
1..552 


475/553 (85%) 
508/553 (90%) 


0.0 


Q8WP15 


G PROTEIN-COUPLED 
RECEPTOR KINASE 7 - Sus 
scrofa (Pig), 553 aa.


1..553 
1..553 


470/553 (84%) 
509/553 (91%) 


0.0 


Q9Z2G7 


G PROTEIN-COUPLED 
RECEPTOR KINASE GRK7 - 
Spermophilus tridecemlineatus 
(Thirteen-lined ground squirrel), 
548 aa. 


2..553 
1..548 


468/552 (84%) 
507/552 (91%) 


0.0 


O73658


OLGRK-C - Oryzias latipes 
(Medaka fish), 557 aa. 


1..553 
1..557 


330/563 (58%) 
416/563 (73%) 


0.0 



PFam analysis predicts that the NOV56a protein contains the domains shown in
Table 56E.



Table 56E. Domain Analysis of NOV56a 


Pfam Domain 


NOV56a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


RGS 


55..78 


13/24 (54%) 
22/24 (92%) 


5.6e-07 


pkinase 


191..454


94/294 (32%) 
202/294 (69%) 


5.2e-70 


pkinase_C 


455..471


8/17 (47%)
13/17 (76%) 


0.3 



Example 57. 



The NOV57 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 57A.

Table 57A. NOV57 Sequence Analysis




SEQ ID NO: 167 | 1722 bp


NOV57a, 
CG97658-01 DNA Sequence 


GCATGGTGGCACCATCTACCACAGGTCTCCCCCAGCCGATAGTGATGGAGGCACTGGA
CGAGGCTGAAGGGCTCCAGGACTCACAGAGAGAGATGCCGCCACCCCCTCCTCCCTCG 
CCGCCCTCAGATCCAGCTCAGAAGCCACCACCTCGAGGCGCTGGGAGCCACTCCCTCA 




GCTCTGGTTCAGCGGTTATGGCCACATGTGGTCACAGAACGCCACAAACCTCGTCTCC
TCTTTGCTGACGCTCCTGAAACAGCTGGAACCCACGTCCTGGCTTGACTCTGGGACGT 




CCTGGTGTGGCACCTCCTGAGGACACCCCCAGAGCCACCCACCCCACTGCCCCCTGAG 
GACAGGCGCCAGTCAGTGAGCCGCCAGCCCTCCTTCACCTACTCAGAGTGGATGGAGG 

TTGTGTGATGGACATCAAGCCTGAGGCTGACCCCACCTCACTCACCGTCAAGTCCATG 
GGTCTGCAGGAGAGGAGGGGTTCCAATGTCTCCCTGACCCTGGACATGTGCACTCCGG 
GCTGCAACGAGGAGGGCTTTGGCTATCTCATGTCCCCACGTGAGGAGTCCGCCCGCGA 
GTACCTGCTCAGCGCCTCCCGTGTCCTCCAAGCAGAAGAGCTTCATGAAAAGGCCCTG 

AAGAGTACGACATCCCTGGGAGGTGTCGGAAGAACCGGTACAAAACCATACTTCCCAA 
CCCTCACAGCAGAGTGTGTCTGACCTCACCAGACCCTGACGACCCTCTGAGTTCCTAC 




AGGGACCCATCGTCAGCACGGTCGCCGACTTCTGGCGCATGGTGTGGCAGGAGCACAC 
GCCCATCATTGTCATGATCACCAACATCGAGGAGATGAACGAGAAATGCACCGAGTAT 
TGGCCGGAGGAGCAGGTGGCGTACGACGGTGTTGAGATCACTGTGCAGAAAGTCATTC 
ACACGGAGGATTACCGGCTGCGACTCATCTCCCTCAAGAGTGGGACTGAGGAGCGAGG 
CCTGAAGCATTACTGGTTCACATCCTGGCCCGACCAGAAGACCCCAGACCGGGCCCCC 
CCACTCCTGCACCTGGTGCGGGAGGTGGAGGAGGCAGCCCAGCAGGAGGGTCCCCACT 
GTGCCCCCATCATCGTCCACTGCAGTGCAGGGATTGGGAGGACCGGCTGCTTCATTGC 
CACCAGCATCTGCTGCCAGCAGCTGCGGCAGGAGGGTGTAGTGGACATCCTGAAGACC 
ACGTGCCAGCTCCGTCAGGACAGGGGCGGCATGATCCAGCACTGCGAGCAGTACCAGT 
TTGTGCACCACGTCATGAGCCTCTACGAAAAGCAGCTGTCCCACCAGTCCCCAGAATG 
ACTGCGCTTCTCCTACAAGGTTCTCTGGGCACTGCCAGCCTGAGTCTCGGCCCTCACC 
CAGGGCCCTGCCTCGGGTCGGAATTCGATATCAAGCTTAT 




ORF Start: ATG at 3 


ORF Stop: TGA at 1623




SEQ ID NO: 168


540 aa | MW at 60799.6kD 


NOV57a, 
CG97658-01 Protein Sequence 


MVAPSTTGLPQPIVMEALDEAEGLQDSQREMPPPPPPSPPSDPAQKPPPRGAGSHSLT 
VRSSLCLFAASQFLLACGVLWFSGYGHMWSQNATNLVSSLLTLLKQLEPTSWLDSGTW
GVPGLLLVFLSVGLVLVTTLVWHLLRTPPEPPTPLPPEDRRQSVSRQPSFTYSEWMEE 
KIEDDFLDLDPVPETPVFDCVMDIKPEADPTSLTVKSMGLQERRGSNVSLTLDMCTPG 
CNEEGFGYLMSPREESAREYLLSASRVLQAEELHEKALDPFLLQAEFFEIPMNFVVPK 
EYDIPGRCRKNRYKTILPNPHSRVCLTSPDPDDPLSSYINANYIRGYGGEEKVYIATQ 
GPIVSTVADFWRMVWQEHTPIIVMITNIEEMNEKCTEYWPEEQVAYDGVEITVQKVIH
TEDYRLRLISLKSGTEERGLKHYWFTSWPDQKTPDRAPPLLHLVREVEEAAQQEGPHC
APIIVHCSAGIGRTGCFIATSICCQQLRQEGVVDILKTTCQLRQDRGGMIQHCEQYQF
VHHVMSLYEKQLSHQSPE 



Further analysis of the NOV57a protein yielded the following properties shown in 
Table 57B. 



Table 57B. Protein Sequence Properties NOV57a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.1000 
probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 
A search of the NOV57a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 57C. 



Table 57C. Geneseq Results for NOV57a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV57a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAG67630 


Amino acid sequence of a human 
protein - Homo sapiens, 537 aa. 
[WO200109316-A1, 08-FEB-
2001]


8..540 
5..537


532/533 (99%) 
532/533 (99%) 


0.0 


AAG67451 


Amino acid sequence of a human 
polypeptide - Homo sapiens, 537 
aa. [WO200109345-A1, 08-FEB- 
2001] 


8..540 
5..537


532/533 (99%) 
532/533 (99%) 


0.0 


AAM39481 


Human polypeptide SEQ ID NO 
2626 - Homo sapiens, 565 aa. 
[WO200153312-A1, 26-JUL-
2001] 


8..540 
33..565


527/533 (98%) 
529/533 (98%) 


0.0 


AAM41267


Human polypeptide SEQ ID NO 
6198 - Homo sapiens, 600 aa. 
[WO200153312-A1, 26-JUL-
2001] 


8..540 
68..600 


526/533 (98%) 
528/533 (98%) 


0.0 


AAE14457 


Human protein phosphatase-7 - 

Homo sapiens, 541 aa. 
[WO200196546-A2, 20-DEC- 
2001] 


8..540 
9..541 


526/533 (98%) 
528/533 (98%) 


0.0 



In a BLAST search of public sequence databases, the NOV57a protein was found to
have homology to the proteins shown in the BLASTP data in Table 57D.

Table 57D. Public BLASTP Results for NOV57a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV57a 
Residues/ 
Match Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


P54829 


Protein-tyrosine phosphatase, non- 
receptor type 5 (EC 3.1.3.48) 
(Protein-tyrosine phosphatase 

striatum-enriched) (STEP) (Neural- 
specific protein-tyrosine 
phosphatase) - Homo sapiens 
(Human), 537 aa (fragment). 


8..540 
5..537 


532/533 (99%) 
532/533 (99%) 


0.0 


P54830 


Protein-tyrosine phosphatase, non- 
receptor type 5 (EC 3.1.3.48)
(Protein-tyrosine phosphatase 
striatum-enriched) (STEP) (Neural- 
specific protein-tyrosine 
phosphatase) - Mus musculus 
(Mouse), 541 aa. 


8..540 
9..541 


474/533 (88%) 
495/533 (91%) 


0.0 


P35234 


Protein-tyrosine phosphatase, non- 
receptor type 5 (EC 3.1.3.48) 
(Protein-tyrosine phosphatase 
striatum-enriched) (STEP) (Neural- 
specific protein-tyrosine 
phosphatase) - Rattus norvegicus 
(Rat), 369 aa. 


172..540 
1..369 


346/369 (93%) 
354/369 (95%) 


0.0 


Q9BE09 


HYPOTHETICAL 42.1 KDA
PROTEIN - Macaca fascicularis

(Crab eating macaque) 
(Cynomolgus monkey), 370 aa. 


128..440 
1..313 


304/313 (97%) 
306/313 (97%) 


0.0 


JC4263 


protein-tyrosine-phosphatase (EC 
3.1.3.48), receptor type PCPTP1 
precursor - rat, 656 aa. 


110..538 
222..654 


220/442 (49%) 
286/442 (63%) 


e-114 



PFam analysis predicts that the NOV57a protein contains the domains shown in
Table 57E.



Table 57E. Domain Analysis of NOV57a 


Pfam Domain 


NOV57a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Y_phosphatase 


297..529 


105/278 (38%)
197/278 (71%)


2.3e-103 
Example 58.

The NOV58 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 58A. 



Table 58A. NOV58 Sequence Analysis 




SEQ ID NO: 169 | 2937 bp


NOV58a, 
CG97842-01 DNA Sequence 


ATCAGTTCTCCACTATCCTTCTGTTTTTCTAGGTAACTATAACTACCCAATATTGCAG 
CCATGGAGTCCATGCTTAATAAATTGAAGAGTACTGTTACAAAAGTAACAGCTGATGT 
CACTAGTGCTGTAATGGGAAATCCTGTCACTAGAGAATTTGATGTTGGTCGACACATT 
GCCAGTGGTGGCAATGGGCTAGCTTGGAAGATTTTTAATGGCACAAAAAAGTCAACAA 
AGCAGGAAGTGGCAGTTTTTGTCTTTGATAAAAAACTGATTGACAAGTATCAAAAATT
TGAAAAGGATCAAATCATTGATTCTCTAAAACGAGGAGTCCAACAGTTAACTCGGCTT 
CGACACCCTCGACTTCTTACTGTCCAGCATCCTTTAGAAGAATCCAGGGATTGCTTGG 
CATTTTGTACAGAACCAGTTTTTGCCAGTTTAGCCAATGTTCTTGGTAACTGGGAAAA 
TCTACCTTCCCCTATATCTCCAGACATTAAGGATTATAAACTTTATGATGTAGAAACC 
AAATATGGTTTGCTTCAGGTTTCTGAAGGATTGTCATTCTTGCATAGCAGTGTGAAAA 
TGGTGCATGGAAATATCACTCCTGAAAATATAATTTTGAATAAAAGTGGAGCCTGGAA 
AATAATGGGTTTTGATTTTTGTGTATCATCAACCAATCCTTCTGAACAAGAGCCTAAA 




ATTTGGCTCCTGAATACATACTTTCTGTGAGCTGTGAAACAGCCAGTGATATGTATTC 
TTTAGGAACTGTTATGTATGCTGTATTTAATAAAGGGAAACCTATATTTGAAGTCAAC 
AAGCAAGATATTTACAAGAGTTTCAGTAGGCAGTTGGATCAGTTGAGTCGTTTAGGAT 
CTAGTTCACTTACAAATATACCTGAGGAAGTTCGTGAACATGTAAAGCTACTGTTAAA 
TGTAACTCCGACTGTAAGACCAGATGCAGATCAAATGACAAAGATTCCCTTCTTTGAT 
GATGTTGGTGCAGTAACACTGCAATATTTTGATACCTTATTCCAAAGAGATAATCTTC 
AGAAATCACAGTTTTTCAAAGGACTGCCAAAGGTTCTACCAAAACTGCCCAAGCGTGT 
CATTGTGCAGAGAATTTTGCCTTGTTTGACTTCAGAATTTGTAAACCCTGACATGGTA 
CCTTTTGTTTTGCCCAATGTTCTACTTATTGCTGAGGAATGCACCAAAGAAGAATATG 




TTTGTTAATTTTCCTACAAAAAATGGATTTGCTACTAACCAAAACCCCTCCTGATGAG 
ATAAAGAACAGTGTTCTACCCATGGTTTACAGAGCACTAGAAGCTCCTTCCATTCAGA 
TCCAGGAGCTCTGTCTAAACATCATTCCAACCTTTGCAAATCTTATAGACTACCCATC 

GCGGTTCGTGTAAATTCATTAGTGTGCTTAGGAAAGATTTTGGAATACTTGGATAAGT 
GGTTTGTACTTGATGATATCCTACCCTTCTTACAACAAATTCCATCCAAGGAACCTGC 
GGTCCTCATGGGAATTTTAGGTATTTACAAATGTACTTTTACTCATAAGAAGTTGGGA 
ATCACCAAAGAGCAGCTGGCCGGAAAAGTGTTGCCTCATCTTATTCCCCTGAGTATTG 
AAAACAATCTTAATCTTAATCAGTTCAATTCTTTCATTTCCGTCATAAAAGAAATGCT 
TAATAGATTGGAGTCTGAACATAAGACTAAACTGGAGCAACTTCATATAATGCAAGAA 
CAGCAGAAATCTTTGGATATAGGAAATCAAATGAATGTTTCTGAGGAGATGAAAGTTA 
CAAATATTGGGAATCAGCAAATTGACAAAGTTTTTAACAACATTGGAGCAGACCTTCT 
GACTGGCAGTGAGTCCGAAAATAAAGAGGACGGGTTACAGAATAAACATAAAAGAGCA 
TCACTTACACTTGAAGAAAAACAAAAATTAGCAAAAGAACAAGAGCAGGCACAGAAGC 
TGAAAAGCCAGCAGCCTCTTAAACCCCAAGTGCACACACCTGTTGCTACTGTTAAACA 
GACTAAGGACTTGACAGACACACTGATGGATAATATGTCATCCTTGACCAGCCTTTCT 
GTTAGTACCCCTAAATCTTCTGCTTCAAGTACTTTCACTTCTGTTCCTTCCATGGGCA 
TTGGTATGATGTTTTCTACACCAACTGATAATACAAAGAGAAATTTGACAAATGGCCT 
AAATGCCAATATGGGCTTTCAGACTTCAGGATTCAACATGCCCGTTAATACAAACCAG 
AACTTCTACAGTAGTCCAAGCACAGTTGGAGTGACCAAGATGACTCTGGGAACACCTC 
CCACTTTGCCAAACTTCAATGCTTTGAGTGTTCCTCCTGCTGGTGCAAAGCAGACCCA 
ACAAAGACCCACAGATATGTCTGCCCTTAATAATCTCTTTGGCCCTCAGAAACCCAAA 
GTTAGCATGAACCAGTTATCACAACAGAAACCAAATCAGTGGCTTAATCAGTTTGTAC 
CTCCTCAAGGTTCTCCAACTATGGGCAGTTCAGTAATGGGGACACAGATGAACGTGAT 
AGGACAATCTGCTTTTGGTATGCAGGGTAATCCTTTCTTTAACCCACAGAACTTTGCA 
CAGCCACCAACTACTATGACCAATAGCAGTTCAGCTAGCAATGATTTAAAAGATCTTT 
TTGGGTGAGGTGTCTTACTTCTATTTTGAAGGATTATTTCAGTTTCAATCATGGGTGA 
GCTGATTTACATC1 r TAT AT AG rGGC rTGGAGGAAG 
ORF Start: ATG at 61


ORF Stop: TGA at 2848




SEQ ID NO: 170


929 aa | MW at 103707.8kD


NOV58a, 
CG97842-01 Protein Sequence 


MESMLNKLKSTVTKVTADVTSAVMGNPVTREFDVGRHIASGGNGLAWKIFNGTKKSTK 
QEVAVFVFDKKLIDKYQKFEKDQIIDSLKRGVQQLTRLRHPRLLTVQHPLEESRDCLA
FCTEPVFASLANVLGNWENLPSPISPDIKDYKLYDVETKYGLLQVSEGLSFLHSSVKM
VHGNITPENIILNKSGAWKIMGFDFCVSSTNPSEQEPKFPCKEWDPNLPSLCLPNPEY
LAPEYILSVSCETASDMYSLGTVMYAVFNKGKPIFEVNKQDIYKSFSRQLDQLSRLGS
SSLTNIPEEVREHVKLLLNVTPTVRPDADQMTKIPFFDDVGAVTLQYFDTLFQRDNLQ 
KSQFFKGLPKVLPKLPKRVIVQRILPCLTSEFVNPDMVPFVLPNVLLIAEECTKEEYV 
KLILPELGPVFKQQEPIQILLIFLQKMDLLLTKTPPDEIKNSVLPMVYRALEAPSIQI 
QELCLNIIPTFANLIDYPSMKNALIPRIKNACLQTSSLAVRVNSLVCLGKILEYLDKW 
FVLDDILPFLQQIPSKEPAVLMGILGIYKCTFTHKKLGITKEQLAGKVLPHLIPLSIE 
NNLNLNQFNSFISVIKEMLNRLESEHKTKLEQLHIMQEQQKSLDIGNQMNVSEEMKVT 
NIGNQQIDKVFNNIGADLLTGSESENKEDGLQNKHKRASLTLEEKQKLAKEQEQAQKL 
KSQQPLKPQVHTPVATVKQTKDLTDTLMDNMSSLTSLSVSTPKSSASSTFTSVPSMGI 
GMMFSTPTDNTKRNLTNGLNANMGFQTSGFNMPVNTNQNFYSSPSTVGVTKMTLGTPP
TLPNFNALSVPPAGAKQTQQRPTDMSALNNLFGPQKPKVSMNQLSQQKPNQWLNQFVP 
PQGSPTMGSSVMGTQMNVIGQSAFGMQGNPFFNPQNFAQPPTTMTNSSSASNDLKDLF 
G 



Further analysis of the NOV58a protein yielded the following properties shown in
Table 58B. 



Table 58B. Protein Sequence Properties NOV58a 


PSort analysis: 


0.4564 probability located in mitochondrial matrix space; 0.3000 probability located 
in microbody (peroxisome); 0.3000 probability located in nucleus; 0.1507
probability located in mitochondrial inner membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV58a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 58C. 
Table 58C. Geneseq Results for NOV58a


Geneseq 
Identifier 


Protein/Organism/Length [Patent #,
Date]


NOV58a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB85782


Human kinase PKIN-1 - Homo
sapiens, 792 aa. [WO200160991-A2,
23-AUG-2001]


153..929 


771/781 (98%) 
772/781 (98%) 


0.0 


AAG67427 


Amino acid sequence of a human 
protein kinase/protein phosphatase - 

Homo sapiens, 756 aa. 
[WO200109345-A1, 08-FEB-2001]


174..929 
1..756 


755/756 (99%) 
755/756 (99%) 


0.0 


AAB95177 


Human protein sequence SEQ ID 
NO:17239 - Homo sapiens, 756 aa.
[EP1074617-A2, 07-FEB-2001]


174..929 
1..756 


755/756 (99%) 
755/756 (99%) 


0.0 


AAB95772 


Human protein sequence SEQ ID 
NO:18711 - Homo sapiens, 735 aa.
[EP1074617-A2, 07-FEB-2001]


195..929 
1..735 


732/735 (99%)
734/735 (99%) 


0.0 


AAB65678 


Novel protein kinase, SEQ ID NO: 

206 - Homo sapiens, 505 aa. 
[WO200073469-A2, 07-DEC-2000] 


1..496 
1..496 


494/496 (99%) 
494/496 (99%) 


0.0 



In a BLAST search of public sequence databases, the NOV58a protein was found to
have homology to the proteins shown in the BLASTP data in Table 58D.

Table 58D. Public BLASTP Results for NOV58a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV58a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9P2I7 


KIAA1360 PROTEIN - Homo sapiens 
(Human), 796 aa (fragment). 


138..929 
1..796 


788/796 (98%) 
789/796 (98%) 


0.0 


Q96ST4 


CDNA FLJ14645 FIS, CLONE
NT2RP2001839, WEAKLY SIMILAR 
TO SCY1 PROTEIN - Homo sapiens 
(Human), 756 aa. 


174..929
1..756 


755/756 (99%) 
755/756 (99%) 


0.0 


Q9H7V5 


CDNA FLJ14212 FIS, CLONE
NT2RP3003500, WEAKLY SIMILAR 
TO SCY1 PROTEIN - Homo sapiens 
(Human), 735 aa. 


195..929 
1..735 


732/735 (99%) 
734/735 (99%) 


0.0 


Q9NVH3 


CDNA FLJ10735 FIS, CLONE
NT2RP3001407, WEAKLY SIMILAR
TO SCY1 PROTEIN - Homo sapiens
(Human), 446 aa. 


484..929 
1..446 


445/446 (99%) 
445/446 (99%) 


0.0 


Q96EF4 


UNKNOWN (PROTEIN FOR 
MGC:8814) - Homo sapiens (Human),
446 aa. 


484..929 
1..446 


444/446 (99%) 
444/446 (99%) 


0.0 



PFam analysis predicts that the NOV58a protein contains the domains shown in 
the Table 58E. 



Table 58E. Domain Analysis of NOV58a 


Pfam Domain 


NOV58a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


pkinase 


159..327 


44/203 (22%) 
116/203 (57%)


3e-07 



Example 59. 



The NOV59 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 59A. 
Table 59A. NOV59 Sequence Analysis




SEQ ID NO: 171 


2933 bp


NOV59a, 
CG98021-01 DNA Sequence


ACTCTATGTCTCCTCTCGTTGGATTGTGACACCGGGAGGTCAGGGAACTCCAGGACCT 
TGTTCTCTGCTGQATTCGCAGCAACCAGCACAGCACGTAGGGCGTAGTTGGTGCTGGA 
TGGATGTTTGTTGAATGAATGAATGATGAATGGCTGGCACCTTGTCTGCTCATCCCTA 
ACTCCTGTTCCTTCATCTGTGCAGCCCTAATCTTTGTTTCCTCATCTGTCCATCCCTT 
TATTTGTGCATCCTCATTCTTAGCCCCTTCACTGCCCTTCTCCATCTCTTCCTCCTTG 
TTCATTTGTCCCTGTTCTCTGTCCTCTACTCCACTCATGCCCATCTCTGTCCCCTTGA 
CTTACCCAGTCCCTGCTACTATCTCCATCCCTAATTTCTGCCCTCTTGTCTGTCTACT 
CCTAATTCCTTTTCCTTGTCCATCCCTAATACCTGTCACCTTGTCCTTCTTCCTCGAA 
TCTCCATCCCTAATCCATCTGCCCCTAATCTCTGTCCCCTTTGCCCATCCTTCCTTTT 
CTCGGTGTCTCTTTCCACCCTTATCTCCACACCTGCCCACCCTGCACTCCCATTCTGT 
TTCCCATCTGCACCCTTGCCCCATCCCTCCCACACACAGGACCAGACGGCCACCATGT 
CAGGAGACTACGAGGATGACCTCTGCCGGCGGGCACTCATCCTGGTCTCGGACCTCTG 
TGCGCGGGTCCGAGATGCTGACACCAACGACAGGTGCCAGGAGTTCAATGACCGAATC 




CTGGGTGCCCTGGCGGGACAAGGGAGGCTCGGCAGTGGGCGGTGGCCCCCTGCGCAAA 
GACCTAGGCCCTGGTGTCGGGCTGGCAGGCCTGGTAGGCGGAGGCGGGCACCACCTGG 
CGGCTGGCCTGGGTGGCCATCCTCTGCTGGGCGGCCCACACCACCATGCCCATGCCGC 
CCACCATCCACCCTTTGCTGAGCTGCTGGAGCCAGGCAGCCTGGGGGGTTCTGACACC 
CCTGAGCCCTCCTACTTGGACATGGACTCGTATCCAGAGGCTGCAGCAGCAGCAGTGG 
CCGCTGGGGTCAAACCGAGCCAAACATCCCCTGAGCTGCCCTCTGAGGGGGGAGCAGG 
CTCTGGGTTGCTCCTGCTGCCCCCCAGTGGTGGGGGCTTGCCCAGTGCCCAGTCACAT 
CAGCAGGTCACAAGCCTGGCACCCACTACCAGGTACCCAGCCCTGCCCCGACCCCTCA 
CCCAGCAGACTCTGACCTCCCAGCCGGACCCCAGCAGTGAGGAGCGCCCACCTGCCCT 




GAGAGGCAGGCACAGGGGCACCCTGTGGCCGTATCAGCTTCGCCCTGCGGTACCTCTA 
TGGCTCGGACCAGCTGGTGGTGAGGATCCTGCAGGCCCTGGACCTCCCTGCCAAGGAC 
TCCAACGGCTTCTCAGACCCCTACGTCAAGATCTACCTGCTGCCTGACCGCAAGAAAA 
AGTTTCAGACCAAGGTGCACAGGAAGACCCTGAACCCCGTCTTCAATGAGACGTTTCA 
ATTCTCGGTGCCCCTGGCCGAGCTGGCCCAACGCAAACTGCACTTCAGCGTCTATGAC 
TTTGACCGCTTCTCGCGGCACGACCTCATCGGCCAGGTGGTGCTGGACAACCTCCTGG 
AGCTGGCCGAGCAGCCCCCTGACCGCCCGCTCTGGAGGGACATCGTGGAGGGCGGCTC 
GGAAAAAGCAGATCTTGGGGAGCTCAACTTCTCACTCTGCTACCTCCCCACGGCCGGG 
CGCCTCACCGTGACCATCATCAAAGCCTCTAACCTCAAAGCGATGGACCTCACTGGCT 
TCTCAGACCCCTACGTGAAGGCCTCCCTGATCAGCGAGGGGCGGCGTCTGAAGAAGCG 
GAAAACCTCCATCAAGAAGAACACGCTGAACCCCACCTATAATGAGGCGCTGGTGTTC 
GACGTGGCCCCCGAGAGCGTGGAGAACGTGGGGCTCAGCATCGCCGTGGTGGACTACG 
ACTGCATCGGGCACAACGAGGTGATCGGCGTGTGCCGTGTGGGCCCCGACGCTGCCGA 
CCCGCACGGCCGCGAGCACTGGGCAGAGATGCTGGCCAATCCCCGCAAGCCCGTGGAG 
CACTGGCATCAGCTAGTGGAGGAAAAGACTGTGACCAGCTTCACAAAAGGCAGCAAAG 
GACTATCAGAGAAAGAGAACTCCGAGTGAGGGGTCTGGCCTAGGCCCGGGATCGGACC 
AGGCTCCCTCAGGACCCCATCCTTTCCTGCCCGGACCGTGAATTCATCTCCTTGAAGC 
CATAACGTCCGAGCTGCTGGTGCGGGGCAGCCCTGGCCCTAGGCTTCCTAACCCTGGA 
AGCGAGAGGATGAGAGGAGGCCGGCCCAGCTCCTTCTTTCAGGGTGGGGGTCATTCAG 
CCTCCACTGTGTCTGTCTTTTCTTCCCTGGGGCTCCCCCTCGAGGCGAGGGGCCATGC 
ATGTCTGGGGGACCCCTGCCCCCCAAAACCCTCTGTCTGTCTCTGTCTCTTTGCTGTT 
TGTCCAAGACTCAGTGTCCCGACCCTTGTTCTCGCCGTGAATGTCAATGGGCCAATCC 




GCGTCTGGCCGGTCCCCCCACTGCTGCTGCTATCAACGCCAGAATAAACACACTCTGT 
GGGTCTCACTCCAAAAAAAAAAAAAAAAAAAGC 




ORF Start: ATG at 635 


ORF Stop: TGA at 2405 




SEQ ID NO: 172 


590 aa | MW at 63303.0kD


NOV59a, 
CG98021-01 Protein Sequence


MSGDYEDDLCRRALILVSDLCARVRDADTNDRCQEFNDRIRGYPRGPDADISVSLLSV 
IVTFCGIVLLGVSLFVSWPCLCWVPWRDKGGSAVGGGPLRKDLGPGVGLAGLVGGGGHH
LAAGLGGHPLLGGPHHHAHAAHHPPFAELLEPGSLGGSDTPEPSYLDMDSYPEAAAAA
VAAGVKPSQTSPELPSEGGAGSGLLLLPPSGGGLPSAQSHQQVTSLAPTTRYPALPRP 
LTQQTLTSQPDPSSEERPPALPLPLPGGEEKAKLIGQIKPELYQGTGPGGRRSGGGPG 
SGEAGTGAPCGRISFALRYLYGSDQLVVRILQALDLPAKDSNGFSDPYVKIYLLPDRK
KKFQTKVHRKTLNPVFNETFQFSVPLAELAQRKLHFSVYDFDRFSRHDLIGQVVLDNL
LELAEQPPDRPLWRDIVEGGSEKADLGELNFSLCYLPTAGRLTVTIIKASNLKAMDLT 
GFSDPYVKASLISEGRRLKKRKTSIKKNTLNPTYNEALVFDVAPESVENVGLSIAVVD
YDCIGHNEVIGVCRVGPDAADPHGREHWAEMLANPRKPVEHWHQLVEEKTVTSFTKGS 
KGLSEKENSE 
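
The relationship between the NOV59a DNA sequence and the encoded polypeptide can be illustrated by translating short excerpts of the open reading frame with the standard genetic code. The following Python sketch is illustrative only; the names `CODON_TABLE` and `translate` are hypothetical, and the two excerpts are assumed to be in frame, having been read from the DNA sequence above at the stated ORF start (ATG at 635) and immediately upstream of the stated stop codon (TGA at 2405).

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


orf_start_excerpt = "ATGTCAGGAGACTACGAGGATGACCTCTGCCGGCGG"
orf_end_excerpt = "GAGAACTCCGAGTGA"

print(translate(orf_start_excerpt))
print(translate(orf_end_excerpt))
```

The two excerpts translate to the first twelve residues and the last four residues of the NOV59a protein sequence shown above.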
Further analysis of the NOV59a protein yielded the following properties shown in
Table 59B. 



Table 59B. Protein Sequence Properties NOV59a 



PSort analysis: 


0.6000 probability located in endoplasmic reticulum (membrane); 0.4600 
probability located in plasma membrane; 0.3128 probability located in microbody
(peroxisome); 0.3000 probability located in nucleus 


SignalP analysis: 


Cleavage site between residues 28 and 29 



A search of the NOV59a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 59C. 






Table 59C. Geneseq Results for NOV59a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV59a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU19714 


Human novel extracellular matrix 
protein, Seq ID No 364 - Homo sapiens, 
295 aa. [WO200155368-A1, 02-AUG- 
2001] 


291..571 
1..281


278/281 (98%) 
278/281 (98%) 


e-161 


AAU19715 


Human novel extracellular matrix 
protein, Seq ID No 365 - Homo sapiens, 
461 aa. [WO200155368-A1, 02-AUG-
2001] 


77..583
33..461 


235/509 (46%) 
304/509 (59%) 


e-116 


AAU19847


Human novel extracellular matrix 
protein, Seq ID No 497 - Homo sapiens, 
208 aa. [WO200155368-A1, 02-AUG-
2001] 


309..479 
19..189


161/171 (94%) 
162/171 (94%) 


9e-88 


AAR97722 


Mouse inositol polyphosphate binding 
protein IP4-BP - Mus musculus, 422 aa. 
[JP08092290-A, 09-APR-1996] 


288..581 
130..422 


148/297 (49%) 
203/297 (67%) 


4e-76 


ABB59660 


Drosophila melanogaster polypeptide 

SEQ ID NO 5772 - Drosophila 
melanogaster, 474 aa.
[WO200171042-A2, 27-SEP-2001]


301..581 
194..474 


148/287 (51%) 
200/287 (69%) 


1e-75



In a BLAST search of public sequence databases, the NOV59a protein was found to
have homology to the proteins shown in the BLASTP data in Table 59D. 
Table 59D. Public BLASTP Results for NOV59a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV59a 
Residues/ 
Match Residues 


Identities/
Similarities for
the Matched 
Portion 


Expect 
Value 


Q9BQG1 


Synaptotagmin III (SytIII) - Homo
sapiens (Human), 590 aa. 


1..590 
1..590 


590/590 
(100%) 
590/590 
(100%) 


0.0 


Q925B7 


SYNAPTOTAGMIN 3 - Rattus 
norvegicus (Rat), 588 aa. 


1..590 
1..588 


566/592 (95%) 
576/592 (96%) 


0.0 


P40748 


Synaptotagmin III (SytIII) - Rattus
norvegicus (Rat), 588 aa. 


1..590 
1..588 


564/592 (95%) 
575/592 (96%) 


0.0 


O35681


Synaptotagmin III (SytIII) - Mus
musculus (Mouse), 587 aa. 


1..590 
1..587 


561/592 (94%) 
572/592 (95%) 


0.0 


P24507 


Synaptotagmin C (Synaptic vesicle 
protein O-P65-C) - Discopyge
ommata (Electric ray), 537 aa. 


1..576 
1..512


342/578 (59%) 
405/578 (69%) 


0.0 



PFam analysis predicts that the NOV59a protein contains the domains shown in 
the Table 59E. 



Table 59E. Domain Analysis of NOV59a 


Pfam Domain 


NOV59a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


C2 


316..402 


56/97 (58%) 
82/97 (85%) 


1.4e-44


C2 


448..536 


41/97 (42%)
79/97 (81%)


1.5e-38 



Example 60. 



The NOV60 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 60A. 
Table 60A. NOV60 Sequence Analysis




SEQ ID NO: 173 | 2133 bp


NOV60a, 
CG98030-01 DNA Sequence 



CGTTTACGGCATGTCRTTCCTGGACACATGGCATGTTCCATGGCGTGTGGCGGTAGAG
CTTGCAAGTATGAGAACCCAGCCCGCTGGAGTGAGCAGGAGCAAGCCATTAAGGGGGT 
TTACTCATCCTGGGTCACTGATAATATACTGGCCATGGCCCGCCCATCCTCTGAGCTC 
CTGGAGAAGTACCACATCATTGATCAGTTCCTCAGCCATGGCATAAAAACAATAATCA 
ACCTCCAGCGCCCTGGTGAGCATGCTAGCTGTGGGAACCCTCTGGAACAAGAAAGTGG 
CTTCACATACCTTCCTGAGGCTTTCATGGAGGCTGGCATTTACTTCTACAATTTCGGA 
TGGAAGGATTATGGTGTAGCGTCTCTTACTACTATCCTAGATATGGTGAAGGTGATGA 
CATTTGCCTTACAGGAAGGAAAAGTAGCTATCCATTGTCATGCAGGGCTTGGTCGAAC 
AGGTGTTTTAATAGCCTGTTACTTAGTTTTTGCAACGAGAATGACTGCTGACCAAGCA 
ATTATATTTGTGCGGGCAAAGCGACCCAATTCCATACAAACCAAGAGACAGCTCCTCT 
GTGTAAGGGAATTTACTCAGTTTCTAACTCCTCTCCGCAATATATTCTCTTGCTGTGA 
TCCCAAAGCACATGCTGTCACCTTACCTCAATATCTAATTCGCCAGCGTCATCTGCTT 
CATGGTTATGAGGCACGACTTCTGAAACACGTGCCAAAAATTATCCACCTAGTTTGCA 
AATTGCTGCTGGACTTAGCGGAGAACAGGCCAGTGATGATGAAGGATGTGTCCGAAGG 
ACCTGGTCTCTCTGCTGAAATAGAAAAGACAATGTCTGAGATGGTCACCATGCAGCTG 
GATAAAGAGTTACTGAGGCATGACAGTGATGTGTCCAACCCGCCTAACCCCACTGCAG 
TGGCAGCAGATTTTGACAATCGAGGCATGATTTTCTCCAATGAGCAACAGTTTGACCC 
TCTTTGGAAAAGGCGGAATGTTGAGTGCCTTCAACCCCTGACTCATCTGAAAAGGCGG 
CTCAGCTACAGTGACTCAGATTTAAAGAGGGCCGAGAACCTCCTGGAGCAAGGGGAGA 
CTCCACAGACAGTGCCTGCCCAGATCTTGGTTGGCCACAAGCCCAGGCAGCAGAAGCT 
CATAAGCCATTGTTACATCCCACAGTCTCCAGAACCAGACTTACACAAGGAAGCCTTG 
GTTCGCAGCACACTTTCTTTCTGGAGTCAGTCAAAGTTTGGAGGCCTGGAAGGACTCA 
AAGATAATGGGTCACCAATTTTCCATGGAAGGATCATTCCAAAGGAAGCACAGCAGAG 
TGGAGCTTTCTCTGCAGATGTTTCAGGCTCACACAGCCCTGGGGAGCCAGTTTCACCC 
AGCTTTGCAAATGTCCATAAGGATCCAAACCCTGCTCACCAGCAAGTGTCTCACTGTC
AGTGTAAAACTCATGGTGTTGGGAGCCCTGGCTCTGTCAGGCAGAACAGCAGGACACC 
CCGAAGCCCTCTGGACTGTGGCTCCAGTCCCAAAGCACAGTTCTTGGTTGAACATGAA 
ACCCAGGACAGTAAAGATCTGTCTGAAGCAGCTTCACACTCTGCATTACAGTCTGAAT 
TGAGTGCTGAGGCAAGAAGAATACTGGCGGCCAAAGCCCTAGCAAATTTAAATGAATC 
TGTAGAAAAGGAGGAACTAAAAAGGAAGAAAGAGCTTAATTCCCGAGATGGAGCTTGG 
GAAAGAATATGTGGCGAGAGGGACCCTTTCATCCTATGCAGCTTGATGTGGTCTTGGG 
TGGAGCAACTGAAGGAGCCTGTAATCACCAAAGAGGATGTGGACATGTTGGTTGACAG 
GCGAGCAGATGCCGCAGAAGCACTTTTTTTATTAGAGAAGGGACAGCACCAGACTATT 
CTCTGCGTGTTGCACTGCATAGTGAACCTGCAGACAATTCCCGTGGATGTGGAGGAAG 
CTTTCCTTGCCCATGCCATTAAGGCATTCACTAAGGTTAATTTTGATTCTGAAAATGG 
ACCAACAGTTTACAACACCCTGAAGAAAATATTTAAGCACACGCTGGAAGAAAAAAGA 
AAAATGACAAAAGATGGCCCTAAGCCTGGCCTCTAGCTTTCACTC 




ORF Start: ATG at 28 


ORF Stop: TAG at 2122 




SEQ ID NO: 174 


698 aa | MW at 78224.7kD


NOV60a, 
CG98030-01 Protein Sequence 


MACSMACGGRACKYENPARWSEQEQAIKGVYSSWVTDNILAMARPSSELLEKYHIIDQ 
FLSHGIKTIINLQRPGEHASCGNPLEQESGFTYLPEAFMEAGIYFYNFGWKDYGVASL 
TTILDMVKVMTFALQEGKVAIHCHAGLGRTGVLIACYLVFATRMTADQAIIFVRAKRP 
NSIQTKRQLLCVREFTQFLTPLRNIFSCCDPKAHAVTLPQYLIRQRHLLHGYEARLLK 
HVPKIIHLVCKLLLDLAENRPVMMKDVSEGPGLSAEIEKTMSEMVTMQLDKELLRHDS 
DVSNPPNPTAVAADFDNRGMIFSNEQQFDPLWKRRNVECLQPLTHLKRRLSYSDSDLK 
RAENLLEQGETPQTVPAQILVGHKPRQQKLISHCYIPQSPEPDLHKEALVRSTLSFWS 
QSKFGGLEGLKDNGSPIFHGRIIPKEAQQSGAFSADVSGSHSPGEPVSPSFANVHKDP 
NPAHQQVSHCQCKTHGVGSPGSVRQNSRTPRSPLDCGSSPKAQFLVEHETQDSKDLSE 
AASHSALQSELSAEARRILAAKALANLNESVEKEELKRKKELNSRDGAWERICGERDP 
FILCSLMWSWVEQLKEPVITKEDVDMLVDRRADAAEALFLLEKGQHQTILCVLHCIVN 
LQTIPVDVEEAFLAKAIKAFTKVNFDSENGPTVYNTLKKIFKHTLEEKRKMTKDGPKP 
GL
SEQ ID NO: 175 | 2147 bp


NOV60b, 
CG98030-02 DNA Sequence 


CGTTTACGGCATGTCgTTCCTGGACACATGGCATGTTCCATGGCGTGTGGCGGTAGAG
CTTGCAAGTATGAGAACCCAGCCCGCTGGAGTGAGCAGGAGCAAGCCATTAAGGGGGT 
TTACTCATCCTGGGTCACTGATAATATACTGGCCATGGCCCGCCCATCCTCTGAGCTC 
CTGGAGAAGTACCACATCATTGATCAGTTCCTCAGCCATGGCATAAAAACAATAATCA 
ACCTCCAGCGCCCTGGTGAGCATGCTAGCTGTGGGAACCCTCTGGAACAAGAAAGTGG
CTTCACATACCTTCCTGAGGCTTTCATGGAGGCTGGCATTTACTTCTACAATTTCGGA 
TGGAAGGATTATGGTGTAGCGTCTCTTACTACTATCCTAGATATGGTGAAGGTGATGA 




AGGTGTTTTAATAGCCTGTTACTTAGTTTTTGCAACGAGAATGACTGCTGACCAAGCA 
ATTATATTTGTGCGGGCAAAGCGACCCAATTCCATACAAACCAGAGGACAGCTCCTCT 
GTGTAAGGGAATTTACTCAGTTTCTAACTCCTCTCCGCAATATATTCTCTTGCTGTGA 
TCCCAAAGCACATGCTGTCACCTTACCTCAATATCTAATTCGCCAGCGTCATCTGCTT 
CATGGTTATGAGGCACGACTTCTGAAACACGTGCCAAAAATTATCCACCTAGTTTGCA 
AATTGCTGCTGGACTTAGCGGAGAACAGGCCAGTGATGATGAAGGATGTGTCCGAAGG 
ACCTGGTCTCTCTGCTGAAATAGAAAAGACAATGTCTGAGATGGTCACCATGCAGCTG 
GATAAAGAGTTACTGAGGCATGACAGTGATGTGTCCAACCCGCCTAACCCCACTGCAG 
TGGCAGCAGATTTTGACAATCGAGGCATGATTTTCTCCAATGAGCAACAGTTTGACCC 
TCTTTGGAAAAGGCGGAATGTTGAGTGCCTTCAACCCCTGACTCATCTGAAAAGGCGG 
CTCAGCTACAGTGACTCAGATTTAAAGAGGGCCGAGAACCTCCTGGAGCAAGGGGAGA 
CTCCACAGACAGTGCCTGCCCAGATCTTGGTTGGCCACAAGCCCAGGCAGCAGAAGCT 
CATAAGCCATTGTTACATCCCACAGTCTCCAGAACCAGACTTACACAAGGAAGCCTTG 
GTTCGCAGCACACTTTCTTTCTGGAGTCAGTCAAAGTTTGGAGGCCTGGAAGGACTCA 

TGGAGCTTTCTCTGCAGATGTTTCAGGCTCACACAGCCCTGGGGAGCCAGTTTCACCC 
AGCTTTGCAAATGTCCATAAGGATCCAAACCCTGCTCACCAGCAAGTGTCTCACTGTC 
AGTGTAAAACTCATGGTGTTGGGAGCCCTGGCTCTGTCAGGCAGAACAGCAGGACACC 
CCGAAGCCCTCTGGACTGTGGCTCCAGTCCCAAAGCACAGTTCTTGGTTGAACATGAA 
ACCCAGGACAGTAAAGATCTGTCTGAAGCAGCTTCACACTCTGCATTACAGTCTGAAT 
TGAGTGCTGAGGCAAGAAGAATACTGGCGGCCAAAGCCCTAGCAAATTTAAATGAATC 
TGTAGAAAAGGAGGAACTAAAAAGGAAGGTAGAAATGTGGCAGAAAGAGCTTAATTCC 
CGAGATGGAGCTTGGGAAAGAATATGTGGCGAGAGGGACCCTTTCATCCTATGCAGCT 
TGATGTGGTCTTGGGTGGAGCAACTGAAGGAGCCTGTAATCACCAAAGAGGATGTGGA 
CATGTTGGTTGACAGGCGAGCAGATGCCGCAGAAGCACTTTTTTTATTAGAGAAGGGA 
CAGCACCAGACTATTCTCTGCGTGTTGCACTGCATAGTGAACCTGCAGACAATTCCCG 
TGGATGTGGAGGAAGCTTTCCTTGCCCATGCCATTAAGGCATTCACTAAGGTTAGTTT 
TGATTCTGAAAATGGACCAACAGTTTACAACACCCTGAAGAAAATATTTAAGCACACG 
CTGGAAGAAAAAAGAAAAATGACAAAAGATGGCCCTAAGCCTGGCCTCTAGCTTTCAC 




ORF Start: ATG at 28 


ORF Stop: TAG at 2137 




SEQ ID NO: 176


703 aa | MW at 78800.4kD


NOV60b, 
CG98030-02 Protein Sequence 




MACSMACGGRACKYENPARWSEQEQAIKGVYSSWVTDNILAMARPSSELLEKYHIIDQ
FLSHGIKTIINLQRPGEHASCGNPLEQESGFTYLPEAFMEAGIYFYNFGWKDYGVASL
TTILDMVKVMTFALQEGKVAIHCHAGLGRTGVLIACYLVFATRMTADQAIIFVRAKRP
NSIQTRGQLLCVREFTQFLTPLRNIFSCCDPKAHAVTLPQYLIRQRHLLHGYEARLLK
HVPKIIHLVCKLLLDLAENRPVMMKDVSEGPGLSAEIEKTMSEMVTMQLDKELLRHDS
DVSNPPNPTAVAADFDNRGMIFSNEQQFDPLWKRRNVECLQPLTHLKRRLSYSDSDLK
RAENLLEQGETPQTVPAQILVGHKPRQQKLISHCYIPQSPEPDLHKEALVRSTLSFWS
QSKFGGLEGLKDNGSPIFHGRIIPKEAQQSGAFSADVSGSHSPGEPVSPSFANVHKDP
NPAHQQVSHCQCKTHGVGSPGSVRQNSRTPRSPLDCGSSPKAQFLVEHETQDSKDLSE
AASHSALQSELSAEARRILAAKALANLNESVEKEELKRKVEMWQKELNSRDGAWERIC
GERDPFILCSLMWSWVEQLKEPVITKEDVDMLVDRRADAAEALFLLEKGQHQTILCVL
HCIVNLQTIPVDVEEAFLAKAIKAFTKVSFDSENGPTVYNTLKKIFKHTLEEKRKMTK
DGPKPGL
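
The ORF coordinates and protein lengths stated in Table 60A can be cross-checked arithmetically. The following Python sketch is illustrative only; the function name `orf_codons` is hypothetical, and it assumes that the "ORF Start" value is the position of the first base of the ATG codon and that the "ORF Stop" value is the position of the first base of the stop codon.

```python
def orf_codons(orf_start: int, orf_stop: int) -> int:
    """Number of sense codons between the start codon and the stop codon.

    orf_start: 1-based position of the first base of the ATG codon.
    orf_stop:  1-based position of the first base of the stop codon.
    """
    coding_nt = orf_stop - orf_start
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("coordinates do not describe an in-frame ORF")
    return coding_nt // 3


# Coordinates as stated in Table 60A.
print(orf_codons(28, 2122))  # NOV60a
print(orf_codons(28, 2137))  # NOV60b
```

Under these assumptions the computed codon counts agree with the stated lengths of 698 aa for NOV60a and 703 aa for NOV60b.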



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 60B. 
Table 60B. Comparison of NOV60a against NOV60b.


Protein Sequence 


NOV60a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV60b 


1..698 
1..703 


695/703 (98%) 
697/703 (98%) 



Further analysis of the NOV60a protein yielded the following properties shown in 
Table 60C. 



Table 60C. Protein Sequence Properties NOV60a 


PSort analysis: 


0.9400 probability located in nucleus; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV60a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 60D. 



Table 60D. Geneseq Results for NOV60a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV60a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABG08740 


Novel human diagnostic protein 
#8731 - Homo sapiens, 476 aa.
[WO200175067-A2, 11-OCT-2001]


432..662 
227..462 


229/236 (97%) 
231/236 (97%)


e-129 


ABG08740 


Novel human diagnostic protein 
#8731 - Homo sapiens, 476 aa.
[WO200175067-A2, 11-OCT-2001]


432..662 
227..462 


229/236 (97%) 
231/236 (97%)


e-129 


AAG67632 


Amino acid sequence of a human 
protein - Homo sapiens, 447 aa. 
[WO200109316-A1, 08-FEB-2001] 


16..190
171..335


53/183 (28%) 
79/183 (42%) 


6e-10 


AAG67453 


Amino acid sequence of a human 
polypeptide - Homo sapiens, 447 aa. 
[WO200109345-A1, 08-FEB-2001] 


16..190 
171..335


53/183 (28%) 
79/183 (42%)


6e-10 


AAU75364 


Human dual specificity phosphatase 
CDC14A L572P variant - Homo 
sapiens, 594 aa. [US6331614-B1, 
18-DEC-2001] 


91..177 
232..316 


32/87 (36%) 
46/87 (52%) 


3e-07 
In a BLAST search of public sequence databases, the NOV60a protein was found to
have homology to the proteins shown in the BLASTP data in Table 60E. 



Table 60E. Public BLASTP Results for NOV60a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV60a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q8WX19 


BA490F3.2.1 (NOVEL PROTEIN 
ISOFORM 1) - Homo sapiens
(Human), 698 aa. 


1..698 
1..698 


696/698 (99%) 
697/698 (99%) 


0.0 


Q9N091 


UNNAMED PROTEIN PRODUCT 
- Macaca fascicularis (Crab eating 
macaque) (Cynomolgus monkey), 
683 aa. 


1..682 
1..682 


654/682 (95%)
663/682 (96%)


0.0 


Q8WX18 


BA490F3.2.2 (NOVEL PROTEIN 
ISOFORM 2) - Homo sapiens 
(Human), 112 aa.


587..698 
1..112


112/112 (100%)
112/112 (100%)


1e-59


Q9UAX0 


T12B3.1 PROTEIN -
Caenorhabditis elegans, 446 aa. 


3..246
42..281 


92/244 (37%) 
147/244 (59%) 


2e-44 


O60730



CDC14B3 PHOSPHATASE - 
Homo sapiens (Human), 471 aa. 


16..190
201..365


54/183 (29%)
81/183 (43%) 


1e-10



PFam analysis predicts that the NOV60a protein contains the domains shown in 
the Table 60F. 



Table 60F. Domain Analysis of NOV60a 


Pfam Domain 


NOV60a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


DSPc 


38..192 


34/181 (19%) 
98/181 (54%) 


0.0011 


Y_phosphatase 


25..192 


34/290 (12%)
105/290 (36%)


0.0046 



Example 61. 



The NOV61 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 61A.
Table 61A. NOV61 Sequence Analysis






SEQ ID NO: 177 | 4172 bp




NOV61a, 
CG98061-01 DNA Sequence 


ATGGCGGCGGCGGGAGCCCTGGAACGGAGCTTCGTGGAGCTAAGCGGAGCTGAGCGCG 
AAAGGCCGAGGCACTTTCGGGAATTCACAGTCTGCAGCATTGGGACTGCAAATGCCGT 
GGCTGGCGCCGTAAAATACAGTGAAAGCGCGGGAGGCTTTTACTACGTGGAGAGTGGC 
AAGTTGTTCTCCGTAACCAGAAACAGGTTCATTCATTGGAAGACCTCTGGAGATACAT 
TGGAGCTGATGGAGGAGTCACTGGACATAAATCTGTTGAATAATGCCATTCGCCTAAA 
ATTCCAAAATTGCAGTGTTTTACCTGGAGGGGTTTATGTCTCTGAGACTCAGAATCGT 
GTGATAATCTTGATGTTAACCAATCAAACAGTGCACAGGTTACTTTTACCACACCCCT 
CCCGGATGTATAGGAGTGAGTTGGTAGTTGACAGTCAGATGCAGTCAATATTCACTGA
CATTGGAAAAGTTGATTTCACAGATCCTTGCAACTATCAGTTAATTCCAGCAGTACCT 
GGAATATCTCCTAATTCCACCGCCTCTACAGCCTGGCTCAGCAGTGATGGGGAGGCCC 

CATACCTGGTATGGTGTCAGTCGTGGAACTGAAACAGAGTTCAGTAATGCAACGATTG 

CTTACAGGCTGGATGCCAACAGCTATCAGGGGTGACCAGTCGCCTTCAGATCGTCCCC 

TCAGTCTTGCTGTTCATTGTGTGGAGCATGATGCCTTCATCTTTGCTTTGTGTCAGGA 

TCATAAACTACGAATGTGGTCTTACAAGGAGCAAATGTGCCTAATGGTAGCTGACATG 

CTGGAGTATGTCCCTGTGAAGAAAGACCTTCGGCTTACTGCTGGAACTGGACACAAAT 

TACGGCTTGCTTATTCCCCCACCATGGGACTCTACCTGGGGATATACATGCATGCACC 

AAAACGAGGACAGTTCTGCATTTTCCAGTTGGTGAGCACTGAGAGTAATCGCTATAGT 

CTCGATCATATTTCTTCACTGTTCACTTCTCAGGAGACACTGATTGACTTTGCCTTAA 

CTTCCACGGATATCTGGGCCCTGTGGCATGATGCTGAGAACCAAACAGTAGTGAAATA 

CATCAACTTTGAACATAATGTTGCAGGTCAGTGGAATCCAGTTTTTATGCAGCCTCTG 

CCAGAGGAAGAGATTGTCATCAGAGATGATCAAGACCCCAGAGAGATGTATCTGCAAA 

GTCTTTTTACACCAGGACAATTCACAAATGAAGCTTTATGTAAGGCTTTACAGATTTT

CTGCCGAGGAACTGAGAGGAATTTGGATCTTTCCTGGAGTGAACTGAAGAAAGAAGTT 

ACTTTAGCTGTTGAAAATGAGCTTCAAGGAAGTGTAACAGAGTATGAATTCTCCCAGG 

AGGAGTTTCGAAATTTACAACAAGAATTCTGGTGCAAGTTCTATGCCTGTTGTCTTCA

GTATCAAGAAGCCCTCTCTCACCCTCTTGCCCTACATTTGAATCCACACACAAACATG

GTGTGCCTGCTGAAAAAAGGGTACCTGTCTTTCCTTATTCCCTCATCCTTAGTGGATC

ATTTGTATCTCCTGCCTTATGAGAACCTTTTGACAGAAGATGAGACAACCATATCTGA

TGATGTGGATATCGCTCGGGATGTCATATGTCTTATAAAATGCCTCCGGCTGATTGAA 

GAGTCAGTAACTGTGGATATGTCAGTTATAATGGAAATGAGTTGTTATAACCTACAGT

CTCCGGAAAAGGCTGCAGAGCAGATTCTGGAAGATATGATCACTATTGATGTAGAAAA 

TGTGATGGAGGATATTTGTAGTAAACTGCAAGAGATTAGGAACCCAATCCATGCAATT 

GGACTACTTATACGGGAAATGGATTATGAAACAGAAGTGGAAATGGAAAAGGGATTCA 

ATCCAGCTCAGCCTTTGAATATTCGAATGAATCTTACCCAGCTCTATGGTAGTAACAC 

AGCAGGGTATATTGTGTGCAGAGGGGTGCATAAAATCGCCAGTACTCGTTTCCTGATC 

TGCAGAGATCTTTTGATCTTACAGCAGCTGTTAATGAGGCTTGGAGATGCTGTGATTT 

GGGGAACTGGTCAGCTCTTTCAAGCTCAGCAAGACCTACTACATCGAACAGCTCCCCT 

ACTCTTATCTTATTACCTCATTAAATGGGGAAGTGAGTGCTTGGCAACTGATGTTCCA 

CTTGACACACTGGAGTCTAATCTCCAACACTTATCAGTACTGGAATTAACAGACTCTG 

GTGCTTTAATGGCAAATAGGTTTGTATCTAGTCCTCAGACTATTGTGGAGTTATTCTT 

CCAAGAAGTTGCAAGAAAACACATTATATCTCACCTCTTCTCTCAGCCAAAGGCACCT 

CTGAGCCAAACTGGATTGAATTGGCCTGAAATGATTACTGCAATTACCAGTTATTTAT 

TGCAGCTTTTATGGCCTAGCAATCCTGGTTGTCTCTTTCTAGAATGTTTGATGGGAAA 

TTGCCAATATGTACAATTGCAGGATTATATTCAACTGCTACATCCCTGGTGTCAAGTC 

AATGTTGGTTCCTGTCGATTTATGCTGGGAAGGTGTTACCTAGTTACAGGAGAAGGAC 

AGAAGGCTCTGGAATGTTTTTGTCAGGCAGCATCTGAAGTAGGCAAAGAGGAATTCTT 

GGATCGCTTGATTCGCTCAGAGGATGGGGAGATCGTGTCTACCCCCAGGCTGCAGTAT 

TATGACAAGGTTTTACGACTACTAGATGTCATTGGTTTGCCTGAACTGGTTATTCAGT

TGGCTACATCAGCCATAACTGAAGCAAGTGATGACTGGAAAAGTCAGGCTACTCTAAG 

TTAACCCAAATTCCTGATTCCAGCAGGCAATTAGATTGTTTACGGCAGTTGGTGGTAG 
TTCTTTGTGAACGCTCACAGCTACAGGATCTTGTAGAGTTTCCCTATGTGAATCTGCA 
TAATGAGGTTGTGGGAATAATTGAGTCACGTGCTAGAGCTGTGGACCTTATGACTCAC 
AATTACTATGAACTTCTGTATGCCTTTCACATCTATCGCCACAATTACCGCAAGGCTG 
GCACAGTGATGTTTGAGTATGGAATGCGGCTTGGCAGAGAAGTTCGAACTCTCCGGGG 
ACTTGAGAAACAAGGCAACTGTTATCTGGCTGCTCTCAATTGTTTACGACTTATTCGT 
CCAGAATATGCGTGGATTGTGCAGCCAGTGTCTGGTGCAGTGTATGATCGCCCTGGAG 
CATCCCCTAAGAGGAATCATGATGGAGAATGCACAGCTGCCCCCACAAATCGACAAAT 
TGAAATCCTGGAACTGGAAGATCTGGAGAAAGAGTGTTCCTTGGCTCGCATCCGCCTC
ACTTTGGCTCAGCATGATCCATCAGCGGTTGCAGTTGCTGGAAGTTCATCAGCAGAGG 
AAATGGTCACTCTCTTGGTTCAGGCGGGCCTCTTTGACACTGCCATATCACTCTGTCA
GACTTTTAAGCTTCCCTTAACGCCAGTCTTTGAAGGGCTTGCCTTCAAATGCATCAAA
TTGCAATTTGGAGGAGAGGCAGCACAAGCAGAAGCCTGGGCCTGGCTAGCAGCCAATC
AGCTCTCATCTGTCATCACTACTAAGGAGTCTAGTGCTACAGATGAAGCATGGCGACT
ATTATCCACTTACCTGGAGAGGTACAAAGTCCAGAATAACTTGTATCACCACTGTGTA
ATCAACAAGCTCTTGTCTCATGGAGTGCCTCTGCCTAATTGGCTTATAAACAGTTACA
AGAAGGTTGATGCTGCTGAATTGCTTCGTTTATACTTAAACTATGACCTTTTAGAAGA
AGCTGTGGATTTGGTGTCAGAATATGTGGATGCTGTATTGGGAAAAGGACATCAATAC 
TTCGGAATTGAGGTAGGCATATACAAATGGTCCCCAATTGGGAGAGACAGTGCCAACA 
GTCACAACATCGCACTGTCCCAGAAAATACTTGACAAATTGGAGGACTACCAGCAAAA
AGTTGATAAGGCAACACGGGATTTATTATATCGTCGGACCTTGTGATTTGGATT




ORF Start: ATG at 1 


ORF Stop: TGA at 4162




SEQ ID NO: 178 


1387 aa | MW at 157181.6kD


NOV61a, 
CG98061-01 Protein Sequence 




MAAAGALERSFVELSGAERERPRHFREFTVCSIGTANAVAGAVKYSESAGGFYYVESG 
KLFSVTRNRFIHWKTSGDTLELMEESLDINLLNNAIRLKFQNCSVLPGGVYVSETQNR
VIILMLTNQTVHRLLLPHPSRMYRSELVVDSQMQSIFTDIGKVDFTDPCNYQLIPAVP
GISPNSTASTAWLSSDGEALFALPCASGGIFVLKLPPYDIPGMVSVVELKQSSVMQRL
LTGWMPTAIRGDQSPSDRPLSLAVHCVEHDAFIFALCQDHKLRMWSYKEQMCLMVADM 
LEYVPVKKDLRLTAGTGHKLRLAYSPTMGLYLGIYMHAPKRGQFCIFQLVSTESNRYS 
LDHISSLFTSQETLIDFALTSTDIWALWHDAENQTVVKYINFEHNVAGQWNPVFMQPL
PEEEIVIRDDQDPREMYLQSLFTPGQFTNEALCKALQIFCRGTERNLDLSWSELKKEV 
TLAVENELQGSVTEYEFSQEEFRNLQQEFWCKFYACCLQYQEALSHPLALHLNPHTNM
VCLLKKGYLSFLIPSSLVDHLYLLPYENLLTEDETTISDDVDIARDVICLIKCLRLIE 
ESVTVDMSVIMEMSCYNLQSPEKAAEQILEDMITIDVENVMEDICSKLQEIRNPIHAI 
GLLIREMDYETEVEMEKGFNPAQPLNIRMNLTQLYGSNTAGYIVCRGVHKIASTRFLI 
CRDLLILQQLLMRLGDAVIWGTGQLFQAQQDLLHRTAPLLLSYYLIKWGSECLATDVP 
LDTLESNLQHLSVLELTDSGALMANRFVSSPQTIVELFFQEVARKHIISHLFSQPKAP 
LSQTGLNWPEMITAITSYLLQLLWPSNPGCLFLECLMGNCQYVQLQDYIQLLHPWCQV
NVGSCRFMLGRCYLVTGEGQKALECFCQAASEVGKEEFLDRLIRSEDGEIVSTPRLQY 
YDKVLRLLDVIGLPELVIQLATSAITEASDDWKSQATLRTCIFKHHLDLGHNSQAYEA 
LTQIPDSSRQLDCLRQLVVVLCERSQLQDLVEFPYVNLHNEVVGIIESRARAVDLMTH
NYYELLYAFHIYRHNYRKAGTVMFEYGMRLGREVRTLRGLEKQGNCYLAALNCLRLIR 
PEYAWIVQPVSGAVYDRPGASPKRNHDGECTAAPTNRQIEILELEDLEKECSLARIRL 
TLAQHDPSAVAVAGSSSAEEMVTLLVQAGLFDTAISLCQTFKLPLTPVFEGLAFKCIK 
LQFGGEAAQAEAWAWLAANQLSSVITTKESSATDEAWRLLSTYLERYKVQNNLYHHCV 
INKLLSHGVPLPNWLINSYKKVDAAELLRLYLNYDLLEEAVDLVSEYVDAVLGKGHQY 
FGIEVGIYKWSPIGRDSANSHNIALSQKILDKLEDYQQKVDKATRDLLYRRTL 




SEQ ID NO: 179 | 5264 bp


NOV61b, 
CG98061-02 DNA Sequence 



GCTCTTCTGGATTCCTCGCTGCCCACCCTCATTCTCCCGGTGGAAGCCTCGCAACGAA 
GGGCCGGAGCCGCCTTTCTGCTCCCGGAATGCTTCACCTGTCCGCAGCTCCGCCCGCC
CCACCCCCGGAAGTGACGGCGACCGCGCGGCCTGCCTTTGTTCCGTTGGGCGTCGCGG 
C^CG^CGGGAAGATGGCGGCGGCGGGAGCCCTGGAACGGAGCTTCGTGGAGCTAAGC 
GGAGCTGAGCGCGAAAGGCCGAGGCACTTTCGGGAATTCACAGTCTGCAGCATTGGGA 
CTGCAAATGCCGTGGCTGGCGCCGTAAAATACAGTGAAAGCGCGGGAGGCTTTTACTA 
CGTGGAGAGTGGCAAGTTGTTCTCCGTAACCAGAAACAGGTTCATTCATTGGAAGACC 
TCTGGAGATACATTGGAGCTGATGGAGGAGTCACTGGACATAAATCTGTTGAATAATG 
CCATTCGCCTAAAATTCCAAAATTGCAGTGTTTTACCTGGAGGGGTTTATGTCTCTGA 
GACTCAGAATCGTGTGATAATCTTGATGTTAACCAATCAAACAGTGCACAGGTTACTT 
TTACCACACCCCTCCCGGATGTATAGGAGTGAGTTGGTAGTTGACAGTCAGATGCAGT 
CAATATTCACTGACATTGGAAAAGTTGATTTCACAGATCCTTGCAACTATCAGTTAAT 
TCCAGCAGTACCTGGAATATCTCCTAATTCCACCGCCTCTACAGCCTGGCTCAGCAGT 
GATGGGGAGGCCCTGTTTGCCTTACCATGTGCTTCTGGGGGAATCTTTGTTCTTAAGC 
TACCTCCTTATGACATACCTGGTATGGTGTCAGTCGTGGAACTGAAACAGAGTTCAGT 
AATGCAACGATTGCTTACAGGCTGGATGCCAACAGCTATCAGGGGTGACCAGTCGCCT 
TCAGATCGTCCCCTCAGTCTTGCTGTTCATTGTGTGGAGCATGATGCCTTCATCTTTG 
CTTTGTGTCAGGATCATAAACTACGAATGTGGTCTTACAAGGAGCAAATGTGCCTAAT 
GGTAGCTGACATGCTGGAGTATGTCCCTGTGAAGAAAGACCTTCGGCTTACTGCTGGA 

ACATGCATGCACCAAAACGAGGACAGTTCTGCATTTTCCAGTTGGTGAGCACTGAGAG 
TAATCGCTATAGTCTCGATCATATTTCTTCACTGTTCACTTCTCAGGAGACACTGATT 
GACTTTGCCTTAACTTCCACGGATATCTGGGCCCTGTGGCATGATGCTGAGAACCAAA 
CAGTAGTGAAATACATCAACTTTGAACATAATGTTGCAGGTCAGTGGAATCCAGTTTT 
TATGCAGCCTCTGCCAGAGGAAGAGATTGTCATCAGAGATGATCAAGACCCCAGAGAG
ATGTATCTGCAAAGTCTTTTTACACCAGGACAATTCACAAATGAAGCTTTATGTAAGG 
CTTTACAGATTTTCTGCCGAGGAACTGAGAGGAATTTGGATCTTTCCTGGAGTGAACT 
GAAGAAAGAAGTTACTTTAGCTGTTGAAAATGAGCTTCAAGGAAGTGTAACAGAGTAT
GAATTCTCCCAGGAGGAGTTTCGAAATTTACAACAAGAATTCTGGTGCAAGTTCTATG
CCTGTTGTCTTCAGTATCAAGAAGCCCTCTCTCACCCTCTTGCCCTACATTTGAATCC 
ACACACAAACATGGTGTGCCTGCTGAAAAAAGGGTACCTGTCTTTCCTTATTCCCTCA
TCCTTAGTGGATCATTTGTATCTCCTGCCTTATGAGAACCTTTTGACAGAAGATGAGA

CAACCATATCTGATGATGTGGATATCGCTCGGGATGTCATATGTCTTATAAAATGCCT
CCGGCTGATTGAAGAGTCAGTAACTGTGGATATGTCAGTTATAATGGAAATGAGTTGT
TATAACCTACAGTCTCCGGAAAAGGCTGCAGAGCAGATTCTGGAAGATATGATCACTA

TTGATGTAGAAAATGTGATGGAGGATATTTGTAGTAAACTGCAAGAGATTAGGAACCC
AATCCATGCAATTGGACTACTTATACGGGAAATGGATTATGAAACAGAAGTGGAAATG
GAAAAGGGATTCAATCCAGCTCAGCCTTTGAATATTCGAATGAATCTTACCCAGCTCT
ATGGTAGTAACACAGCAGGGTATATTGTGTGCAGAGGGGTGCATAAAATCGCCAGTAC

GATGCTGTGATTTGGGGAACTGGTCAGCTCTTTCAAGCTCAGCAAGACCTACTACATC 
GAACAGCTCCCCTACTCTTATCTTATTACCTCATTAAATGGGGAAGTGAGTGCTTGGC 
AACTGATGTTCCACTTGACACACTGGAGTCTAATCTCCAACACTTATCAGTACTGGAA 
TTAACAGACTCTGGTGCTTTAATGGCAAATAGGTTTGTATCTAGTCCTCAGACTATTG 
TGGAGTTATTCTTCCAAGAAGTTGCAAGAAAACACATTATATCTCACCTCTTCTCTCA 
GCCAAAGGCACCTCTGAGCCAAACTGGATTGAATTGGCCTGAAATGATTACTGCAATT 
ACCAGTTATTTATTGCAGCTTTTATGGCCTAGCAATCCTGGTTGTCTCTTTCTAGAAT 
GTTTGATGGGAAATTGCCAATATGTACAATTGCAGGATTATATTCAACTGCTACATCC 
CTGGTGTCAAGTCAATGTTGGTTCCTGTCGATTTATGCTGGGAAGGTGTTACCTAGTT 
ACAGGAGAAGGACAGAAGGCTCTGGAATGTTTTTGTCAGGCAGCATCTGAAGTAGGCA 
AAGAGGAATTCTTGGATCGCTTGATTCGCTCAGAGGATGGGGAGATCGTGTCTACCCC 
CAGGCTGCAGTATTATGACAAGGTTTTACGACTACTAGATGTCATTGGTTTGCCTGAA 
CTGGTTATTCAGTTGGCTACATCAGCCATAACTGAAGCAAGTGATGACTGGAAAAGTC 
AGGCTACTCTAAGGACATGTATTTTCAAACATCATTTGGATTTGGGTCACAATAGCCA 
AGCATATGAAGCCTTAACCCAAATTCCTGATTCCAGCAGGCAATTAGATTGTTTACGG 
CAGTTGGTGGTAGTTCTTTGTGAACGCTCACAGCTACAGGATCTTGTAGAGTTTCCCT 
ATGTGAATCTGCATAATGAGGTTGTGGGAATAATTGAGTCACGTGCTAGAGCTGTGGA 
CCTTATGACTCACAATTACTATGAACTTCTGTATGCCTTTCACATCTATCGCCACAAT 
TACCGCAAGGCTGGCACAGTGATGTTTGAGTATGGAATGCGGCTTGGCAGAGAAGTTC 
GAACTCTCCGGGGACTTGAGAAACAAGGCAACTGTTATCTGGCTGCTCTCAATTGTTT 
ACGACTTATTCGTCCAGAATATGCGTGGATTGTGCAGCCAGTGTCTGGTGCAGTGTAT 
GATCGCCCTGGAGCATCCCCTAAGAGGAATCATGATGGAGAATGCACAGCTGCCCCCA 
CAAATCGACAAATTGAAATCCTGGAACTGGAAGATCTGGAGAAAGAGTGTTCCTTGGC 
TCGCATCCGCCTCACTTTGGCTCAGCATGATCCATCAGCGGTTGCAGTTGCTGGAAGT 

TATCACTCTGTCAGACTTTTAAGCTTCCCTTAACGCCAGTCTTTGAAGGGCTTGCCTT 
CAAATGCATCAAATTGCAATTTGGAGGAGAGGCAGCACAAGCAGAAGCCTGGGCCTGG 
CTAGCAGCCAATCAGCTCTCATCTGTCATCACTACTAAGGAGTCTAGTGCTACAGATG 
AAGCATGGCGACTATTATCCACTTACCTGGAGAGGTACAAAGTCCAGAATAACTTGTA 
TCACCACTGTGTAATCAACAAGCTCTTGTCTCATGGAGTGCCTCTGCCTAATTGGCTT 
ATAAACAGTTACAAGAAGGTTGATGCTGCTGAATTGCTTCGTTTATACTTAAACTATG 
ACCTTTTAGAAGAAGCTGTGGATTTGGTGTCAGAATATGTGGATGCTGTATTGGGAAA 
AGGACATCAATACTTCGGAATTGAGTTTCCACTGTCCGCAACAGCCCCAATGGTGTGG 
CTTCCATACTCCTCTATTGATCAGCTTCTCCAAGCTCTGGGAGAGAACAGTGCCAACA 
GTCACAACATCGCACTGTCCCAGAAAATACTTGACAAATTGGAGGACTACCAGCAAAA 
AGTTGATAAGGCAACACGGGATTTATTATATCGTCGGACCTTGTGATTTGGATTGTCA
CCTAGCCTTTGTAACCGCTTGGTGCCTCTTAGGACTTAAGACTACCCTACAGGAACCC 
TGTACTCAAGGCCGATTTTTGTAACTGTAAATGATGTGTACAACATTCAAGTCTGCAT 
TCTGCACAAGATAGGAGGGCGGAAGAGTCAGAGGACCCTGTGCTTGCTGGTGGTGCTA 
ACACAATTTCTGGTGTTCAACCTTGGTCTCAAATAGCTGCTTTTGTATATGATTCACG 
AGCTTTTTTAGAGTTTATATTTTTTTAAACTACCGAAGACATTCATIATCTGCAAATT 
AAGACTCACCTTCACTTTCCAAAATAGCTGAGGGTTGTTGGCTTGTTGTAGCTGACCA 

CAATTATTTTGGTCCAGAACTGACCTGTATTTTCTGTATTGTACACAAAAGCTAATAA 
TTTTGTGTACTTTTTATTTATTTTGGAGGTTTTATATGATCTTCAATTGAGTATTAAA 
TAATTTGCCTAGATTAAGCCTAAAATGATGACCAGCTAATTAAAGAAGATATTTTGAA 
TCTGGTTCTGAGCTAAAGTTGAGTAAATTCTTAGCTAAGAAAAAATTGGAAATCCATC
ATCTATATTAGCAACAGATTCTCAGAGTAAATTGTTAACTTCTATGATTTATGATAAT 
CAAGCTGGACTTGATCATACAAGTTAGTCTCATAATGTATTGGACCAAAATGTAAACT 
TCATTGGTCAGATTTAGAAGCATTCATGCTCACAAGTTTTGGGAAAGTGAAAAATAAT 
AAAATCATCTTGGATTTTATTCTGTATATTAAAATTTATCTTTT 
ORF Start: ATG at 188


ORF Stop: TGA at 4394 




SEQ ID NO: 180


1402 aa | MW at 158753.4kD 


NOV61b, 
CG98061-02 Protein Sequence 


MAAAGALERSFVELSGAERERPRHFREFTVCSIGTANAVAGAVKYSESAGGFYYVESG 
KLFSVTRNRFIHWKTSGDTLELMEESLDINLLNNAIRLKFQNCSVLPGGVYVSETQNR 
VIILMLTNQTVHRLLLPHPSRMYRSELVVDSQMQSIFTDIGKVDFTDPCNYQLIPAVP 
GISPNSTASTAWLSSDGEALFALPCASGGIFVLKLPPYDIPGMVSVVELKQSSVMQRL
LTGWMPTAIRGDQSPSDRPLSLAVHCVEHDAFIFALCQDHKLRMWSYKEQMCLMVADM 
LEYVPVKKDLRLTAGTGHKLRLAYSPTMGLYLGIYMHAPKRGQFCIFQLVSTESNRYS 
LDHISSLFTSQETLIDFALTSTDIWALWHDAENQTVVKYINFEHNVAGQWNPVFMQPL
PEEEIVIRDDQDPREMYLQSLFTPGQFTNEALCKALQIFCRGTERNLDLSWSELKKEV 
TLAVENELQGSVTEYEFSQEEFRNLQQEFWCKFYACCLQYQEALSHPLALHLNPHTNM 
VCLLKKGYLSFLIPSSLVDHLYLLPYENLLTEDETTISDDVDIARDVICLIKCLRLIE
ESVTVDMSVIMEMSCYNLQSPEKAAEQILEDMITIDVENVMEDICSKLQEIRNPIHAI 
GLLIREMDYETEVEMEKGFNPAQPLNIRMNLTQLYGSNTAGYIVCRGVHKIASTRFLI 
CRDLLILQQLLMRLGDAVIWGTGQLFQAQQDLLHRTAPLLLSYYLIKWGSECLATDVP 
LDTLESNLQHLSVLELTDSGALMANRFVSSPQTIVELFFQEVARKHIISHLFSQPKAP 
LSQTGLNWPEMITAITSYLLQLLWPSNPGCLFLECLMGNCQYVQLQDYIQLLHPWCQV
NVGSCRFMLGRCYLVTGEGQKALECFCQAASEVGKEEFLDRLIRSEDGEIVSTPRLQY 
YDKVLRLLDVIGLPELVIQLATSAITEASDDWKSQATLRTCIFKHHLDLGHNSQAYEA 
LTQIPDSSRQLDCLRQLVVVLCERSQLQDLVEFPYVNLHNEVVGIIESRARAVDLMTH
NYYELLYAFHIYRHNYRKAGTVMFEYGMRLGREVRTLRGLEKQGNCYLAALNCLRLIR 
PEYAWIVQPVSGAVYDRPGASPKRNHDGECTAAPTNRQIEILELEDLEKECSLARIRL 
TLAQHDPSAVAVAGSSSAEEMVTLLVQAGLFDTAISLCQTFKLPLTPVFEGLAFKCIK 
LQFGGEAAQAEAWAWLAANQLSSVITTKESSATDEAWRLLSTYLERYKVQNNLYHHCV 
INKLLSHGVPLPNWLINSYKKVDAAELLRLYLNYDLLEEAVDLVSEYVDAVLGKGHQY 
FGIEFPLSATAPMVWLPYSSIDQLLQALGENSANSHNIALSQKILDKLEDYQQKVDKA
TRDLLYRRTL 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 61B. 



Table 61B. Comparison of NOV61a against NOV61b. 


Protein Sequence 


NOV61a Residues/
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV61b 


1..1387 
1..1402 


1308/1402 (93%)
1312/1402 (93%) 
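
The percentages in Table 61B can be reproduced from the stated fractions. The following Python sketch is illustrative only; the function name `percent_floor` is hypothetical, and it assumes that the reported percentage is the ratio truncated to a whole number, which is consistent with both entries of this table but may not hold for every table in this document.

```python
def percent_floor(matches: int, alignment_length: int) -> int:
    """Whole-number percentage, truncated rather than rounded."""
    return 100 * matches // alignment_length


# Values as stated in Table 61B (NOV61a against NOV61b).
identities, similarities, alignment_length = 1308, 1312, 1402

print(percent_floor(identities, alignment_length))
print(percent_floor(similarities, alignment_length))
print(alignment_length - identities)  # aligned positions that are not identical
```

The last value counts the alignment positions (including any gap positions) at which NOV61a and NOV61b do not carry the same residue.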



Further analysis of the NOV61a protein yielded the following properties shown in
Table 61C.



Table 61C. Protein Sequence Properties NOV61a 


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 
A search of the NOV61a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 61D.



Table 61D. Geneseq Results for NOV61a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV61a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAM40894 | Human polypeptide SEQ ID NO 5825 - Homo sapiens, 1401 aa. [WO200153312-A1, 26-JUL-2001] | 1..1314 / 75..1389 | 1308/1315 (99%) / 1311/1315 (99%) | 0.0
AAM39108 | Human polypeptide SEQ ID NO 2253 - Homo sapiens, 1316 aa. [WO200153312-A1, 26-JUL-2001] | 1..1307 / 1..1312 | 1299/1312 (99%) / 1301/1312 (99%) | 0.0
ABB60919 | Drosophila melanogaster polypeptide SEQ ID NO 9549 - Drosophila melanogaster, 1411 aa. [WO200171042-A2, 27-SEP-2001] | 43..1335 / 41..1333 | 389/1342 (28%) / 638/1342 (46%) | e-143
AAB38261 | Gene 13 human secreted protein homologous amino acid sequence #117 - Homo sapiens, 58 aa. [WO200058469-A1, 05-OCT-2000] | 471..528 / 1..58 | 58/58 (100%) / 58/58 (100%) | 9e-29
AAB38262 | Human secreted protein sequence encoded by gene 13 SEQ ID NO:118 - Homo sapiens, 58 aa. [WO200058469-A1, 05-OCT-2000] | 471..528 / 1..58 | 57/58 (98%) / 58/58 (99%) | 2e-28
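
Expect values in these tables are written either in full (for example 9e-29 or 0.0) or in a shortened form with no leading digit (for example e-143, which stands for 1e-143). A small sketch of turning either form into a number follows; the function name is hypothetical.

```python
def parse_expect(text: str) -> float:
    """Convert an Expect Value string such as 'e-143', '9e-29' or '0.0'."""
    value = text.strip().lower()
    if value.startswith("e"):
        # Shortened form: an implicit mantissa of 1.
        value = "1" + value
    return float(value)


values = ["0.0", "e-143", "9e-29", "2e-28"]
# Smaller Expect values indicate stronger matches, so ascending order
# lists the strongest hits first.
print(sorted(values, key=parse_expect))
```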



In a BLAST search of public sequence databases, the NOV61a protein was found to
have homology to the proteins shown in the BLASTP data in Table 61E.

Table 61E. Public BLASTP Results for NOV61a

Protein Accession Number | Protein/Organism/Length | NOV61a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q12769 | Protein KIAA0197 - Homo sapiens (Human), 1314 aa (fragment). | 3..1307 / 1..1310 | 1298/1310 (99%) / 1300/1310 (99%) | 0.0
Q9Z0W3 | Protein KIAA0197 (GTL-13) - Mus musculus (Mouse), 1402 aa. | 1..1387 / 1..1402 | 1274/1402 (90%) / 1332/1402 (94%) | 0.0
Q9VKJ3 | Protein KIAA0197 homolog - Drosophila melanogaster (Fruit fly), 1411 aa. | 43..1335 / 41..1333 | 389/1342 (28%) / 638/1342 (46%) | e-143
Q9CZD9 | 2810011M03RIK PROTEIN - Mus musculus (Mouse), 215 aa. | 1181..1380 / 1..215 | 180/215 (83%) / 187/215 (86%) | 1e-96
Q96GB3 | KIAA0197 PROTEIN - Homo sapiens (Human), 190 aa. | 1..141 / 1..141 | 141/141 (100%) / 141/141 (100%) | 5e-75



PFam analysis predicts that the NOV61a protein contains the domains shown in
the Table 61F.



Table 61F. Domain Analysis of NOV61a

NOV61a Match Region | Identities/Similarities for the Matched Region
No Significant Matches Found



Example 62. 

The NOV62 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 62A. 

Table 62A. NOV62 Sequence Analysis

SEQ ID NO: 181 | 5544 bp
NOV62a, CG98071-01 DNA Sequence


GCGGGACTCAAGAGTAGCCTTCCTCGAGGACCTGCCTTTCCCATTTGCTGCCTGAAGT 
TAATGTTTCTTGCTGGCCAAATCAGGGACATGCCGGCATTAGCGGGATGAGTGGGTGT 
TCCGGCAGGGATGTGGTCATTGACGGCCAGTGAGGGCGAGAGTACCACGGCCCACTTC 
TTCCTTGGAGCTGGAGATGAGGGGCTGGGCACCCGTGGAATAGGCATGAGGCCAGAAG 
AGAGTGACAGCGAGCTCCTTGAGGATGAGGAGGATGAAGTGCCTCCTGAACCTCAGAT 
CATTGTTGGCATCTGTGCCATGACCAAGAAATCCAAGTCCAAGCCAATGACTCAAATC 
CTAGAGCGACTCTGCAGATTTGACTACCTGACTGTTGTCATTCTGGGAGAAGATGTAA 
TCCTTAATGAACCTGTGGAAAACTGGCCATCCTGCCACTGCCTCATCTCTTTCCACTC 
CAAAGGCTTTCCTCTGGACAAAGCTGTTGCTTACTCCAAGCTTCGAAACCCCTTTCTT 
ATCAATGATCTGGCCATGCAGTATTACATCCAAGATAGGAGGGAGGTGTACCGGATCC 
TGCAGGAAGAGGGTATTGATCTGCCTCGATATGCTGTGCTCAACCGTGATCCTGCCCG 
GCCTGAGGAATGCAACCTGATAGAAGGTGAAGACCAAGTAGAGGTCAATGGAGCTGTC 
TTTCCCAAGCCCTTTGTGGAGAAGCCAGTGAGTGCAGAAGACCACAATGTTTACATCT 
ACTACCCCAGCTCAGCTGGAGGAGGAAGCCAGCGTCTCTTTCGTAAGATTGGCAGCCG 
AAGCAGTGTTTACTCTCCTGAGAGCAGCGTCCGAAAGACGGGGTCGTACATCTATGAG 
GAGTTTATGCCAACAGATGGCACAGATGTCAAGGTGTATACAGTGGGGCCAGATTATG 
CCCATGCTGAAGCTAGAAAATCTCCAGCTTTGGATGGGAAGGTTGAACGAGACAGTGA 
GGGGAAAGAGATTCGATATCCAGTCATGCTGACTGCCATGGAAAAGCTGGTGGCCAGG 
AAAGTCTGCGTAGCTTTCAAGCAAACAGTTTGTGGATTTGACCTTCTTCGTGCCAATG 
GTCATTCCTTTGTGTGTGATGTCAATGGCTTTAGTTTTGTCAAGAACTCGATGAAATA 
CTACGATGACTGTGCCAAGATTCTGGGGAACACCATAATGCGGGAGCTTGCCCCACAG 
TTCCAGATTCCATGGTCCATCCCCACGGAGGCTGAGGACATTCCCATTGTTCCCACCA 
CATCTGGCACTATGATGGAACTTCGTTGTGTCATTGCAATTATTCGTCATGGGGATCG 
TACTCCCAAGCAGAAGATGAAGATGGAAGTGAAACACCCAAGGTTTTTTGCTCTGTTT 
GAAAAACATGGTGGCTACAAGACAGGGAAATTAAAACTCAAGCGACCTGAGCAGCTCC 
AGGAGGTGCTGGATATCACAAGGCTGTTGTTGGCTGAACTGGAGAAAGAACCAGGTGG 
^GAGATCGAGGAGAAGACTGGAAAACTAGAGCAGCTGAAGTCTGTACTGGAGATGTAT 
GGTCACTTCTCAGGTATAAACCGGAAGGTACAATTGACTTACTACCCTCATGGAGTAA 
AAGCTTCTAATGAGGGGCAAGATCCACAGAGGGAAACTCTGGCCCCATCTCTGTTGCT 
GGTACTGAAGTGGGGTGGAGAACTGACTCCTGCTGGCCGTGTTCAGGCTGAGGAGCTG 
GGGCGAGCTTTTCGCTGCATGTACCCTGGAGGACAGGGTGACTATGCTGGCTTCCCTG 
GTTGTGGGCTGCTTCGTCTCCATAGCACTTTCCGCCACGATCTCAAGATCTATGCCTC 
TGATGAGGGTCGTGTTCAGATGACTGCTGCTGCCTTCGCCAAGGGCCTTCTGGCTCTA 
GAAGGGGAGCTGACACCCATTTTGGTGCAAATGGTGAAGAGTGCCAACATGAATGGGC 
TACTGGACAGCGATGGGGATTCCTTGAGCAGCTGCCAGCACCGGGTGAAGGCTCGGCT 
GCACCATATTCTACAGCAGGATGCACCCTTTGGCCCTGAGGACTACGATCAGCTGGCT 
CCCACCAGAAGTACTTCCCTGCTCAACTCCATGACTATCATCCAGAATCCTGTGAAGG 
TCTGTGATCAGGTATTTGCCCTGATCGAAAACCTCACCCACCAGATCCGGGAACGAAT 
GCAGGACCCCAGGTCTGTAGACCTGCAGCTCTACCACAGTGAGACACTAGAGCTAATG 
CTACAGCGTTGGAGCAAGCTGGAGCGTGACTTTCGACAGAAGAGTGGGCGCTATGATA 
TCAGTAAGATCCCTGACATCTATGACTGTGTCAAGTATGATGTGCAGCACAATGGGAG 
TCTGGGACTTCAAGGCACAGCAGAGTTGCTCCGTCTCTCTAAGGCACTGGCTGATGTG 
GTCATTCCCCAGGAGTACGGGATCAGTCGGGAGGAGAAACTGGAAATTGCTGTGGGCT 
TCTGTCTTCCACTGTTGCGGAAGATACTACTTGACCTGCAGAGAACCCACGAGGATGA 

GTTCGAACGCGTCTCTATTTCACCAGTGAGAGCCATGTCCACTCCCTGCTCAGTGTCT 
TCCGTTATGGAGGACTTCTTGATGAGACCCAGGATGCACAATGGCAGCGAGCTTTGGA 
TTATCTTAGTGCCATCTCAGAGCTTAACTACATGACCCAGATTGTCATCATGCTTTAT 
GAGGACAACACACAGGATCCCTTATCAGAGGAACGGTTCCATGTGGAGCTACACTTCA 

rCCAGCCTCTTCTGAGAATGAGGAGATGAAAACCAACCAAGGCAGTATGGAGAACCTG 
TGTCCAGGAAAGGCATCAGATGAACCAGACCGGGCATTGCAGACTTCACCCCAGCCTC 
CTGAGGGCCCTGGCCTTCCGAGGAGATCACCCCTCATTCGTAACCGAAAAGCTGGTTC 
CATGGAGGTACTTTCTGAGACTTCATCCTCGAGGCCTGGTGGCTACCGGCTCTTTTCA 
rCTTCACGGCCACCCACAGAAATGAAGCAGAGTGGCCTAGGGTTTGAAGGGTGTTCCA 
rGGTGCCTACCATCTACCCTCTGGAAACACTGCATAATGCCCTTTCCCTACGTCAAGT 
3AGTGAATTCTTGAGTAGAGTCTGCCAGCGCCACACTGATGCCCAGGCACAGGCATCT 
3CAGCCCTCTTTGATTCCATGCACAGCAGCCAGGCCTCAGATAACCCATTTTCTCCAC 

:acgtactcttcattcacctcccctgcaactccagcagcgctctgagaagcccccttg 
3ttagagacaaggttttgccatgttggccaggctggtttagagctcctgacctcaagt 
3atctgcctgcctcggcctcccaaagtgctgggattacaggcgtgagccaccgcaccc 

rACAGTAGATGGTAACTCCCAATTTGGCTTCAGTGATCAACCCTCCCTAAATTCACAC 
3TGGCTGAAGAACATCA^GGCCTTGGGCTGCTCCAGGAGACCCCTGGGAGTGGAGCAC 
4AGAGCTCTCCATAGAAGGGGAGCAAGAGCTTTTTGAACCAAATCAGTCCCCACAGGT 
3CCACCTATGGAAACCAGCCAGCCATACGAGGAGGTCAGCCAGCCATGTCAGGAGGTC 
:CTGACATCAGCCAGCCATGCCAGGACATTTCTGAGGCGCTCAGCCAGCCATGTCAGA 
AGGTCCCTGACATCAGCCAGCAATGCCAGGAGAACCATGACAATGGTAACCACACATG
CCAGGAGGTCCCTCACATCAGCCAGCCATGCCAGAAGTCCAGCCAACTGTGCCAGAAA 
GTCTCTGAGGAAGTTTGCCAGCTATGTCTGGAGAACTCCGAGGAGGTCAGCCAGCCAT 
GCCAGGGGGTCTCTGTGGAGGTTGGCAAGCTGGTCCATAAGTTCCATGTAGGGGTTGG 
TAGCTTGGTCCAGGAAACCCTTGTAGAAGTTGGCAGCCCAGCTGAAGAGATCCCTGAG 
GAGGTCATCCAGCCATACCAGGAGTTCTCTGTGGAGGTTGGCAGGCTGGCCCAGGAGA 
CTTCTGCGATCAATCTGTTATCTCAGGGCATCCCTGAGATTGATAAACCATCCCAGGA 
GTTCCCTGAGGAGATTGATCTGCAGGCCCAGGAGGTCCCTGAGGAGATAAATTAGAAG 
TCCTGGGTGGTCCCTGAAGTGATTGATCAGCTGCCTGGAGAGGGTATTCCTCAAGCCC 
AGCATCCATCTGGTGATCCAAACCCTCAGAGCCAGTCTCTAGCCCATGACCAGCACTC 
ACCCCTTCCACCAGCAACATGTGATTAATTTTCTCATTAGTGGTATCACACTATACCA 




GTGTCTGGGGAAGTAGTTCTCTTTGTATGAAGCATACCTGTGCCAGAGCTGTGGTTGA 
GGAGGAGCCAGTTTTAGGTTCGAAGAAGCCATTGGCTCCTCACTTAGCCATTAGACTT 
GAATAGGATTTCCTTGGGGTGGGTTGGTCTGTCATTACCCAGCCTTCTCTGAGCATCC 
TAGGAAATCACAGATTGTTAAAGGAAATGCCGCTTCACTGCTGAAGACACCATCTGGC 
GACAGCAAATGCAAAAGAGGGGACTCTAGGGTCTTCACTTTTCTGGGGAAATGTTCAC 
GACTTCTCAAGGTACGCTTAGACCATATGTGCATTCAGGGGACTCTTGTCTTTGCCAG 
CACTCCTCAGGTGAGCCCGGGATGGCTATCTGAAATGTCTGGAAAATAGAGTCCCTTC 
CCCAAACATCTGACATGGCCACTAAATCTTTAGACTTGCCTTATAGATCCTTCCATAT 
CATCCCCAAGTGCCCCTTCTGCCTCAGTCCTGTTCCTCTGGGCAATAGCTCTAGGGAA 
AAGTAGGTATTAGACTGCTGTGCAAAATTCAGAGCAACTACTTAAAATGCTCTAGAGG 

GATGCTGTAGTAGTAGAGATAATCAGGTTCTATTTTAAGCAATCAGCAGAGATCAATA 
TTGTACAGATACAAGCAAAGGTTTAATAAATATATATATATATATTTATTTCTGTAGT 
TGTGCCAGGGACAGACTGGCCAAAGACCAAACACTGCAGGGTCCCCAAGAAATTTGTC 
CTTATATCATTGTGTCTGAGGGCCAGATATGATTATGAAGCTTTTTCCAAAGATCCAG 
GGATGGGAATGGGAGTGGGTAGGAGGGGTAATGCGGTCATTGGAGTCGGGGGCTGGAA 
CATTATGAGTGCTCAATAAATATAAACTAATGAG 






ORF Start: ATG at 127 


ORF Stop: TAG at 4345 




SEQ ID NO: 182 | 1406 aa | MW at 156318.6kD
NOV62a, CG98071-01 Protein Sequence


MWSLTASEGESTTAHFFLGAGDEGLGTRGIGMRPEESDSELLEDEEDEVPPEPQIIVG 
I CAMTKKSKS KPMTQ ILERLCRFDYLTWILGEDVILHEPVENWPSCHCLI SPHSKGF 
PLDKAVAYSKLRNPFLINDLAMQYYIQDRREVyRILQEEGIDLPRYAVTjNRDPARPEE 
CNLIEGEDQVEVNGAVFPKPFVEKPVEAED1INVYIYYPSSAGGGSQRLFRKIGSRSSV 
YSPESSVRKTGSYIYEEFMPTDGTDVKVYTVGPDYAHAEARKSPALDGKVERDSEGKE 
IRYPVMLTAMEKLVARKVCVAFKQTVCGFDLLRANGHSFVCDVNGFSFVKNSMKYYDD 
CAKILGNTIMRELAPQFQIPWSIPTEAEDIPIVPTTEGTMMELRCVIAIIRHGDRTPK 
QKMKMEVKHPRFFALFEKHGGYKTGKLKLKRPEQLQEVLDITRLLLAELEKEPGGEIE 
EKTGKLEQLKSVLEMYGHFSGINRKVQLTYYPHGVKASNEGQDPQRETLAPSLLLVLK 
WGGELTPAGRVQAEELGRAFRCMYPGGQGDYAGFPGCGLLRLHSTFRHDLKIYASDEG 
RVQOTAAAFAKGLIALEGELTPILVQMVKSANMNGLLDSDGDSLSSCQHRVKARLHHI 
LQQDAP FG PEDYDQLAPTRS TELLNSMTI I QNPVKVCDQVFAL IENLTHQI RERMQDP 
RSVDLQLYHSETLELMLQRWSKLERDFRQKSGRYDISKIPDIYDCVKYDVQHNGSLGL 
QGTAELLRLSKAliADWIPQEYGISREEKLEIAVGFCLPLLRKILLDLQRTHEDESVN 
KLHPLYSRGVLSPGRHVRTRLYFTSESHVHSLLSVFRYGGLLDETQEIAQWQRALDYLS 
AISELNYMTQIVIMLYEDNTQDPLSEERFHVELHFSPGVKGVEEEGSAPAGCGFRPAS 
SENEEMKTNQGSMENLCPGKASDEPDRALQTSPQPPEGPGLPRRSPLIRNRKAGSMEV 
LSETSSSRPGGYRLFSSSRPPTEMKQSGLGFEGCSMVPTIYPLETLHNALSLRQVSEF 
LSRVCQRHTDAQAQASAALFDSMHSSQASDNPFSPPRTLHSPPLQLQQRSEKPPWLET 
RFCHVGQAGLELLTSSDLPASASQSAGITGVSHRTQPDSSGPSSTVSSAGPSSPTTVD 
GNSQFGFSDQPSLNSHVAEEHQGLGLLQETPGSGAQELSIEGEQELFEPNQSPQVPPM 
ETSQPYEEr/SQPCQEVPDISQPCQDISEALSQPCQKVPDISQQCQENHDNGNHTCQEV 
PHISQPCQKSSQLCQKVSEEVCQLCLENSEEVSQPCQGVSVEVGKLVHKFHVGVGSLV 




EIDLQAQEVPEEIN 
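
The ORF coordinates and the protein length reported in Table 62A can be cross-checked with simple arithmetic. The sketch below assumes that "ORF Start" is the position of the first base of the ATG codon and "ORF Stop" is the position of the first base of the stop codon; the source does not state this convention, but it fits the reported numbers. The function name is hypothetical.

```python
def orf_codon_count(orf_start: int, orf_stop: int) -> int:
    """Number of coding codons between the start codon and the stop codon.

    Assumption: orf_start is the first base of ATG and orf_stop is the first
    base of the stop codon, both 1-based positions in the DNA sequence.
    """
    span = orf_stop - orf_start
    if span <= 0 or span % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return span // 3


# NOV62a: ORF Start ATG at 127, ORF Stop TAG at 4345, protein length 1406 aa.
print(orf_codon_count(127, 4345))
```

A mismatch between this count and the stated protein length would suggest a transcription error in one of the coordinates.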





Further analysis of the NOV62a protein yielded the following properties shown in 
Table 62B. 

Table 62B. Protein Sequence Properties NOV62a


PSort analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen)


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV62a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 62C.





Table 62C. Geneseq Results for NOV62a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV62a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
ABG15880 | Novel human diagnostic protein #15871 - Homo sapiens, 1136 aa. [WO200175067-A2, 11-OCT-2001] | 161..1008 / 239..1056 | 621/850 (73%) / 695/850 (81%) | 0.0
ABG14341 | Novel human diagnostic protein #14332 - Homo sapiens, 1136 aa. [WO200175067-A2, 11-OCT-2001] | 161..1008 / 239..1056 | 621/850 (73%) / 695/850 (81%) | 0.0
ABG15880 | Novel human diagnostic protein #15871 - Homo sapiens, 1136 aa. [WO200175067-A2, 11-OCT-2001] | 161..1008 / 239..1056 | 621/850 (73%) / 695/850 (81%) | 0.0
ABG14341 | Novel human diagnostic protein #14332 - Homo sapiens, 1136 aa. [WO200175067-A2, 11-OCT-2001] | 161..1008 / 239..1056 | 621/850 (73%) / 695/850 (81%) | 0.0
ABB69600 | Drosophila melanogaster polypeptide SEQ ID NO 35592 - Drosophila melanogaster, 1751 aa. [WO200171042-A2, 27-SEP-2001] | 16..954 / 33..991 | 619/966 (64%) / 730/966 (75%) | 0.0



In a BLAST search of public sequence databases, the NOV62a protein was found to
have homology to the proteins shown in the BLASTP data in Table 62D.

Table 62D. Public BLASTP Results for NOV62a

Protein Accession Number | Protein/Organism/Length | NOV62a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
 | KIAA0377 PROTEIN - Homo sapiens (Human), 1406 aa. | 1..1406 | 1406/1406 (100%) / 1406/1406 (100%) | 0.0
AAH24591 | SIMILAR TO KIAA0433 PROTEIN - Homo sapiens (Human), 1222 aa. | 11..997 / 2..986 | 743/989 (75%) / 835/989 (84%) | 0.0
O43314 | KIAA0433 PROTEIN - Homo sapiens (Human), 1243 aa. | 11..997 / 2..986 | 743/989 (75%) / 835/989 (84%) | 0.0
Q9VR59 | CG14616 PROTEIN - Drosophila melanogaster (Fruit fly), 1751 aa. | 16..954 / 33..991 | 619/966 (64%) / 730/966 (75%) | 0.0
T25770 | hypothetical protein F46F11.1 - Caenorhabditis elegans, 1224 aa. | 54..909 / 16..917 | 491/910 (53%) / 627/910 (67%) | 0.0



PFam analysis predicts that the NOV62a protein contains the domains shown in 
the Table 62E. 



Table 62E. Domain Analysis of NOV62a

Pfam Domain | NOV62a Match Region | Identities/Similarities for the Matched Region | Expect Value
acid_phosphat | 488..947 | 78/531 (15%) / 290/531 (55%) | 0.4
zf-NF-X1 | 1269..1292 | 8/27 (30%) / 20/27 (74%) | 0.75



Example 63. 



The NOV63 clone was analyzed, and the nucleotide and encoded polypeptide
sequences are shown in Table 63A.

Table 63A. NOV63 Sequence Analysis

SEQ ID NO: 183 | 885 bp
NOV63a, CG98131-01 DNA Sequence


CGTTCCCTCTCCTCTCACCCACACCCCTGGCCATGGCTAACTACTATGAAGTGCTGGG 

CGTTGGCACCCCGACAAGAACCCTGACAATAAGGAGGAGGCGGAGAAGAAGTTCAAGC 
TGGTGTCTGAGGCCTATGAGGTTCTGTCTGACTCCAAGAAACGCTCCCTGTATGACCG 

TTCGACACCGGCTACACCTTCCGTAACCCTGAGGACATCTTCCGGGAGTTTTTTGGTG 
GCCTGGACCCTTTCTCCTTTGAGTTCTGGGACAGCCCATTCAATAGTGACCGTGGTGG 
CCGGGGCCATGGCCTGAGGGGGGCCTTCTCGGCAGGCTTTGGAGAATTTCCGGCCTTC 
ATGGAGGCCTTCTCATCCTTCAACATGCTGGGCTGCAGCGGGGGCAGCCACACCACCT 
TCTCATCCACCTCCTTCGGGGGCTCCAGTTCTGGCAGCTCGGGGTTCAAGTCGGTGAT 
GTCATCCACCGAGATGATCAATGGCCACAAGGTCACCACCAAGCGCATCGTGGAGAAC 
GGGCAGGAGCGCGTGGAGGTGGAGGAAGACGGGCAGCTCAAGTCGGTGACTGTGAACG 
GCAAGGAGCAGCTCAAATGGATGGACAGCAAGTAGGCGCTGGCCACCCGGCCCTGCCT 
TCCCACCACCACCACCGTGCATGGGGCAGCAAACACGTGGGGCCGCAGACATAGCCTG 
ATGGTTAATAAATGT 




ORF Start: ATG at 91


ORF Stop: TAG at 787 




SEQ ID NO: 184 | 232 aa | MW at 25686.1kD
NOV63a, CG98131-01 Protein Sequence


MANYYEVLGVQASASPEDIKKAYRKLALRWHPDKNPDNKEEAEKKFKLVSEAYEVLSD 
SKKRSLYDPJU3CDSWRAGGGASTPYHSPFDTGYTFRNPEDIFREFFGGLDPFSFEFWD 
SPFNSDRGGRGHGLRGAFSAGFGEFPAFMEAFSSFNMLGCSGGSHTTFSSTSFGGSSS
GSSGFKSVMSSTEMINGHKVTTKRIVENGQERVEVEEDGQLKSVTVNGKEQLKWMDSK 



Further analysis of the NOV63a protein yielded the following properties shown in 
Table 63B. 



Table 63B. Protein Sequence Properties NOV63a 


PSort analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV63a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 63C.

Table 63C. Geneseq Results for NOV63a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV63a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
AAU17599 | Novel signal transduction pathway protein, Seq ID 1164 - Homo sapiens, 267 aa. [WO200154733-A1, 02-AUG- | 1..232 / 36..267 | 231/232 (99%) / 231/232 (99%) | e-136
AAM95431 | Human reproductive system related antigen SEQ ID NO: 4089 - Homo sapiens, 267 aa. [WO200155320-A2, 02-AUG-2001] | 1..232 / 36..267 | 231/232 (99%) / 231/232 (99%) | e-136
AAW94066 | Human DnaJ-like protein, HSPJ2 - Homo sapiens, 330 aa. [WO9855509-A2, 10-DEC-1998] | 1..222 / 1..231 | 147/233 (63%) / 179/233 (76%) | 4e-78
AAY74126 | Human prostate tumor EST fragment derived protein U3 13 - Homo sapiens, 317 aa. [DE19820190-A1, 04-NOV-1999] | 1..222 / 2..232 | 147/233 (63%) / 179/233 (76%) | 4e-78
AAM40013 | Human polypeptide SEQ ID NO 3158 - Homo sapiens, 309 aa. [WO200153312-A1, 26-JUL-2001] | 1..221 / 1..225 | 106/232 (45%) / 149/232 (63%) | 6e-48



In a BLAST search of public sequence databases, the NOV63a protein was found to
have homology to the proteins shown in the BLASTP data in Table 63D.

Table 63D. Public BLASTP Results for NOV63a

Protein Accession Number | Protein/Organism/Length | NOV63a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q9QY17 | DnaJ homolog subfamily B member 8 (mDJ6) - Mus musculus (Mouse), 227 aa. | 1..230 | 190/230 (82%) / 203/230 (87%) | e-107
O75190 | DnaJ homolog subfamily B member 6 (Heat shock protein J2) (HSJ-2) (MSJ-1) (HHDJ1) (MRJ) - Homo sapiens (Human), 326 aa. | 1..222 / 1..231 | 147/233 (63%) / 179/233 (76%) | 9e-78
O54946 | DnaJ homolog subfamily B member 6 (Heat shock protein J2) (HSJ-2) (MRJ) (mDj4) - Mus musculus (Mouse), 242 aa. | 1..232 / 1..242 | 151/246 (61%) / 185/246 (74%) | 3e-77
Q8WN90 | MOLECULAR CHAPERONE MRJ - Bos taurus (Bovine), 242 aa. | 1..232 / 1..242 | 148/246 (60%) / 180/246 (73%) | 4e-77
O35723 | DnaJ homolog subfamily B member 3 (DnaJ protein homolog 3) (Heat shock J3 protein) (HSJ-3) (MSJ-1) - Mus musculus (Mouse), 242 aa. | 1..226 / 1..236 | 133/238 (55%) / 169/238 (70%) | 2e-65



PFam analysis predicts that the NOV63a protein contains the domains shown in 
the Table 63E. 



Table 63E. Domain Analysis of NOV63a

Pfam Domain | NOV63a Match Region | Identities/Similarities for the Matched Region | Expect Value
DnaJ | 3..69 | 45/79 (57%) / 62/79 (78%) | 4.7e-40



Example 64. 



The NOV64 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 64A. 

Table 64A. NOV64 Sequence Analysis

SEQ ID NO: 185 | 2711 bp
NOV64a, CG98164-01 DNA Sequence

CCTAGGCAACCCCGCTAAACAAGATGGCGACCTCCGAGAGGGCTCTCCTGAGGACCAG


TTTCGCGCAGAAAAAGAGCGTCAGCCTTGCTGGTCTTTTCCCATGGGGCAGAAGACGA 
AAGGCAGCTCTAACATAGCCTCCTCCTACCTGCTCCAGCAGCTCATGCACCGCTATCA 
GGAGCTGGACTCGGACGGAGATGAGGACCACGGCGAGGGCGAGGCGGGATCCGAGGAG 
TCCTCAGAGTCCGAAATGCTGAATTTGGAGGAGGAATTTGATGGGGTCCTGAGAGAGG 
AGGCTGTGGCCAAAGCACTCCATCACTTGGGGCGCTCAGGCTCTGGGACTGAGCAAGT 
CTACCTCAATCTAACTTTATCAGGTTGTAATTTAATTGATGTTAGCATTCTCTGTGGA 
TATGTTCATCTACAGAAGTTGGATCTTTCAGCGAATAAAATTGAAGATTTATCTTGTG 
TGAGTTGTATGCCTTATCTCCTAGAACTTAATGCTTCTCAAAATAATTTGACTACGTT 
CTTCAATTTCAAGCCACCCAAAAACCTCAAGAAGGCGGATTTTTCCCACAACCAAATT 
TCTGAAATTTGTGATTTGTCAGCGTATCATGCTCTCACTAAACTAATTTTGGATGGCA 
ATGAGATAGAAGAAATCAGTGGACTAGAGATGTGCAACAACCTAATTCACCTTAGTTT 
GGCCAACAATAAGATCACGACAATTAATGGCTTAAACAAGTTACCAATCAAAATACTT 
TGTCTGAGCAATAACCAGATTGAGATGATCACAGGTTTGGAGGATCTGAAAGCCCTGC 
AGAACCTGGATCTGTCCCACAATCAGATAAGCAGCCTCCAAGGCTTAGAGAATCATGA 
CCTCCTGGAAGTGATCAACCTGGAGGATAATAAGATTGCTGAGCTGAGAGAAATAGAA 
TACATTAAAAATTTACCCATCCTTCGAGTTCTCAATCTTCTAGAAAATCCAATTCAGG 






AAAAGTCTGAATATTGGTTCTTCGTAATTTTTATGCTTCTGCGATTAACAGAATTAGA 
TCAGAAGAAGATTAAAGTGGAAGAAAAGGTTTCAGCAGTGAATAAATATGATCCTCCC 
CCTGAAGTGGTTGCAGCTCAAGACCACCTGACCCATGTTGTCAACAGCGTGATGCAGC 
CGCAGAGGATCTTTGACAGCACTCTTCCCAGCCTGGATGCCCCTTATCCCATGCTGAT 
ACTAGCTGGTCCTGAAGCTTGTGGGAAACGAGAGCTTGCCCATCGCCTCTGCAGACAG 
TTTAGCACTTACTTCAGATATGGGGCCTGTCATACCACAAGACCACCTTACTTTGGAG 
AAGGGGATOGAGTTGATTATCATTTTATCTCTCAAGACGTTTTTGAl'GAAATGGTGAA 
CaTGGGGAAATTCATTCTAACATTTAGTTATGGTAATCACAAGTATGGATTAAATAGG 
GACACCGTAGAAGGTATCGCAAGAGATGGTTTGGCAAGCTGTATTCATATGGAAATAG 
AAGGTGTAAGAAGTTTGAAATATTCCTATTTTGAGCCTCGTTATATCCTGGTGGTGCC 




GAAATTGAATTTGCTGTCTCCAGAGTGGACCTTTATATTAAAATTAAXCAGAATTTTC 
CGGGATATTTTGATGAGGTAATCAATGCAGATGATCTGGATGTTGCCTACCAAAAACT 
GAGTCAGCTCATTAGAGAATACCTTGGATTGACTGAGGAACCTGCCAAGAGTTTGGCT 
ACAACTGCAGATGTTAAGACATCCCACCTGAAACCTGAAGCGCATCCTACAAAGTATA 
TTTCTTCGAATATGGGTGATTTCCTGCATTCTACAGACAGAAACTACCTCATTAAATT 
TTGGGCCAAACTTTCAGCCAAAAAAACACCAGCGGAAAGAGATTCTATACACAGACAG 
CATGAGGCAGCCCGGCAAGCTCTAATGGGAAGGATACGCCCTGATCACACACTCCTAT 
TTCAAAGGGGTCCAGTACCAGCACCTCTCACCAGTGGTCTACACTATTATACAACTTT 
AGAAGAACTCTGGAAAAGTTTTGATCTTTGTGAAGACTATTTTAAACCTCCATTTGGA 
CCATATCCTGAAAAGAGTGGGAAGGATTCCTTGGTTTCCATGAAATGTTCATTGTTTC 
GGTTCTGTCCGTGGTCAAAAGAATTGCCTTTCCAGCCTCCGGAGGGGAGCATTTCTTC 
ACACCTAGGATCAGGAGCCAGTGACAGTGAGACCGAAGAGACCCGGAAAGCACTACCT 
ATACAATCATTTTCACATGAAAAAGAGTCTCACCAACACAGACAACACTCGGTCCCAG 
TCATCAGTCGCCCAGGTTCCAACGTCAAACCCACCCTCCCTCCAATCCCTCAGGGCCG 
CAGGTAGACTAGCACTTGATGTCTGATCCTAACATGGAAAACCTGCTCTGCTGATGTC 
GAATTCCTTGCCTTACCTGGCCATGGGTCCAGCTGTTTCTCACTCAACCCATTACCCA 
CGGAAGAATGTTCTACCTGCCTTAATTCTATCAGCCAGTTTCTCTTGTGATTCTTTGG 
CTGGTGTCTTTTAGTTTTTTAATTAAAAAATTGTTTCTTAAAA 




ORF Start: ATG at 24 


ORF Stop: TAG at 2499 




SEQ ID NO: 186 | 825 aa | MW at 93626.3kD
NOV64a, CG98164-01 Protein Sequence


MATSERALLRTRAASLLRGLGRSRTGARSLOFRAEKERQPCWSFPMGQKTKGSSNIAS 
SYLLQQLMHRYQELDSDGDEDHGEGEAGEEESSESEMLNLEEEFDGVLREEAVAKALH 
HLGRSGSGTEQVYLOTjTLSGCNLIDVSILCGYVHLQKLDLSANKIEDLSCVSCMPYLL 
ELNASQNN1TTFFNFKPPKNLKKADFSHNQISEICDLSAYHALTKLILDGNEIEEISG 
LEMOONLIHLSLANNKITTINGLNKLPIKILCLSKNQIEMITGLEDLKALQNLDLSHN 
QISSLQGLENHDLLEVINLEDNKIAELREIEYIKNLPILRVLNLLENPIQEKSEYWFF 
VIFMLLRLTELDQKKIKVEEKVSAVNKYDPPPEWAAQDHLTHWNSVMQPQRIFDST 
LPSLDAPYPMLILAGPEACGKRELAHRLCRQFSTYFRYGACHTTRPPYFGEGDRVDYH 
FISQDVFDE1WNMGKFILTFSYGNHKYGLNRDTVEGIARDGLASCIHMEIEGVRSLKY 
SYFEPRYILWPMKKEKYEGYLRRKGLFSRAEIEFAVSRVDLYIKINQNFPGYFDEVI 
NADDLDVAYQKLSQLIREYLGLTEEPAKSLATTADVKTSHLKPEAHPTKYISSNMGDF 
LHSTDRNYLIKFWAKLSAKKTPAERDSIHRQHEAARQALMGRIRPDHTLLFQRGPVPA 
PLTSGLHYYTTLEELWKSFDLCEDYFKPPFGPYPEKSGKDSLVSMKCSLFRFCPWSKE 
LPFQPPEGSISSHLGSGASDSETEETRKALPIQSFSHEKESHQHRQHSVPVISRPGSN 
VKPTLPPIPQGRR 

SEQ ID NO: 187 | 2550 bp
NOV64b, CG98164-02 DNA Sequence


GCTTGGGCAGATCCCGAACTGGAGCCCGATCGTTACAGTTTCGCGCAGAAAAAGAGCG 
TCAGCCTTGCTGGTCTTTTCCCATGGGGCAGAAGACGAAAGGCAGCTCTAACATAGCC 
TCCTCCTACCTGCTCCAGCAGCTCATGCACCGCTATCAGGAGCTGGACTCGGACGGAG 
ATGAGGACCAGGGCGAGGGCGAGGCGGGATCCGAGGAGTCCTCAGAGTCCGAAATGCT 
GAATTTGGAGGAGGAATTTGATGGGGTCCTGAGAGAGGAGGCTGTGGCCAAAGCACTC 
CATCACTTGGGGCGCTCAGGCTCTGGGACTGAGCAAGTCTACCTCAATCTAACTTTAT 
CAGGTTGTAATTTAATTGATGTTAGCATTCTCTGTGGATATGTTCATCTACAGAAGTT 
GGATCTTTCAGCGAATAAAATTGAAGATTTATCTTGTGTGAGTTGTATGCCTTATCTC 
CTAGAACTTAATGCTTCTCAAAATAATTTGACTACGTTCTTCAATTTCAAGCCACCCA 
AAAACCTCAAGAAGGCGGATTTTTCCCACAACCAAATTTCTGAAATTTGTGATTTGTC 
AGCGTATCATGCTCTCACTAAACTAATTTTGGATGGCAATGAGATAGAAGAAATCAGT 
GGACTAGAGATGTGCAACAACCTAATTCACCTTAGTTTGGCCAACAATAAGATCACGA 
CAATTAATGGCTTAAACAAGTTACCAATCAAAATACTTTGTCTGAGCAATAACCAGAT 
TGAGATGATCACAGGTTTGGAGGATCTGAAAGCCCTGCAGAACCTGGATCTGTCCCAC 
AATCAGATAAGCAGCCTCCAAGGCTTAGAGAATCATGACCTCCTGGAAGTGATCAACC 
TGGAGGATAATAAGATTGCTGAGCTGAGAGAAATAGAATACATTAAAAATTTACCCAT 
CCTTCGAGTTCTCAATCTTCTAGAAAATCCAATTCAGGAAAAGTCTGAATATTGGTTC 
TTCGTAATTTTTATGCTTCTGCGATTAACAGAATTAGATCAGAAGAAGATTAAAGTGG 
AAGAAAAGGTTTCAGCAGTGAATAAATATGATCCTCCCCCTGAAGTGGTTGCAGCTCA 
AGACCACCTGACCCATGTTGTCAACAGCGTGATGCAGCCGCAGAGGATCTTTGACAGC 
ACTCTTCCCAGCCTGGATGCTCCTTATCCCATGCTGATACTAGCTGGTCCTGAAGCTT 
GTGGGAAACGAGAGCTTGCCCATCGCCTCTGCAGACAGTTTAGCACTTACTTCAGATA 
TGGGGCCTGTCATACCACAAGACCACCTTACTTTGGAGAAGGGGATCGAGTTGATTAT 
CATTTTATCTCTCAAGACGTTTTTGATGAAATGGTGAACATGGGGAAATTCATTCTAA 
CATTTAGTTATGGTAATCACAAGTATGGATTAAATAGGGACACCGTAGAAGGTATCGC 
AAGAGATGGTTTGGCAAGCTGTATTCATATGGAAATAGAAGGTGTAAGAAGTTTGAAA 
TATTCCTATTTTGAGCCTCGTTATATCCTGGTGGTGCCCATGAACAAGGAAAAATATG 
AGGGATATTTGCGGAGAAAAGGATTATTCAGTCGTGCAGAAATTGAATTTGCTGTCTC 
CAGAGTGGACCTTTATATTAAAATTAATCAGAATTTTCCGGGATATTTTGATGAGGTA 
ATCAATGCAGATGATCTGGATGTTGCCTACCAAAAACTGAGTCAGCTCATTAGAGAAT 
ACCTTGGATTGACTGAGGAACCTGCCAAGAGTTTGGCTACAACTGCAGATGTTAAGAC 
ATCCCACCTGAAACCTGAAGCGCATCCTACAAAGTATATTTCTTCGAATATGGGTGAT 
TTCCTGCATTCTACAGACATAAACTACCTCATTAAATTTTGGGCCAAACTTTCAGCCA 
AAAAAACACCAGCGGAAAGAGATTCTATACACAGACAGCACGAGGCAGCCCGGCAAGC 
TCTAATGGGAAGGATACGCCCTGATCACACACTCCTATTTCAAAGGGGTCCAGTACCA 
GCACCTCTCACCAGTGGTCTACACTATTATACAACTTTAGAAGAACTCTGGAAAAGTT 
TTGATCTTTGTGAAGACTATTTTAAACCTCCATTTGGACCATATCCTGAAAAGAGTGG 
GAAGGATTCCTTGGTTTCCATGAAATGTTCATTGTTTCGGTTCTGTCCGTGGTCAAAA 
GAATTGCCTTTCCAGCCTCCGGAGGGGAGCATTTCTTCACACCTAGGATCAGGAGCCA 
GTGACAGTGAGACCGAAGAGACCCGGAAAGCACTACCTATACAATCATTTTCACATGA 
AAAAGAGTCTCACCAACACAGACAACACTCGGTCCCAGTCATCAGTCGCCCAGGTTCC 
AArGTrnAArrCACCCTCCCTCCAATCCCTCGGGGCCGCAGGTAGACTAGCACTTGAT 




ORF Start: ATG at 4 


ORF Stop: TAG at 2479 




SEQ ID NO: 188 | 825 aa | MW at 93602.3kD
NOV64b, CG98164-02 Protein Sequence


MATSERALLRTRAASLLRGLGRSRTGARSLQFRAEKERQPCWSFPMGQKTKGSSNIAS 
SYLLQQLMHRYQELDSDGDEDQGEGEAGSEESSESEMLNLEEEFDGVLREEAVAKALH 
HLGRSGSGTEQVYLNLTLSGCNLIDVSILCGYVHLQKLDLSANKIEDLSCVSCMPYLL 
ELNASQNNLTTFFNFKPPKNLKKADFSHNQIEEICDLSAYHALTKLILDGNEIEEISG 
LEMOWLIHLSLANHKITTINGLNKLPIKILCLSNNQIEMITGLEDLKALQNLDLSHN 
QISSLQGLENHDLLEVINLEDNKIAELREIEYIKNLPILRVLNLLENPIQEKSEYWFF 
VIFMLLRLTELDQKKIKVEEKVSAVNKYDPPPEWAAQDHLTHVWSVMQPQRIFDST 
LPSLDAPYPMLILAGPEACGKRELAHRLCRQFSTYFRYGACHTTRPPYFGEGDRVDYH 
FISQDVFDEMVNMGKFILTFSYGNHKYGLNRDTVEGIARDGLASCIHMEIEGVRSLKY 
SYFEPRYILWPMNKEKYEGYLRRKGLFSRAEIEFAVSRVDLYIKINQNFPGYFDEVI 
NADDLDVAYQKLSQLIREYLGLTEEPAKSLATTADVKTSHLKPEAHPTKYISSNMGDF 
LHSTDINYLIKFWAKLSAKKTPAERDSIHRQHEAARQALMGRIRPDHTLLFQRGPVPA 
PLTSGLHYYTTLEELWKSFDLCEDYFKPPFGPYPEKSGKDSLVSMKCSLFRFCPWSKE 
LPFQPPEGSISSHLGSGASDSETEETRKALPIQSFSHEKESHQHRQHSVPVISRPGSN 
VKPTLPPIPRGRR 



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 64B. 

Table 64B. Comparison of NOV64a against NOV64b.

Protein Sequence | NOV64a Residues/Match Residues | Identities/Similarities for the Matched Region
NOV64b | 31..825 / 31..825 | 750/795 (94%) / 751/795 (94%)
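
Residue ranges in these tables are written as start..end, and the number of residues a range covers can be compared with the aligned length in the identities column. In Table 64B, for instance, the range 31 to 825 covers 795 residues, matching the denominator of 750/795. The sketch below parses a range and returns its length; the function names are hypothetical, and stray spaces inside a range are tolerated.

```python
def parse_range(text: str) -> tuple[int, int]:
    """Parse a residue range such as '31..825' into (start, end)."""
    start, end = text.replace(" ", "").split("..")
    return int(start), int(end)


def span_length(text: str) -> int:
    """Number of residues covered by an inclusive start..end range."""
    start, end = parse_range(text)
    if end < start:
        raise ValueError("range end precedes range start")
    return end - start + 1


# NOV64a against NOV64b: residues 31..825, identities 750/795.
print(span_length("31..825"))
```

The covered span can be shorter than the aligned length when the alignment contains gaps, so only a span longer than the aligned length indicates an inconsistency.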



Further analysis of the NOV64a protein yielded the following properties shown in 
Table 64C. 



Table 64C. Protein Sequence Properties NOV64a 


PSort analysis: 


0.9074 probability located in mitochondrial matrix space; 0.6000 probability 
located in mitochondrial inner membrane; 0.6000 probability located in 
mitochondrial intermembrane space; 0.6000 probability located in mitochondrial 
outer membrane 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV64a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 64D.

Table 64D. Geneseq Results for NOV64a

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV64a Residues/Match Residues | Identities/Similarities for the Matched Region | Expect Value
ABG07962 | Novel human diagnostic protein #7953 - Homo sapiens, 1175 aa. [WO200175067-A2, 11-OCT-2001] | 235..614 / 1..389 | 347/389 (89%) / 354/389 (90%) | 0.0
ABG07962 | Novel human diagnostic protein #7953 - Homo sapiens, 1175 aa. [WO200175067-A2, 11-OCT-2001] | 235..614 / 1..389 | 347/389 (89%) / 354/389 (90%) | 0.0
AAM05508 | Peptide #4190 encoded by probe for measuring breast gene expression - Homo sapiens, 61 aa. [WO200157270-A2, 09-AUG-2001] | 31..91 / 1..61 | 60/61 (98%) / 60/61 (98%) | 2e-28
AAM30367 | Peptide #4404 encoded by probe for measuring placental gene expression - Homo sapiens, 61 aa. [WO200157272-A2, 09-AUG-2001] | 31..91 / 1..61 | 60/61 (98%) / 60/61 (98%) | 2e-28
AAM17861 | Peptide #4295 encoded by probe for measuring cervical gene expression - Homo sapiens, 61 aa. [WO200157278-A2, 09-AUG-2001] | 31..91 / 1..61 | 60/61 (98%) / 60/61 (98%) | 2e-28



In a BLAST search of public sequence databases, the NOV64a protein was found to
have homology to the proteins shown in the BLASTP data in Table 64E.

Table 64E. Public BLASTP Results for NOV64a

Protein Accession Number | Protein/Organism/Length | NOV64a Residues/Match Residues | Identities/Similarities for the Matched Portion | Expect Value
Q96M69 | CDNA FLJ32786 FIS, CLONE TESTI2002256, WEAKLY SIMILAR TO GUANYLATE KINASE (EC 2.7.4.8) - Homo sapiens (Human), 825 aa. | 1..825 / 1..825 | 824/825 (99%) / 824/825 (99%) | 0.0
Q9D5S7 | 4921528H16RIK PROTEIN - Mus musculus (Mouse), 820 aa. | 1..825 / 1..820 | 608/827 (73%) / 680/827 (81%) | 0.0
BAB85845 | AXONEMAL LEUCINE-RICH REPEAT PROTEIN - Ciona intestinalis, 325 aa. | 92..377 / 8..305 | 97/300 (32%) / 160/300 (53%) | 6e-31
O35125 | CHROMOSOME 6 BAC-284H12 (RESEARCH GENETICS MOUSE BAC LIBRARY) COMPLETE SEQUENCE - Mus musculus (Mouse), 340 aa. | 71..372 / 9..312 | 96/306 (31%) / 162/306 (52%) | 2e-30
Q99620 | LEUCINE RICH PROTEIN - Homo sapiens (Human), 312 aa. | 73..318 / 16..261 | 81/249 (32%) / 130/249 (51%) | 6e-23



PFam analysis predicts that the NOV64a protein contains the domains shown in 
the Table 64F. 



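Table 64F. Domain Analysis of NOV64a

Pfam Domain | NOV64a Match Region | Identities/Similarities for the Matched Region | Expect Value
LRR | 235..259 | 8/25 (32%) / 17/25 (68%) | 0.21
LRR | 281..302 | 10/25 (40%) / 16/25 (64%) | 0.69
Guanylate_kin | 450..554 | 28/108 (26%) / 66/108 (61%) | 1.7e-07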


LRR 


281. .302 


10/25(40%) 
16/25(64%) 


0.69 


Guanylate_kin 


450..554 


28/108(26%) 
66/108(61%) 


1.7e-07 



Example 65. 



The NOV65 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 65A. 

Table 65A. NOV65 Sequence Analysis




SEQ ID NO: 189 


723 bp 


NOV65a, 
CG99588-01 DNA Sequence 


AGGCCACCTGCGTGTATTTTGTGCTCTGGCTGCCCTCATCTAGCCCATCGTGGGTCAG 
CACCCTCATCAAGTGCCTGCCTATCTTCTGCCTCTGGCTCTTCCTTCTGGCCCATGGC 

CTGCTGTAGGTGACGCCTTCCTCATCTGGCAGGACCAAGGATACTTCGTGCATGGTCT 
GCTGATGTTTGCTGTGACCCACATGTTCTACGCCTCGGCCTTTGGCATGCAGCCACTG 
GCTCTTCGGACAGGTCTGGTGATGGCAGCGCTGTCGGGCCTGTGCTATGCCCTCCTCT 
ACCCATGCCTCTCAGGTGCCTTCACCTACCTGGTGGGGGTCTATGTGGCCCTTATCGG 
CTTCATGGGCTGGCGAGCTATGGCAGGGCTGCGGCTGGCCGGGGCAGACTGGCGCTGG 
ACAGAGCTGGCAGCTGGCAGTGGTGCACTCTTCTTTATCATCTCAGACCTGACCATCG 
CCCTCAACAAATTCTGTTTTCCTGTGCCCTACTCTCGGGCGCTTATCATGTCCACCTA 
CTATGCGGCCCAGATGCTCGTCGCCTTGTCAGCTGTCGAAAGCCGGGAGCCTGTGGAA 
CACTACAGACTGACCAAGGCCAACTGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 721 




SEQ ID NO: 190 | 240 aa | MW at 26370.1kD
NOV65a, CG99588-01 Protein Sequence


MVSPVTWKSEGPKLVPFFKATCVYFVLWLPSSSPSWVSTLIKCLPIFCLWLFLLAHG 
LGFLLAHPSATRIFVGLVFSAVGDAFLIWQDQGYFVHGLLMFAVTHMFYASAFGMQPL 
ALRTGLVMAALSGLCYALLYPCLSGAFTYLVGVYVALIGFMGWRAI4AGLRLAGADWRW 
TELAAGSGALFFIISDLTIALNKFCFPVPYSRALIMSTYYAAQMLVALSAVESREPVE 
HYRLTKAN 



Further analysis of the NOV65a protein yielded the following properties shown in 
Table 65B. 



Table 65B. Protein Sequence Properties NOV65a 


PSort analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in Golgi 
body; 0.3000 probability located in endoplasmic reticulum (membrane); 0.0300 
probability located in mitochondrial inner membrane 


SignalP analysis: 


Cleavage site between residues 65 and 66 



A search of the NOV65a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 65C.
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Table 65C. Geneseq Results for NOV65a 


Identifier 


Date] 


NOV65a 

Match 
Residues 


Identities/ 

the Matched 
Region 


Value 


ABG 18549 


Novel human diagnostic protein #18540 
[WO200175067-A2, ll-OCT-2001] 


1..240 


240/240(100%) 
240/240 (100%) 


e-139 


ABG 18549 


Novel human diagnostic protein #18540 
[WO200175067-A2, ll-OCT-2001] 


1..240 
1 1 1..350 


240/240(100%) 
240/240 (100%) 


e-139 


AAM93259 


Human polypeptide, SEQ ID NO: 2709 
- Homo sapiens, 240 aa. [EP1130094-A2,
05-SEP-2001]


1..240 
1..240 


239/240 (99%) 
239/240 (99%) 


e-138 


ABB59182 


Drosophila melanogaster polypeptide
SEQ ID NO 4338 - Drosophila
melanogaster, 254 aa. [WO200171042-A2,
27-SEP-2001]


9..226 
9..240 


84/238(35%) 
118/238 (49%) 


2e-26 


AAY11618 


Human 5' EST secreted protein SEQ ID 
NO:270 - Homo sapiens, 100 aa. 
[WO9906439-A2, 11-FEB-1999]


15..87
25..95


34/76 (44%) 
47/76 (61%) 


6e-10 



In a BLAST search of public sequence databases, the NOV65a protein was found to
have homology to the proteins shown in the BLASTP data in Table 65D. 



Table 65D. Public BLASTP Results for NOV65a 


Protein 
Accession 
Number 


Protein/Organism/Length


NOV65a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect
Value


Q96AJ0 


SIMILAR TO RIKEN CDNA 
1810054013 GENE - Homo sapiens 
(Human), 224 aa (fragment). 


17..240 
1..224 


220/224 (98%) 
220/224 (98%) 


e-126 


Q9D8N3 


1810054O13RIK PROTEIN - Mus
musculus (Mouse), 241 aa. 


1..240 
1..241 


214/241 (88%) 
224/241 (92%) 


e-122 


Q9VAJ0 


CG7582 PROTEIN - Drosophila 
melanogaster (Fruit fly), 254 aa. 


9..226 
9..240 


84/238 (35%) 
118/238(49%) 


5e-26 


Q9KLP0 


HYPOTHETICAL PROTEIN 
VCA0703 - Vibrio cholerae, 229 aa. 


21..225
31..228


68/215(31%) 
105/215 (48%) 


1e-12


Q8YDD4 


HYPOTHETICAL PROTEIN 
BMEII0243 - Brucella melitensis, 184 
aa. 


66..223 
15..176 


53/168 (31%) 
79/168(46%) 


3e-11
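The Identities/Similarities cells in the tables above pair a ratio with a percentage, for example 118/238 (49%). A minimal sketch of how such a cell might be parsed and re-derived is shown below; the function names are hypothetical, and the use of integer truncation is only an observation from rows such as 118/238 (49%) and 214/241 (88%), not a statement about how the search tool computes the figure.

```python
import re

# Matches cells such as "84/238 (35%)" or "118/238(49%)".
_CELL = re.compile(r"\s*(\d+)\s*/\s*(\d+)\s*\(\s*(\d+)%\)\s*")


def parse_ratio(cell: str) -> tuple[int, int, int]:
    """Return (matches, aligned_length, reported_percent) from a table cell."""
    m = _CELL.fullmatch(cell)
    if m is None:
        raise ValueError(f"unrecognised cell: {cell!r}")
    return int(m.group(1)), int(m.group(2)), int(m.group(3))


def truncated_percent(matches: int, aligned_length: int) -> int:
    """Percentage with the fractional part dropped (assumed convention)."""
    return matches * 100 // aligned_length


matches, length, reported = parse_ratio("118/238(49%)")
print(truncated_percent(matches, length) == reported)
```

Rounding to the nearest integer would give 50% for 118/238, so a check of this kind can help distinguish a transcription error from a convention difference.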
PFam analysis predicts that the NOV65a protein contains the domains shown in
Table 65E.



Table 65E. Domain Analysis of NOV65a 



NOV65a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Example 66. 

The NOV66 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 66A. 



Table 66A. NOV66 Sequence Analysis



NOV66a, 
CG99618-01 DNA Sequence



SEQ ID NO: 191 



1773 bp 



GGAGGAACATGACATCATGGAAATGGTTTCACCCAAATATCACTGGTGTGGAGGCAGA
AAACCTACTGTTGACAAGAGGAGTTGATGGCAGTTTTTCGGCAAGGCCTAGTAAAAGT 
AACCCTGGAGACGTCACACTTTCTGTTAGAAGAAATGGAGCTGTCACCCACACCAAGA 
TTCAGAACACTGGTGATTGCTATGACCTGTATGGAGGGGAGAAGTTTGCCACTTTGGC 
TGAGTTGGTCCAGTATTACATGGAACATCATGGGCAATTAAAAGAGAAGAATGGAGAT 
GTTATTGAGCTTAAAAATCCTCTGAACTGTGCAGATCCTACTTCTCAAAGGTGGTTTC 
ATGGACACCTCTCTGGAAAAGAAGCAGAGAAATTGTTAACTGAAAAAGGAAAGCATAG 
TAGCTTTCTTGTACGAGAGAGCCAGAGCCACCCTGATTTTGTTCTCTCCGTGTGCACC 
GGTGATGACAAAGGAGAGAGCAATGACGGCAAGTCTAAAGTGACTCATGTCATGATTC 
ACTGTCAGGAACTGAAATACGATGTTGGTGGAGGAGAACGGTTTGATGCTTTGACAGA 
TCTTAAGAAGAATCCTATGGTGGAAACACTGGGTACAGTACTACAACTCAAGCAGCCC 
CTTAACACGACTTGTACAAATGCTGCTGAAATAGAAAGCAGAGTTCGAGAACTAAGCA 
AATTAGCTGAGACCACAGATAAAGTCAGACAAGGCTTTTGGGAAGAATTTGAGACACT 
GCAACAACAGGAGTGCAAACTTCTCTACAGCCAAAAAGAGGGTCGAAGGCAAGAAAAC 
AAAACCAAAAATAGATATAAAAACATCCTGCCCTTTGATCATAGGGTTGTCCTACACA 
ATGGTGATCCCAGTGAGGCTGTTTCAGATTACATCAACGCAGATATCATCATGCCTGA 
ATTTGAACCAAGTGCACAATTTGTAGCCAAAAAAAGCTACTTGGGAACCAGCCTGATT 
ACTGGGTTATTTCTTAATGAGTTCTGGCACATGGTTCTCATCTTTTTAAGAGGGAAAA 
GTGTAACTCGGCAGCACCAAGTGGAGAGAGGAAAGAGTAAATGTGTCAAATACTGGCC 
CGATGAGTATGCTCTAAAAGAATATGGCATCGTGCATGTTAGGAACGTCAAAGAAAGC
ACCACTCATGACTATAAGCTAAGAAAACTTAAACTTTCAAAGATTGGACAAGGGAATA 
TGGAGAGAATGGTCTGGCAATACCACTTGTGGACCTGGCTGTACCACGGAGTGCCCTG 
CGACCCCAAAGGCATGCTGCACTTCCTGGACGAAGTGCACCATAAGCAGGAGAGCATC 

TCATTGTGATTGATATTCTTATTGACATCATCAGAGAGAAAGGTGTTGACTGCGAAAT 
TGACGTTCCCAAAACCATCCAGATGGTGCGGTCTCAGAGGTCAGAGATGGTCCAGACA 
GAAGCACAGTACCGATTTATCTATACGGTGGTCCAGCATCATATTGAAACACTACAGC 
GCAGGATTGAGGAAGAGCAGAAAAGCAAGAGGAAAGGGCACGAATATACAAATATTAA 
GTATTCTCTAGTGGACCAGACGAGTGGGGATCAGAGCCCCCTCCTGCCTTATACTCCA 
ATGCCACCCTGTGCAGAAATGAGAGAAGAGAGTGCCAGAGTCTATGAAAACGTGGGCC 
TGATGCAACAGCAGAAACATTTAAGATGAGAAA
ORF Start: ATG at 9


ORF Stop: TGA at 1767 




SEQ ID NO: 192 


586 aa | MW at 67260.0kD


NOV66a, 
CG99618-01 Protein Sequence


MTSWKWFHPNITGVEAENLLLTRGVDGSFSARPSKSNPGDVTLSVRRNGAVTHTKIQN 
TGDCYDLYGGEKFATLAELVQYYMEHHGQLKEKNGDVIELKNPLNCADPTSQRWFHGH 
LSGKEAEKLLTEKGKHSSFLVRESQSHPDFVLSVCTGDDKGESNDGKSKVTHVMIHCQ 
ELKYDVGGGERFDALTDLKKNPMVETLGTVLQLKQPLNTTCTNAAEIESRVRELSKLA
ETTDKVRQGFWEEFETLQQQECKLLYSQKEGRRQENKTKNRYKNILPFDHRVVLHNGD
PSEAVSDYINADIIMPEFEPSAQFVAKKSYLGTSLITGLFLNEFWHMVLIFLRGKSVT 
RQHQVERGKSKCVKYWPDEYALKEYGIVHVRNVKESTTHDYKLRKLKLSKIGQGNMER
MVWQYHLWTWLYHGVPCDPKGMLHFLDEVHHKQESIMDAGLVWPCSAGIGQTGTFIV 
IDILIDIIREKGVDCEIDVPKTIQMVRSQRSEMVQTEAQYRFIYTVVQHHIETLQRRI
EEEQKSKRKGHEYTNIKYSLVDQTSGDQSPLLPYTPMPPCAEMREESARVYENVGLMQ 
QQKHLR 



Further analysis of the NOV66a protein yielded the following properties shown in 
Table 66B. 



Table 66B. Protein Sequence Properties NOV66a 


PSort analysis:


0.4500 probability located in cytoplasm; 0.3719 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV66a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 66C. 
Table 66C. Geneseq Results for NOV66a


Geneseq 
Identifier 


Protein/Organism/Length [Patent #,
Date]


NOV66a
Residues/
Match Residues


Identities/ 
Similarities for the 
Matched Region 


Expect 
Value 


AAB59215 


SHP-2 protein - Unidentified, 593 aa. 
[US6156551-A, 05-DEC-2000]


1..586 


498/593 (83%) 
521/593 (86%) 


0.0 


AAY13476


Peptide Seq ID No: 8 of WO9923493
- Homo sapiens, 593 aa.
[WO9923493-A1, 14-MAY-1999]


1..586 
1..593 


498/593 (83%) 
521/593 (86%) 


0.0 


AAR99313 


Human SH-PTP2 (protein tyrosine 
phosphatase-2, with SH2 domains) - 
Homo sapiens, 593 aa. [US5536636-A,
16-JUL-1996]


1..586 
1..593 


498/593 (83%) 
521/593 (86%) 


0.0 


AAB59221 


SHP-2 activated double compound 
mutant E76A/D425A protein - 
Unidentified, 593 aa. [US6156551-A, 
05-DEC-2000] 


1..586 
1..593 


497/593 (83%) 
520/593 (86%) 


0.0 


AAB59214 


SHP-2 mutant E76A protein - 
Unidentified, 593 aa. [US6156551-A,
05-DEC-2000] 


1..586 
1..593 


497/593 (83%) 
520/593 (86%) 


0.0 



In a BLAST search of public sequence databases, the NOV66a protein was found to
have homology to the proteins shown in the BLASTP data in Table 66D. 
Table 66D. Public BLASTP Results for NOV66a


Protein 
Accession 
Number 


Protein/Organism/Length


NOV66a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q06124 


Protein-tyrosine phosphatase, non-receptor type 11 (EC 3.1.3.48)
(Protein-tyrosine phosphatase 2C) (PTP-2C) (PTP-1D) (SH-PTP3)
(SH-PTP2) (SHP-2) - Homo sapiens (Human), 593 aa.


1..586 
1..593 


498/593 (83%) 
521/593 (86%) 


0.0 


Q64509 


PROTEIN TYROSINE 
PHOSPHATASE (EC 3.1.3.48) - Mus 
musculus (Mouse), 597 aa. 


1..586 
1..597 


497/597 (83%) 
522/597 (87%) 


0.0 


P41499 



Protein-tyrosine phosphatase, non-receptor
type 11 (EC 3.1.3.48)
(Protein-tyrosine phosphatase SYP) - 
Rattus norvegicus (Rat), 593 aa. 


1..586 
1..593 


494/593 (83%) 
520/593 (87%) 


0.0 


Q90687


PROTEIN-TYROSINE 
PHOSPHATASE N11 (EC 3.1.3.48)
(PROTEIN TYROSINE
PHOSPHATASE, NON-RECEPTOR
TYPE 11) - Gallus gallus (Chicken),
593 aa. 


1..586 
1..593 


491/593 (82%) 
518/593 (86%) 


0.0 


A53593


protein-tyrosine-phosphatase (EC
3.1.3.48), nonreceptor type 11 - rat,
597 aa.


1..586 
1..597 


493/597 (82%) 
519/597 (86%) 


0.0 



PFam analysis predicts that the NOV66a protein contains the domains shown in 
the Table 66E. 



Table 66E. Domain Analysis of NOV66a 


Pfam Domain 


NOV66a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


SH2 


6..81 


39/79 (49%) 
66/79 (84%) 


8e-38 


SH2 


112..192


33/84 (39%) 
72/84 (86%) 


3.6e-29 


Y_phosphatase


268..513 


98/298 (33%) 
187/298(63%) 


1.7e-58 
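The match regions listed in Table 66E can be checked for length and for mutual overlap. The following is a rough sketch under stated assumptions: the helper names are hypothetical, and the region strings are those read from the table with OCR-damaged characters repaired (for example "112..192").

```python
def parse_region(cell: str) -> tuple[int, int]:
    """Turn a 'start..end' cell into a pair of 1-based inclusive positions."""
    start, end = cell.split("..")
    return int(start), int(end)


def span(region: tuple[int, int]) -> int:
    """Number of residues covered by an inclusive region."""
    return region[1] - region[0] + 1


def overlaps(a: tuple[int, int], b: tuple[int, int]) -> bool:
    """True when two inclusive regions share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Domain names and regions as read from Table 66E (OCR repaired; assumption).
domains = [("SH2", "6..81"), ("SH2", "112..192"), ("Y_phosphatase", "268..513")]
regions = [parse_region(cell) for _, cell in domains]
spans = [span(r) for r in regions]
any_overlap = any(
    overlaps(regions[i], regions[j])
    for i in range(len(regions))
    for j in range(i + 1, len(regions))
)
print(spans, any_overlap)
```

Under these assumptions the three regions do not overlap, which is consistent with two N-terminal SH2 domains followed by a phosphatase domain.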
Example 67.

The NOV67 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 67A.



Table 67A. NOV67 Sequence Analysis 




SEQ ID NO: 193


1311 bp 


NOV67a, 
CG99832-01 DNA Sequence 



CCGGGATGCACTGTTCCTGCTGTGGGTCCTCATCATGGAGACCAAACGGGTGGAGATT 
CCCGGCAGCGTCCTGGACGATTTCTGCAGCCGATTTATTTTGCATATTCCCAGCGAGG 
AAAGAGACAATGCAATCCGAGTGTGTTTTCAGATTGAACTTGCCCATTGGTTTTACTT 
GGATTTCTACATGCAGAACACACCAGGATTACCTCAGTGTGGGATAAGAGACTTTGCT 
AAAGCTCTCTTCAGTCATTGTCCGTTTTTGCTGCCTCAAGGTGAAGATGTGGAAAAAG 
TTTTGGATGAATGGAAGGAATATAAAATGGGAGTACCAACATATGGTGCAATTATTCT 
TGATGAGACACTTGAAAATGTACTACTAGTTCAGGGGTACCTAGCAAAATCAGGCTGG 
GGATTTCCAAAAGGAAAAGTAAATAAAGAAGAAGCTCCTCATGATTGTGCTGCTAGAG 
AGGTATTTGAAGAAACTGGTTTTGATATCAAAGACTATATTTGTAAGGATGATTACAT 
TGAACTTCGAATCAATGACCAGCTTGCTCGTTTGTACATCATTCCAGGAATTCCAAAA 
GACACAAAATTTAACCCAAAAACTAGAAGAGAAATTCGGAACATTGAGTGGTTCTCTA 
TTGAGAAATTGCCTTGTCATAGAAATGATATGACCCCCAAATCCAAACTTGGTTTGGC 
ACCTAACAAATTTTTTATGGCCATTCCCTTTATCAGGCCATTAAGGGACTGGCTTTCT 
CGAAGATTTGGCGATTCCTCAGACAGTGACAATGGATTTTCCTCAACTGGTAGCACGC 
CGGCTAAACCCACTGTGGAAAAATTGAGGAGTCGAACCAAATTCCGCCACAGTCAGCA 
GTTATTTCCTGACGGTTCTCCTGGTGACCAGTGGGTAAAGCACAGGCAACCACTGCAG 
CAAAAGCCATATAATAATCATTCTGAAATGTCTGACCTTTTAAAAGGAAAGAATCAAA 
GTATGAGGGGAAATGGCAGAAAACAGTATCAAGATTCACCTAATCAAAAGAAAAGAAC 
AAATGGGCTTCAGCCAGCAAAGCAGCAGAATTCTTTGATGAAGTGTGAAAAGAAACTT 
CATCCACGGAAACTTCAGGATAATTTTGAAACAGATGCTGTATATGACTTGCCTAGCT 
CCAGTGAAGACCAGTTGCTAGAACATGCTGAGGGACAGCCCGTGGCATGTAATGGACA 
TTGCAAGTTCCCCTTTTCATCCAGAGCCTTTTTGAGTTTCAAGTTTGACCATAATGCT 
ATAATGAAAATCTTGGACCTTTGATAGCAGCACAT 




ORF Start: ATG at 35 


ORF Stop: TGA at 1298 




SEQ ID NO: 194 


421 aa | MW at 48626.8kD


NOV67a, 
CG99832-01 Protein Sequence 


METKRVEIPGSVLDDFCSRFILHIPSEERDNAIRVCFQIELAHWFYLDFYMQNTPGLP
QCGIRDFAKALFSHCPFLLPQGEDVEKVLDEWKEYKMGVPTYGAIILDETLENVLLVQ
GYLAKSGWGFPKGKVNKEEAPHDCAAREVFEETGFDIKDYICKDDYIELRINDQLARL 
YIIPGIPKDTKFNPKTRREIRNIEWFSIEKLPCHRNDMTPKSKLGLAPNKFFMAIPFI
RPLRDWLSRRFGDSSDSDNGFSSTGSTPAKPTVEKLRSRTKFRHSQQLFPDGSPGDQW 
VKHRQPLQQKPYNNHSEMSDLLKGKNQSMRGNGRKQYQDSPNQKKRTNGLQPAKQQNS 
LMKCEKKLHPRKLQDNFETDAVYDLPSSSEDQLLEHAEGQPVACNGHCKFPFSSRAFL 
SFKFDHNAIMKILDL 
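The relationship between the reported ORF start and the first residues of the encoded polypeptide can be illustrated by translating the first line of the NOV67a DNA sequence. This is a minimal sketch rather than the method used to prepare Table 67A: the function name `translate` is hypothetical, and the standard genetic code is assumed.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the NOV67a DNA sequence as printed in Table 67A.
first_line = "CCGGGATGCACTGTTCCTGCTGTGGGTCCTCATCATGGAGACCAAACGGGTGGAGATT"
orf_start = 35                      # reported 1-based position of the ATG
coding = first_line[orf_start - 1:]  # convert to a 0-based slice
print(coding[:3], translate(coding))
```

The slice begins with ATG and translates to METKRVEI, the first eight residues of the NOV67a protein sequence.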



Further analysis of the NOV67a protein yielded the following properties shown in 
Table 67B. 



Table 67B. Protein Sequence Properties NOV67a 


PSort analysis: 


0.4741 probability located in microbody (peroxisome); 0.4500 probability located in 
cytoplasm; 0.1000 probability located in mitochondrial matrix space; 0.1000 
probability located in lysosome (lumen) 


SignalP analysis: 


No Known Signal Sequence Predicted 
A search of the NOV67a protein against the Geneseq database, a proprietary
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 67C. 



Table 67C. Geneseq Results for NOV67a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV67a 
Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB62134 


Drosophila melanogaster polypeptide 
SEQ ID NO 13194 - Drosophila
melanogaster, 791 aa. 
[WO200171042-A2, 27-SEP-2001] 


4..261 
205..474 


131/270 (48%) 
171/270 (62%) 


4e-72 


AAM90822 


Human immune/haematopoietic 
antigen SEQ ID NO: 18415 - Homo
sapiens, 95 aa. [WO200157182-A2, 
09-AUG-2001] 


253..348 
1..92 


91/96 (94%) 
92/96 (95%) 


8e-46 


AAW87487 


S. cerevisiae TIH1 polypeptide -
Saccharomyces cerevisiae, 595 aa. 
[US5846764-A, 08-DEC-1998] 


12..239
18..235 


82/228 (35%) 
125/228 (53%) 


2e-36 


AAW54167 


S. cerevisiae TIH1 protein -
Saccharomyces cerevisiae, 595 aa. 
[WO9813502-A2, 02-APR-1998] 


12..239
18..235 


82/228 (35%) 
125/228 (53%) 


2e-36 


AAR81314 


Yeast TIH1 - Saccharomyces 
cerevisiae, 595 aa. [WO9519988-A1,
27-JUL-1995] 


12..239 
18..235 


82/228 (35%) 
125/228(53%) 


2e-36 



In a BLAST search of public sequence databases, the NOV67a protein was found to
have homology to the proteins shown in the BLASTP data in Table 67D. 
Table 67D. Public BLASTP Results for NOV67a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV67a 
Residues/ 
Match Residues 


Identities/
Similarities for
the Matched
Portion


Expect
Value


Q9CYC6 


5730537H01RIK PROTEIN - Mus
musculus (Mouse), 422 aa. 


1..421 
1..422 


387/423 (91%)
402/423 (94%) 


0.0 


Q9VUU4


CG6169 PROTEIN - Drosophila 
melanogaster (Fruit fly), 791 aa. 


4..261 
205..474 


131/270 (48%) 
171/270 (62%) 


1e-71


O13828


HYPOTHETICAL 83.2 KDA 
PROTEIN C19A8.12 IN 
CHROMOSOME I - 
Schizosaccharomyces pombe (Fission 
yeast), 741 aa.


12..242 
11..236


109/231 (47%) 
153/231 (66%) 


1e-59


Q948G1


PUTATIVE ABC TRANSPORTER - 
Oryza sativa (Rice), 1372 aa. 


18..265 
114..368


96/258 (37%)
148/258 (57%) 


9e-46 


E88886


protein F52G2.1a [imported] -
Caenorhabditis elegans, 328 aa. 


4..195
129..323 


89/195 (45%) 
123/195 (62%) 


8e-44 



PFam analysis predicts that the NOV67a protein contains the domains shown in 
the Table 67E. 



Table 67E. Domain Analysis of NOV67a 


Pfam Domain 


NOV67a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


NUDIX 


96..221 


25/137(18%) 
78/137 (57%) 


4.7e-12 



Example 68. 



The NOV68 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 68A. 
Table 68A. NOV68 Sequence Analysis




SEQ ID NO: 195 | 5474 bp


NOV68a, 
CG99842-01 DNA Sequence 


TGATCATAAACAATACTTTTGGAGCTGGATTAATATTAAGTCACTTGAGAGATATAGA


TTGAGACCGTAATACTTCAGGTTCTAATTGTGTCCAGAAGGCTATGTGTTAACATTCA 
TAGCAAACTGGTTACCTTTATTTCCTGAAAGTTAAATTTTGAAAGAAAGGGATACCAA 
GTGAGAAAGCAACCTTCAGTTAAACAAAGGGGTACATCAAGTTTGATACTCTTGGATT 
AAAAATCAGGTGACAATTGGAACATGGAGGAGAAATTCATGATTATAGAAACCATGAA 
GCCAATCTTTTTTACAACAAAGCTGCTTGTTTTTTTCTTAATCATGTAGTGGCTGCTA 
TCAGGCCAGTTCTGGAACTTCACTGAGTCCCTCTGCTCTTCTGAATCAGCTCTTGATT 
GGTGGTGACTTCTCGGAAGTTCTTATCCCTACCCTCCACCTTCTGCTCCAGGGTGGCA 
CGACTCARACCTGCCCTACGGCCCATTAGCTCCATTCCATGAAGTTCCTATCCAGTAC 
CAGGCACGTTTCTTCTGCGTCTGTCTGGCTCGCGTGTTTCTTTTTGCCTGTGTCCCGC 




TCTCTCCAACTCTCTCCCCCTCTCAGCCACTTCCTGTCTCCAGTATGTCCCTGGCGCG 
GCCCGGCTGGCCGTCTGCGCACCCTCTCTCCCCTCGGCTCTTTCCTAGGAAAGCTGAG 
CCTCATAGCTTCCGGGAGAAGGTTTTCCGGAAGAAACCTCCAGTCTGTGCAGTATGTA 
AGGTGACCATCGATGGGACAGGCGTTTCGTGCAGAGTCTGCAAGGTGGCGACGCACAG 
AAAATGTGAAGCAAAGGTGACTTCAGCCTGTCAGGCCTTGCCTCCCGTGGAGTTGCGG 
CGAAACACGGCCCCAGTCAGGCGCATAGAGCACCTGGGATCCACCAAATCTCTGAACC 
ACTCAAAGCAGCGCAGCACTCTGCCCAGGAGCTTCAGCCTGGACCCGCTCATGGAGCG 
GCGCTGGGACTTAGACCTCACCTACGTGACGGAGCGCATCTTGGCCGCCGCCTTCCCC 
GCGCGGCCCGATGAACAGCGGCACCGGGGCCACCTGCGCGAGCTGGCCCATGTGCTGC 
AATCCAAGCACCGGGACAAGTACCTGCTCTTCAACCTTTCAGAGAAAAGGCATGACCT 
GACCCGCTTAAACCCCAAGGTTCAAGACTTCGGCTGGCCTGAGCTGCATGCTCCACCC 
CTGGACAAGCTGTGCTCCATCTGCAAAGCCATGGAGACATGGCTCAGTGCTGACCCAC
AGCACGTGGTCGTACTATACTGCAAGGGAAACAAGGGCAAGCTTGGGGTCATCGTTTC 
TGCCTACATGCACTACAGCAAGATCTCTGCAGGGGCGGACCAGGCACTGGCCACTCTT 
ACCATGCGGAAATTCTGCGAGGACAAGGTGGCCACAGAACTGCAGCCCTCCCAGCGTC 
GATATATCAGCTACTTCAGTGGGCTGCTATCTGGCTCCATCAGAATGAACAGCAGCCC
TCTCTTCCTGCACTATGTGCTCATCCCCATGCTGCCAGCCTTTGAACCTGGCACAGGC 
TTCCAGCCCTTCCTTAAAATCTACCAGTCCATGCAGCTTGTCTACACATCTGGAGTCT 
ATCACATTGCAGGCCCTGGTCCCCAGCAGCTTTGCATCAGCCTGGAGCCAGCCCTCCT 
CCTCAAAGGCGATGTCATGGTAACATGTTATCACAAGGGTGGCCGGGGCACAGACCGG




TCCCCAAGGACCAGCTTGACGAGGCCTGGACTGATGAGAGGTTCCCCTTCCAAGCCTC 
CGTGGAGTTTGTCTTCTCCTCCAGCCCCGAGAAGATCAAAGGCAGCACTCCACGGAAC 
GACCCCTCGGTCTCTGTCGACTACAACACCACTGAGCCAGCCGTGCGCTGGGACTCCT 
ATGAGAACTTCAACCAGCACCACGAGGACAGTGTGGATGGCTCCTTGACCCACACCCG 
GGGTCCCCTGGATGGCAGTCCTTATGCCCAGGTGCAGCGGCCTCCCCGGCAGACCCCC 
CCGGCACCCTCTCCAGAGCCTCCACCACCCCCCATGCTCTCTGTCAGCAGCGACTCAG 
GCCATTCCTCCACGCTGACCACAGAGCCGGCTGCTGAGTCCCCTGGCCGGCCGCCCCC 
TACAGCTGCTGAACGGCAGGAGCTGGATCGCCTCCTAGGAGGCTGCGGAGTGGCCAGT 
GGGGGCCGGGGAGCTGGGCGCGAGACGGCCATCCTAGATGACGAAGAGCAGCCCACTG




CGGCCAGAGGGAACCCTGGAGAGGAGGCGACTGGCCTACGGGGGCTATGAGGGATCCC 
CCCAGGGCTACGCCGAGGCCTCGATGGAGAAGAGGCGCCTCTGCCGATCGCTGTCAGA 
GGGGCTATACCCCTACCCACCTGAGATGGGGAAACCAGCCACTGGGGACTTTGGCTAC 
CGCGCCCCAGGCTACCGGGAGGTGGTCATCCTGGAGGACCCTGGGCTGCCTGCCCTAT 
ACCCATGCCCAGCCTGCGAGGAGAAGCTGGCGCTGCCTACAGCAGCCTTGTATGGACT 
GCGGCTGGAGAGGGAGGCTGGAGAAGGGTGGGCAAGTGAGGCTGGCAAGCCTCTCCTG 
CACCCAGTGCGGCCTGGGCACCCGCTGCCTCTGCTCTTGCCTGCCTGTGGGCATCACC 
ATGCCCCGATGCCTGACTACAGCTGCCTGAAGCCACCCAAGGCAGGCGAGGAAGGGCA 
CGAGGGCTGCTCCTACACCATGTGCCCCGAAGGCAGGTATGGGCATCCAGGGTACCCT 
GCCCTGGTGACATACAGCTATGGAGGAGCAGTTCCCAGTTACTGCCCAGCATATGGCC 
GTGTGCCTCATAGCTGTGGCTCTCCAGGAGAGGGCAGAGGGTATCCCAGCCCTGGTGC 
CCACTCCCCACGGGCTGGCTCCATTTCCCCGGGCAGCCCGCCCTATCCACAATCTAGG
AAGCTGAGCTACGAGATCCCTACGGAGGAGGGAGGGGACAGGTACCCATTGCCTGGGC 
ACCTGGCCTCAGCAGGACCTTTGGCATCTGCAGAGTCGCTGGAGCCGGTGTCCTGGAG 
GGAGGGCCCCAGTGGGCACAGCACACTGCCTCGGTCTCCCCGAGATGCCCCATGCAGT 
GCTTCGTCAGAGTTGTCTGGTCCCTCCACGCCCCTGCACACCAGCAGTCCAGTCCAGG 
GCAAGGAAAGCACCCGGCGACAGGACACCAGGTCCCCCACCTCAGCGCCCACTCAGAG 




ACTGAGTCCTGGCGAGGCCTTGCCCCCTGTTTCCCAGGCAGGCACCGGAAAGGCCCCT 
GAGCTGCCGTCGGGAAGTGGGCCTGAGCCTCTGGCCCCTAGCCCAGTCTCTCCGACCT 
TCCCTCCCAGCTCGCCCAGTGACTGGCCTCAGGAAAGGAGTCCAGGGGGCCACTCAGA 
TGGCGCCAGTCCTCGGAGCCCTGTGCCCACCACACTTCCTGGCCTCCGCCACGCCCCC 
TGCTTTCCCCCTGGCTGCCTCCTATGACACCAATGGCCTTAGCCAGCCCCCACTTCCT




CATCATCGCCAGCCAGAGGCATCAGTCACCATGTCACCTTCGCACCTCTGCTCTCAGA 
TAATGTCCCCCAAACCCCAGAGCCTCCTACACAAGAGAGCCAAAGCAATGTCAAGTTT
GTCCAGGATACATCCAAGTTCTGGTACAAGCCACACCTGTCCCGTGACCAAGCCATTG 
CCCTGCTGAAGGACAAGGACCCTGGGGCCTTCCTGATCAGGGACAGTCATTCATTCCA 
AGGAGCTTATGGGCTGGCCCTCAAGGTGGCCACACCGCCACCCAGTGCCCAGCCCTGG 
AAAGGGGACCCCGTGGAACAGCTGGTCCGCCATTTCCTCATCGAGACTGGGCCCAAAG 
GGGTGAAGATCAAGGGCTGCCCCAGTGAGCCCTACTTTGGCAGCCTGTCCGCCTTGGT 
CTCCCAGCACTCCATCTCCCCCATCTCCCTGCCCTGCTGCCTGCGCATTCCCAGCAAA 
GATCCTCTGGAAGAGACCCCAGAGGCTCCAGTGCCCACCAACATGAGCACAGCGGCAG
ACCTCCTGCGTCAGGGTGCTGCCTGCAGCGTGCTCTACTTGACCTCAGTGGAGACAGA 
GTCACTGACGGGCCCCCAAGCTGTGGCCCGGGCCAGCTCTGCAGCTCTGAGCTGTAGC 
CCCCGCCCGACACCAGCTGTTGTCCACTTCAAGGTGTCAGCCCAGGGCATTACACTGA 
CGGACAACCAAAGGAAGCTCTTCTTTCGCCGCCATTATCCAGTGAACAGCATCACCTT 
CTCCAGCACTGACCCTCAAGACCGGAGATGGACCAACCCAGACGGGACCACCTCCAAG 
ATCTTTGGTTTCGTGGCCAAGAAGCCGGGAAGCCCCTGGGAGAATGTGTGTCACCTCT 
TTGCAGAGCTTGACCCAGATCAGCCTGCTGGCGCCATTGTCACCTTCATCACCAAAGT 
TCTACTGGGCCAGAGAAAATGAAGGAAGGCCACAAGCTCAGAGCCCACATCAACACTG 
CCCCCCTCCCAGCACCCCACAGCCCTCACATCCCCTGGCCTGGACCCAGGAGACCCAG 
GAGAAAGCACCCTCCCTTAGGAATGAGGAGTGGGCATCAGGCCTGGGACACTGCTCTC 
CTTCCCCGCCCCCAGCCTGCTAAGTTAAGTGGACAGGCCCACAAGATGACCTTGCATG 
TGAGCAGATGGCAGAGATGGGTGTGTGAGGGGTGAGGAGGCATCAGCAGTTGAGCCCC 
GAAGGAGATCAGGCAGCCCCACCTGCAGGAGAACGTCAGCCCTCCAGGGGATCAGCCC 
CTGCCAGTTCCACCCAGCTGCAGGTGCCAGCACGGCAGGGATGGGAGAGGGGTGGGGA 
GCGAGTCACTGCCTCCTCTGAGCAGAGATTCAGAGTAGGATCACATGAATAGGGGAAA 
AAAGAGAGTCTATTTTTGTCTA^TAATAAAGAATTTCTATAAACTTTAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 




ORF Start: ATG at 561


ORF Stop: TGA at 4950 




SEQ ID NO: 196


1463 aa | MW at 158629.7kD 


NOV68a, 
CG99842-01 Protein Sequence 


MKFLSSTRHVSSASVWLACFFLPVSRRVRLSPSLRLSRPRVLRGRLSPTLSPSQPLPV
SSMSLARPGWPSAHPLSPRLFPRKAEPHSFREKVFRKKPPVCAVCKVTIDGTGVSCRV 
CKVATHRKCEAKVTSACQALPPVELRRNTAPVRRIEHLGSTKSLNHSKQRSTLPRSFS 
LDPLMERRWDLDLTYVTERILAAAFPARPDEQRHRGHLRELAHVLQSKHRDKYLLFNL
SEKRHDLTRLNPKVQDFGWPELHAPPLDKLCSICKAMETWLSADPQHVVVLYCKGNKG
KLGVIVSAYMHYSKISAGADQALATLTMRKFCEDKVATELQPSQRRYISYFSGLLSGS 
IRMNSSPLFLHYVLIPMLPAFEPGTGFQPFLKIYQSMQLVYTSGVYHIAGPGPQQLCI 
SLEPALLLKGDVMVTCYHKGGRGTDRTLVFRVQFHTCTIHGPQLTFPKDQLDEAWTDE 
RFPFQASVEFVFSSSPEKIKGSTPRNDPSVSVDYNTTEPAVRWDSYENFNQHHEDSVD 
GSLTHTRGPLDGSPYAQVQRPPRQTPPAPSPEPPPPPMLSVSSDSGHSSTLTTEPAAE 
SPGRPPPTAAERQELDRLLGGCGVASGGRGAGRETAILDDEEQPTVGGGPHLGVYPGH 
RPGLSRHCSCRQGYREPCGVPNGGYYRPEGTLERRRLAYGGYEGSPQGYAEASMEKRR 
LCRSLSEGLYPYPPEMGKPATGDFGYRAPGYREVVILEDPGLPALYPCPACEEKLALP
TAALYGLRLEREAGEGWASEAGKPLLHPVRPGHPLPLLLPACGHHHAPMPDYSCLKPP 
KAGEEGHEGCSYTMCPEGRYGHPGYPALVTYSYGGAVPSYCPAYGRVPHSCGSPGEGR 
GYPSPGAHSPRAGSISPGSPPYPQSRKLSYEIPTEEGGDRYPLPGHLASAGPLASAES 
LEPVSWREGPSGHSTLPRSPRDAPCSASSELSGPSTPLHTSSPVQGKESTRRQDTRSP
TSAPTQRLSPGEALPPVSQAGTGKAPELPSGSGPEPLAPSPVSPTFPPSSPSDWPQER
SPGGHSDGASPRSPVPTTLPGLRHAPWQGPRGPPDSPDGSPLTPVPSQMPWLVASPEP 
PQSSPTPAFPLAASYDTNGLSQPPLPEKRHLPGPGQQPGPWGPEQASSPARGISHHVT 
FAPLLSDNVPQTPEPPTQESQSNVKFVQDTSKFWYKPHLSRDQAIALLKDKDPGAFLI 
RDSHSFQGAYGLALKVATPPPSAQPWKGDPVEQLVRHFLIETGPKGVKIKGCPSEPYF 
GSLSALVSQHSISPISLPCCLRIPSKDPLEETPEAPVPTNMSTAADLLRQGAACSVLY 
LTSVETESLTGPQAVARASSAALSCSPRPTPAVVHFKVSAQGITLTDNQRKLFFRRHY
PVNSITFSSTDPQDRRWTNPDGTTSKIFGFVAKKPGSPWENVCHLFAELDPDQPAGAI 
VTFITKVLLGQRK 



Further analysis of the NOV68a protein yielded the following properties shown in
Table 68B. 
Table 68B. Protein Sequence Properties NOV68a


PSort analysis: 


0.3700 probability located in outside; 0.1900 probability located in lysosome 
(lumen); 0.1080 probability located in nucleus; 0.1000 probability located in
endoplasmic reticulum (membrane) 


SignalP analysis: 


Cleavage site between residues 28 and 29 
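The PSort line in Table 68B lists candidate localizations with probabilities separated by semicolons. A small sketch of how such a line might be parsed and ranked follows; the function name `parse_psort` and the regular expression are illustrative assumptions and are not part of the PSort program.

```python
import re


def parse_psort(text: str) -> list[tuple[float, str]]:
    """Extract (probability, location) pairs from a PSort summary line."""
    pairs = re.findall(r"(\d\.\d+) probability located in ([^;]+)", text)
    return [(float(prob), location.strip()) for prob, location in pairs]


# PSort summary for NOV68a as given in Table 68B, joined onto one line.
psort_nov68a = (
    "0.3700 probability located in outside; "
    "0.1900 probability located in lysosome (lumen); "
    "0.1080 probability located in nucleus; "
    "0.1000 probability located in endoplasmic reticulum (membrane)"
)

predictions = parse_psort(psort_nov68a)
best = max(predictions)  # tuples compare on probability first
print(best)
```

For NOV68a the highest-scoring entry is "outside" at 0.3700, which fits the predicted signal peptide cleavage site between residues 28 and 29.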



A search of the NOV68a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 68C. 



Table 68C. Geneseq Results for NOV68a 


Geneseq
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV68a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB40638


Human ORFX ORF402 polypeptide 
sequence SEQ ID NO:804 - Homo 
sapiens, 1400 aa. [WO200058473-A2, 
05-OCT-2000] 


64..1463
1..1400 


1400/1400 
(100%) 

1400/1400 
(100%) 


0.0 


AAM40312 


Human polypeptide SEQ ID NO 3457 - 
Homo sapiens, 1409 aa. 
[WO200153312-A1, 26-JUL-2001] 


80..1463
26..1409


1383/1384 
(99%) 

1383/1384 
(99%) 


0.0 


AAU17196 


Novel signal transduction pathway 
protein, Seq ID 761 - Homo sapiens, 524 
aa. [WO200154733-A1, 02-AUG-2001]


942..1463 
3..524


522/522 (100%) 
522/522 (100%) 


0.0 


AAU17573


Novel signal transduction pathway
protein, Seq ID 1138 - Homo sapiens,
522 aa. [WO200154733-A1, 02-AUG-2001]


942..1463 
1..522 


521/522 (99%) 
521/522 (99%) 


0.0 


AAB42301 


Human ORFX ORF2065 polypeptide 
sequence SEQ ID NO:4130 - Homo 
sapiens, 523 aa. [WO200058473-A2, 
05-OCT-2000] 


149..582 
48..492 


232/445 (52%) 
311/445(69%) 


e-131 



In a BLAST search of public sequence databases, the NOV68a protein was found to
have homology to the proteins shown in the BLASTP data in Table 68D. 
Table 68D. Public BLASTP Results for NOV68a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV68a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UPS7 


KIAA1075 PROTEIN - Homo
sapiens (Human), 1400 aa (fragment). 


64..1463
1..1400


1399/1400 
(99%) 

1399/1400 
(99%) 


0.0 




TENSIN2 - Homo sapiens (Human), 
1285 aa. 


1..1285 


1284/1285 
(99%) 

1284/1285 
(99%) 




Q9NT29 


HYPOTHETICAL 68.7 KDA 
PROTEIN - Homo sapiens (Human), 
649 aa (fragment). 


813..1463
1..649


648/651 (99%) 
648/651 (99%) 


0.0 


AAH25818 


SIMILAR TO KIAA1075 PROTEIN 
- Mus musculus (Mouse), 655 aa 
(fragment). 


808..1463
2..655 


564/663 (85%) 
580/663 (87%) 


0.0 


A54970 


tensin, cardiac muscle - chicken, 1744 
aa. 


149..1140
31..1088


382/1111 (34%)
513/1111 (45%) 


e-139 



PFam analysis predicts that the NOV68a protein contains the domains shown in 
the Table 68E. 



Table 68E. Domain Analysis of NOV68a 


Pfam Domain 


NOV68a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


DAG_PE-bind 


86..133 


16/51 (31%) 
33/51 (65%) 


3e-05 


PHD 


99..136 


9/51 (18%) 
20/51 (39%) 


0.22 


SH2 


1194..1286 


28/97 (29%) 
66/97 (68%) 


5.4e-15 



Example 69. 



The NOV69 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 69A. 
Table 69A. NOV69 Sequence Analysis




SEQ ID NO: 197 | 1502 bp


NOV69a, 
CG99944-01 DNA Sequence 



GAATAAGGAGGAAGAATCATGAATGAAAAAGTCATAGAAATGCAGAATATT&rr'aAar: 
AATTTGGAACATTCAAGGCAAATGACAACATCAATTTACAAGTAAAGGCAGGAGAGAT 
TCATGCGTTGCTTGGAGAAAACGGTGCTGGCAAATCTACATTGATGAAGGTGCTTTCC 
GGATTATTAGAGCCGACATCAGGGAAAATTTTGATGCGTGGGAAAGAAGTACAGATCA 
CAAGCCCGACAAAAGCCAATCAATTAGGGATTGGGATGGTCCATCAGCACTTTATGCT 
TGTTGATGCCTTTACTGTAACAGAAAACATCGTGTTGGGAAGCGAACCTAGTCGTGCA 
GGGATGCTTGACCATAAAAAAGCGCGAAAAGAGATCCAAAAAGTTTCTGAACAATATG 
GATTATCAGTCAACCCGGATGCTTATGTTCGTGATATTTCAGTTGGGATGGAACAACG 
GGTAGAAATTTTAAAAACACTTTACCGAGGAGCAGATGTACTGATTTTTGATGAGCCG 
ACAGCTGTATTGACCCCTCAGGAAATTGATGAATTAATCGTGATCATGAAGGAATTAG
TCAAAGAAGGCAAGTCAATCATTTTGATTACGCATAAGTTAGATGAAATCAAAGCAGT 
AGCTGACCGTTGTACAGTTATCCGCCGTGGAAAAGGAATCGGTACAGTCAACGTTAAA 
GACGTTACCTCACAGCAATTAGCTGATATGATGGTCGGAAGAGCGGTTTCATTCAAAA 
CGATGAAAAAAGAAGCGAAGCCTCAAGAAGTCGTTTTGTCTATTGAAAATCTAGTGGT 
AAAAGAAAATCGTGGATTAGAAGCCGTGAAAAACCTGAACTTAGAGGTTCGTGCTGGC 
GAAGTACTTGGTATCGCTGGAATCGATGGAAACGGGCAGTCGGAGTTGATCCAAGCTT 
TGACTGGTTTGCGAAAGGCAGAAAGCGGACATATCAAGCTAAAAGGGGAAGACATCAC 
CAATAAAAAACCTCGAAAGATCACTGAACATGGTGTAGGACATGTGCCAGAAGACCGT 
CATAAATACGGGTTGGTCCTAGATATGACATTGTCTGAAAACATTGCCCTGCAAACGT 
ATCATCAAAAACCTTACAGTAAAAACGGTATGCTGAATTATTCAGTGATAAATGAACA 
TGCCAGAGAATTGATCGAAGAATATGATGTTCGAACAACGAATGAACTTGTTCCTGCA 
AAAGCTTTATCAGGCGGAAATCAGCAAAAAGCAATCATCGCTCGGATAGTCGACCGAG 
ATCCTGATCTGTTGATCGTTGCAAATCCAACTCGTGGGCTGGATGTAGGTGCGATCGA 
ATTTATTCATAAACGTCTGATCGAACAAAGGGACAAATACAAAGCAGTGTTATTGATT 
AGTTTCGAATTAGAAGAAATTTTAAATGTTTCGGATCGTATTGCTGTTATCCATGAAG
GAGAAATCGTCGGGATCGTTGATCCGAAAGAAACATCTGAAAATTAATAGGG 




ORF Start: ATG at 19 


ORF Stop: TAA at 1495




SEQ ID NO: 198 


492 aa | MW at 54439.3kD


NOV69a, 
CG99944-01 Protein Sequence


IKALLGENGAGKSTLMNVLSGLLRPT 
SGKILMRGKEVQITSPTKANQLGIGMVHQHFMLVDAFTVTENIVLGSEPSRAGMLDHK
KARKEIQKVSEQYGLSVNPDAYVRDISVGMEQRVEILKTLYRGADVLIFDEPTAVLTP
QEIDELIVIMKELVKEGKSIILITHKLDEIKAVADRCTVIRRGKGIGTVNVKDVTSQQ 
LADMMVGRAVSFKTMKKEAKPQEVVLSIENLVVKENRGLEAVKNLNLEVRAGEVLGIA
GIDGNGQSELIQALTGLRKAESGHIKLKGEDITNKKPRKITEHGVGHVPEDRHKYGLV 
LDMTLSENIALQTYHQKPYSKNGMLNYSVINEHARELIEEYDVRTTNELVPAKALSGG 
NQQKAIIARIVDRDPDLLIVANPTRGLDVGAIEFIHKRLIEQRDKYKAVLLISFELEE 
ILNVSDRIAVIHEGEIVGIVDPKETSEN 



Further analysis of the NOV69a protein yielded the following properties shown in 
Table 69B. 



Table 69B. Protein Sequence Properties NOV69a 


PSort analysis: 


0.6000 probability located in nucleus; 0.1000 probability located in mitochondrial 
matrix space; 0.1000 probability located in lysosome (lumen); 0.0000 probability 
located in endoplasmic reticulum (membrane) 


SignalP analysis: 


No Known Signal Sequence Predicted 



A search of the NOV69a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded
several homologous proteins shown in Table 69C. 
Table 69C. Geneseq Results for NOV69a


Geneseq 
Identifier 


Protein/Organism/Length [Patent #, 
Date] 


NOV69a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE16766


Human transporter and ion channel-3
(TRICH-3) protein - Homo
sapiens, 485 aa. [WO200192304-A2,
06-DEC-2001]


8..492
1..485 


485/485 (100%) 
485/485 (100%) 


0.0 


AAU35015 


Enterococcus faecalis cellular
proliferation protein #302 -
Enterococcus faecalis, 513 aa.
[WO200170955-A2, 27-SEP-2001]


8..492
1..485 


395/485 (81%) 
450/485 (92%) 


0.0 


AAU35015 


Enterococcus faecalis cellular 
proliferation protein #302 - 
Enterococcus faecalis, 513 aa. 
[WO200170955-A2, 27-SEP-2001]


8..492
1..485 


395/485 (81%) 
450/485 (92%) 


0.0 


AAU38034 


Streptococcus pneumoniae cellular 

proliferation protein #463 - 
Streptococcus pneumoniae, 511 aa. 
[WO200170955-A2, 27-SEP-2001]


2..491 
3..492 


352/490 (71%) 
424/490 (85%) 


0.0 


;\AU37574 


Streptococcus pneumoniae cellular 

proliferation protein #3 - 
Streptococcus pneumoniae, 51 1 aa. 
[WO200 170955-A2, 27-SEP-200I] 


2..491 
3..492 


352/490 (71%) 
424/490 (85%) 


0.0 



In a BLAST search of public sequence databases, the NOV69a protein was found to
have homology to the proteins shown in the BLASTP data in Table 69D. 
Table 69D. Public BLASTP Results for NOV69a


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV69a Residues/ 
Match Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAK99552 


ABC TRANSPORTER ATP- 
BINDING PROTEIN- 

RIBOSE/GALACTOSE
TRANSPORT - Streptococcus

pneumoniae R6, 511 aa.


2..491
3..492


352/490 (71%) 
424/490 (85%) 




Q97RG9 


SUGAR ABC TRANSPORTER, 

ATP-BINDING PROTEIN - 
Streptococcus pneumoniae, 511 
aa. 


2..491
3..492


352/490 (71%) 
424/490 (85%) 




Q99ZH5 


PUTATIVE SUGAR ABC 
TRANSPORTER (ATP- 
BINDING PROTEIN) - 
Streptococcus pyogenes, 510 aa.


1..491 
1..491 


347/491 (70%) 
424/491 (85%) 


0.0 


AAL97795 


PUTATIVE SUGAR ABC 
TRANSPORTER (ATP- 
BINDING PROTEIN) - 
Streptococcus pyogenes 
MGAS8232, 510 aa. 


1..491 
1..491 


347/491 (70%) 
424/491 (85%) 


0.0 


Q8Y7A1 


HYPOTHETICAL PROTEIN 
LM01389 - Listeria
monocytogenes, 513 aa.


5..491 
4..490 


345/487 (70%) 
407/487 (82%) 


0.0 



PFam analysis predicts that the NOV69a protein contains the domains shown in 
the Table 69E. 



Table 69E. Domain Analysis of NOV69a 


Pfam Domain 


NOV69a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ABC_tran


31..217 


73/201 (36%) 
144/201 (72%) 


5.8e-49 


ABC_tran


284..478 


59/207 (29%) 
134/207 (65%) 


7.4e-18 



Example 70. 



The NOV70 clone was analyzed, and the nucleotide and encoded polypeptide 
sequences are shown in Table 70A. 
Table 70A. NOV70 Sequence Analysis




SEQ ID NO: 199


689 bp


NOV70a, 
CG99963-01 DNA Sequence 


ftGCGTGAGCCACCGCACCTGGCCTATTATTGTTTTGTTAAAAGCAGGGTCTCGGCTTT 
GCAAATGCAGCCACCGACGCCAGGAGCTCTGTACGATCAGCCATGGTTAACCCCACCG 
TGTTCGACATCGCTGTCGACGGCAAGCCCTTGGGCCGCGTGTCCTTCGAGCCGTTTGC 
AGACAAGGTTCCAAAGGCAGCAGAAAACTTTCGTGCTCTGAGCAATGTAGAGAAAGGA 
TTTGGTTATAAAGGTTCCTGCTTTCACAGAATTATTCCAGGGTTTATGTGTCAGGGTG 
GTGACTTCACATGCCATAATGGCACTGGTGGCAAGTACACCTGCGGGGAGAAATTTGA 
TGACGAGAGCTTCGTCCTAAAGCATACACATCCTGGCATCTTGTCCATGGCAAATGCT 
GGACCCAACACAAATGGTTCCCAGTGTTTCATCTGCACTGCCAAGACTGAGTGGCTGG 
ATGGCAATCCTGTGGTCTTTGGCAAGGTGAAAGAAGGCATGAATATTGTGGAGGCCAT 
GGGGCACTTTGGGTCCGAGAATGGCAAGACCAGCAAGAAGATCACCATTGCTGACTGT 




TAGCTCAGAAGAGCACCCCTTCACCCCATTTGCTTGCAGTATTTTTTTTTT 




ORF Start: ATG at 101 


ORF Stop: TAA at 590 




SEQ ID NO: 200 


163 aa | MW at 17503.8kD


NOV70a, 
CG99963-01 Protein Sequence 


MVNPTVFDIAVDGKPLGRVSFEPFADKVPKAAENFRALSNVEKGFGYKGSCFHRIIPG 
FMCQGGDFTCHNGTGGKYTCGEKFDDESFVLKHTHPGILSMANAGPNTNGSQCFICTA 
KTEWLDGNPVVFGKVKEGMNIVEAMGHFGSENGKTSKKITIADCGQL




SEQ ID NO: 201 


596 bp 


NOV70b, 
CG99963-02 DNA Sequence 


TTGTTTTGTTAAAAGCAGGGTCTCGGCTTTGCAAATGCAGCCACCGACGCCAGGAGCT 
CTGTACGATCAGCCATGGTTAACCCCACCGTGTTnOArATrnrTRTrwarrsrar'aararT- 


CTTGGACCGCGTGTCCTTCGAGCCGTTTGCAGACAAGGTTCCAAAGGCAGCAGAAAAC 
TTTCGTGCTCTGAGCAATGTAGAGAAAGGATTTGGTTATAAAGGTTCCTGCTTTCACA 
GAATTATTCCAGGGTTTATGTGTCAGGGTGGTGACTTCACATGCCATAATGGCACTGG
TGGCAAGTACACCTGCGGGGAGAAATTTGATGACGAGAGCTTCGTCCTAAAGCATACA 
CATCCTGGCATCTTGTCCATGGCAAATGCTGGACCCAACACAAATGGTTCCCAGTGTT 
TCATCTGCACTGCCAAGACTGAGTGGCTGGATGGCAATCCTGTGGTCTTTGGCAAGGT 
GAAAGAAGGCATGAATATTGTGGAGGCCATGGGGCACTTTGGGTCCGAGAATGGCAAG 
ACCGGCAAGAAGATCACCATTGCTGACTGTGGACAACTCTAAC'^ "TP rTTI t 
TTTATCTTAACCACCA 




ORF Start: ATG at 73 


ORF Stop: TAA at 562 




SEQ ID NO: 202 (163 aa, MW at 17531.8 kD) 
NOV70b, CG99963-02 Protein Sequence 

MVNPTVFDIAVDGKPLDRVSFEPFADKVPKAAENFRALSNVEKGFGYKGSCFHRIIPG 
FMCQGGDFTCHNGTGGKYTCGEKFDDESFVLKHTHPGILSMANAGPNTNGSQCFICTA 
KTEWLDGNPVVFGKVKEGMNIVEAMGHFGSENGKTGKKITIADCGQL 
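
The ORF coordinates and protein lengths in Table 70A can be cross-checked with simple arithmetic. The following Python sketch assumes that positions are 1-based and that the "ORF Stop" position is the first base of the stop codon, so the stop codon itself is not counted; the function name is illustrative only.

```python
def orf_codon_count(start_pos: int, stop_pos: int) -> int:
    """Count coding codons for an ORF whose ATG begins at start_pos and
    whose stop codon begins at stop_pos (1-based; stop codon excluded)."""
    coding_nt = stop_pos - start_pos
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop are not in the same reading frame")
    return coding_nt // 3


# ORF coordinates as listed in Table 70A.
orfs = {"NOV70a": (101, 590), "NOV70b": (73, 562)}
for name, (start, stop) in orfs.items():
    print(name, orf_codon_count(start, stop), "codons")
```

Under these assumptions both ORFs span 489 coding nucleotides, which agrees with the 163 aa length listed for each protein.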



Sequence comparison of the above protein sequences yields the following 
sequence relationships shown in Table 70B. 



Table 70B. Comparison of NOV70a against NOV70b. 

Protein Sequence | NOV70a Residues; Match Residues | Identities; Similarities for the Matched Region 
NOV70b | 1..163; 1..163 | 161/163 (98%); 161/163 (98%) 
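
The identity count in Table 70B can be reproduced by a position-by-position comparison of the two protein sequences. The Python below is a minimal sketch, not the comparison program actually used: it assumes an ungapped alignment (both sequences are 163 aa) and uses the sequences of Table 70A with the third row read as KTEWLDGNPVVFGK, which is what the listed DNA encodes and what the stated 163 aa length requires.

```python
# Protein sequences from Table 70A (SEQ ID NO: 200 and SEQ ID NO: 202).
NOV70A = (
    "MVNPTVFDIAVDGKPLGRVSFEPFADKVPKAAENFRALSNVEKGFGYKGSCFHRIIPG"
    "FMCQGGDFTCHNGTGGKYTCGEKFDDESFVLKHTHPGILSMANAGPNTNGSQCFICTA"
    "KTEWLDGNPVVFGKVKEGMNIVEAMGHFGSENGKTSKKITIADCGQL"
)
NOV70B = (
    "MVNPTVFDIAVDGKPLDRVSFEPFADKVPKAAENFRALSNVEKGFGYKGSCFHRIIPG"
    "FMCQGGDFTCHNGTGGKYTCGEKFDDESFVLKHTHPGILSMANAGPNTNGSQCFICTA"
    "KTEWLDGNPVVFGKVKEGMNIVEAMGHFGSENGKTGKKITIADCGQL"
)


def compare_ungapped(a: str, b: str):
    """Return (identities, differences) for two equal-length sequences.
    Each difference is (1-based position, residue in a, residue in b)."""
    if len(a) != len(b):
        raise ValueError("ungapped comparison needs equal-length sequences")
    diffs = [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]
    return len(a) - len(diffs), diffs


identities, diffs = compare_ungapped(NOV70A, NOV70B)
print(f"{identities}/{len(NOV70A)} identical residues")
for pos, res_a, res_b in diffs:
    print(f"position {pos}: NOV70a {res_a} -> NOV70b {res_b}")
```

The two variants differ at two residues (Gly/Asp at position 17 and Ser/Gly at position 152), consistent with the 161/163 identities reported in the table.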



Further analysis of the NOV70a protein yielded the following properties shown in 
Table 70C. 
Table 70C. Protein Sequence Properties NOV70a 

PSort analysis: 0.6000 probability located in plasma membrane; 0.5689 probability located in 
microbody (peroxisome); 0.4500 probability located in cytoplasm; 0.1000 
probability located in mitochondrial matrix space 

SignalP analysis: No Known Signal Sequence Predicted 



A search of the NOV70a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publications, yielded 
several homologous proteins shown in Table 70D. 



Table 70D. Geneseq Results for NOV70a 

Geneseq Identifier | Protein/Organism/Length [Patent #, Date] | NOV70a Residues; Match Residues | Identities; Similarities for the Matched Region | Expect Value 
AAU01195 | Human cyclophilin A protein - Homo sapiens, 165 aa. [WO200132876-A2, 10-MAY-2001] | 1..163; 1..164 | 144/164 (87%); 148/164 (89%) | 1e-79 
AAW56028 | Calcineurin protein - Mammalia, 165 aa. [WO9808956-A2, 05-MAR-1998] | 1..163; 1..164 | 144/164 (87%); 148/164 (89%) | 1e-79 
AAR13726 | Bovine cyclophilin - Bos taurus, 163 aa. [US5047512-A, 10-SEP-1991] | 2..163; 1..163 | 143/163 (87%); 147/163 (89%) | 2e-79 
AAG65275 | Haematopoietic stem cell proliferation agent related human protein #2 - Homo sapiens, 164 aa. [JP2001163798-A, 19-JUN-2001] | 2..163; 1..163 | 143/163 (87%); 147/163 (89%) | 4e-79 
AAP90431 | Cyclophilin - Homo sapiens (human), 164 aa. [EP326067-A, 02-AUG-1989] | 2..163; 1..163 | 143/163 (87%); 147/163 (89%) | 4e-79 



In a BLAST search of public sequence databases, the NOV70a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 70E. 

Table 70E. Public BLASTP Results for NOV70a 

Protein Accession Number | Protein/Organism/Length | NOV70a Residues; Match Residues | Identities; Similarities for the Matched Portion | Expect Value 
CAC39529 | SEQUENCE 26 FROM PATENT WO0132876 - Homo sapiens (Human), 165 aa. | 1..163 | 144/164 (87%); 148/164 (89%) | 2e-79 
Q9TTC6 | CYCLOPHILIN 18 - Oryctolagus cuniculus (Rabbit), 164 aa. | 1..163; 1..164 | 143/164 (87%); 150/164 (91%) | 4e-79 
Q9BRU4 | PEPTIDYLPROLYL ISOMERASE A (CYCLOPHILIN A) - Homo sapiens (Human), 165 aa. | 1..163; 1..164 | 144/164 (87%); 147/164 (88%) | 5e-79 
P04374 | Peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8) (PPIase) (Rotamase) (Cyclophilin A) (Cyclosporin A-binding protein) - Bos taurus (Bovine), 163 aa. | 2..163; 1..163 | 143/163 (87%); 147/163 (89%) | 5e-79 
P05092 | Peptidyl-prolyl cis-trans isomerase A (EC 5.2.1.8) (PPIase) (Rotamase) (Cyclophilin A) (Cyclosporin A-binding protein) - Homo sapiens (Human), 164 aa. | 2..163; 1..163 | 143/163 (87%); 147/163 (89%) | 9e-79 



Pfam analysis predicts that the NOV70a protein contains the domains shown in 
Table 70F. 

Table 70F. Domain Analysis of NOV70a 

Pfam Domain | NOV70a Match Region | Identities; Similarities for the Matched Region | Expect Value 
pro_isomerase | 7..163 | 108/179 (60%); 140/179 (78%) | 3.7e-87 



Example B: Sequencing Methodology and Identification of NOVX Clones 

1. GeneCalling™ Technology: This is a proprietary method of performing 
differential gene expression profiling between two or more samples developed at CuraGen 
and described by Shimkets, et al., "Gene expression analysis by transcript profiling 
coupled to a gene database query" Nature Biotechnology 17:798-803 (1999). cDNA was 
derived from various human samples representing multiple tissue types, normal and 
diseased states, physiological states, and developmental states from different donors. 
Samples were obtained as whole tissue, primary cells or tissue cultured primary cells or 
cell lines. Cells and cell lines may have been treated with biological or chemical agents 
that regulate gene expression, for example, growth factors, chemokines or steroids. The 
cDNA thus derived was then digested with as many as 120 pairs of restriction 
enzymes, and pairs of linker-adaptors specific for each pair of restriction enzymes were 
ligated to the appropriate end. The restriction digestion generates a mixture of unique 
cDNA gene fragments. Limited PCR amplification is performed with primers 
homologous to the linker adapter sequence where one primer is biotinylated and the other 
is fluorescently labeled. The doubly labeled material is isolated and the fluorescently 
labeled single strand is resolved by capillary gel electrophoresis. A computer algorithm 
compares the electropherograms from an experimental and control group for each of the 
restriction digestions. This and additional sequence-derived information is used to predict 
the identity of each differentially expressed gene fragment using a variety of genetic 
databases. The identity of the gene fragment is confirmed by additional, gene-specific 
competitive PCR or by isolation and sequencing of the gene fragment. 
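
The paired restriction digestion can be pictured with a short in silico sketch. The Python below is an illustration only, under stated assumptions: the two recognition sequences (GAATTC and GGATCC) are arbitrary examples and not enzymes named in this disclosure; each cut is placed at the first base of the recognition site rather than at the true cleavage offset; and only fragments bounded by one site of each enzyme are kept, which is one reading of the requirement that the amplified material be doubly labeled.

```python
import re


def site_positions(seq: str, site: str) -> list:
    """0-based start positions of every (possibly overlapping) site match."""
    return [m.start() for m in re.finditer(f"(?={re.escape(site)})", seq)]


def double_digest(seq: str, site_a: str, site_b: str) -> list:
    """Return (start, end, length) for fragments flanked by different enzymes."""
    cuts = sorted(
        [(p, "A") for p in site_positions(seq, site_a)]
        + [(p, "B") for p in site_positions(seq, site_b)]
    )
    fragments = []
    for (p1, e1), (p2, e2) in zip(cuts, cuts[1:]):
        if e1 != e2:  # keep only fragments with one end from each enzyme
            fragments.append((p1, p2, p2 - p1))
    return fragments


# Hypothetical cDNA: one site for enzyme A followed by two sites for enzyme B.
demo = "AAAA" + "GAATTC" + "C" * 20 + "GGATCC" + "T" * 10 + "GGATCC" + "AA"
print(double_digest(demo, "GAATTC", "GGATCC"))
```

In this toy sequence only the A-to-B fragment is reported; the fragment lying between the two B sites is discarded because both of its ends would carry the same adaptor.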

2. SeqCalling™ Technology: cDNA was derived from various human samples 
representing multiple tissue types, normal and diseased states, physiological states, and 
developmental states from different donors. Samples were obtained as whole tissue, 
primary cells or tissue cultured primary cells or cell lines. Cells and cell lines may have 
been treated with biological or chemical agents that regulate gene expression, for example, 
growth factors, chemokines or steroids. The cDNA thus derived was then sequenced using 
CuraGen's proprietary SeqCalling technology. Sequence traces were evaluated manually 
and edited for corrections if appropriate. cDNA sequences from all samples were 
assembled together, sometimes including public human sequences, using bioinformatic 
programs to produce a consensus sequence for each assembly. Each assembly is included 
in CuraGen Corporation's database. Sequences were included as components for assembly 
when the extent of identity with another component was at least 95% over 50 bp. Each 
assembly represents a gene or portion thereof and includes information on variants, such 
as splice forms, single nucleotide polymorphisms (SNPs), insertions, deletions and other 
sequence variations. 
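
The inclusion rule for assembly components (at least 95% identity over 50 bp) can be expressed as a simple test. The Python below is a hedged sketch, not the assembly software itself: it assumes the two components have already been aligned without gaps into equal-length overlap strings, and it interprets the rule as requiring at least one 50 bp window with 95% or more matching positions. The function and parameter names are hypothetical.

```python
def meets_assembly_criterion(a: str, b: str,
                             min_identity: float = 0.95,
                             window: int = 50) -> bool:
    """True if some `window`-long stretch of the ungapped overlap a/b
    has identity >= min_identity."""
    if len(a) != len(b):
        raise ValueError("overlap strings must be pre-aligned to equal length")
    if len(a) < window:
        return False
    matches = [x == y for x, y in zip(a, b)]
    needed = min_identity * window
    count = sum(matches[:window])          # matches in the first window
    if count >= needed:
        return True
    for i in range(window, len(matches)):  # slide the window one base at a time
        count += matches[i] - matches[i - window]
        if count >= needed:
            return True
    return False


reference = "ACGT" * 15                          # 60 bp toy overlap
near_copy = "TT" + reference[2:]                 # 2 mismatches at the start
sparse = list(reference)
for i in range(0, 60, 10):                       # a mismatch every 10 bp
    sparse[i] = "T"
sparse = "".join(sparse)

print(meets_assembly_criterion(reference, near_copy))
print(meets_assembly_criterion(reference, sparse))
```

With the defaults, a window needs at least 48 matching bases out of 50; the first toy pair qualifies and the second, with a mismatch every 10 bases, does not.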

3. PathCalling™ Technology: 

The NOVX nucleic acid sequences are derived by laboratory screening of a cDNA 
library by the two-hybrid approach. cDNA fragments covering either the full length of the 
DNA sequence, or part of the sequence, or both, are sequenced. In silico prediction was 
based on sequences available in CuraGen Corporation's proprietary sequence databases or 
in the public human sequence databases, and provided either the full length DNA 
sequence, or some portion thereof. 

The laboratory screening was performed using the methods summarized below: 

cDNA libraries were derived from various human samples representing multiple 
tissue types, normal and diseased states, physiological states, and developmental states 
from different donors. Samples were obtained as whole tissue, primary cells or tissue 
cultured primary cells or cell lines. Cells and cell lines may have been treated with 
biological or chemical agents that regulate gene expression, for example, growth factors, 
chemokines or steroids. The cDNA thus derived was then directionally cloned into the 
appropriate two-hybrid vector (Gal4-activation domain (Gal4-AD) fusion). Such cDNA 
libraries as well as commercially available cDNA libraries from Clontech (Palo Alto, CA) 
were then transferred from E. coli into a CuraGen Corporation proprietary yeast strain 
(disclosed in U.S. Patents 6,057,101 and 6,083,693, incorporated herein by reference in 
their entireties). 

Gal4-binding domain (Gal4-BD) fusions of a CuraGen Corporation proprietary 
library of human sequences were used to screen multiple Gal4-AD fusion cDNA libraries, 
resulting in the selection of yeast hybrid diploids in each of which the Gal4-AD fusion 
contains an individual cDNA. Each sample was amplified using the polymerase chain 
reaction (PCR) using non-specific primers at the cDNA insert boundaries. Such PCR 
product was sequenced; sequence traces were evaluated manually and edited for 
corrections if appropriate. cDNA sequences from all samples were assembled together, 
sometimes including public human sequences, using bioinformatic programs to produce a 
consensus sequence for each assembly. Each assembly is included in CuraGen 
Corporation's database. Sequences were included as components for assembly when the 
extent of identity with another component was at least 95% over 50 bp. Each assembly 
represents a gene or portion thereof and includes information on variants, such as splice 
forms, single nucleotide polymorphisms (SNPs), insertions, deletions and other sequence 
variations. 

Physical clone: the cDNA fragment derived by the screening procedure, covering 
the entire open reading frame, is cloned as a recombinant DNA into the pACT2 plasmid 
(Clontech) used to make the cDNA library. The recombinant plasmid is inserted into the 
host and selected by the yeast hybrid diploid generated during the screening procedure by 
the mating of both CuraGen Corporation proprietary yeast strains N106' and YULH (U.S. 
Patents 6,057,101 and 6,083,693). 

4. RACE: Techniques based on the polymerase chain reaction, such as rapid 
amplification of cDNA ends (RACE), were used to isolate or complete the predicted 
sequence of the cDNA of the invention. Usually multiple clones were sequenced from one 
or more human samples to derive the sequences for fragments. Various human tissue 
samples from different donors were used for the RACE reaction. The sequences derived 
from these procedures were included in the SeqCalling Assembly process described in 
preceding paragraphs. 

5. Exon Linking: The NOVX target sequences identified in the present invention 
were subjected to the exon linking process to confirm the sequence. PCR primers were 
designed by starting at the most upstream sequence available, for the forward primer, and 
at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is either unique or highly selective was 
encountered, or, in the case of the reverse primer, until the stop codon was reached. Such 
primers were designed based on in silico predictions for the full length cDNA, part (one or 
more exons) of the DNA or protein sequence of the target sequence, or by translated 
homology of the predicted exons to closely related human sequences from other species. 
These primers were then employed in PCR amplification based on the following pool of 
human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain - whole, fetal brain, fetal 
kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, 
pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal 
cord, spleen, stomach, testis, thyroid, trachea, uterus. Usually the resulting amplicons 
were gel purified, cloned and sequenced to high redundancy. The PCR product derived 
from exon linking was cloned into the pCR2.1 vector from Invitrogen. The resulting 
bacterial clone has an insert covering the entire open reading frame cloned into the pCR2.1 
vector. The resulting sequences from all clones were assembled with themselves, with 
other fragments in CuraGen Corporation's database and with public ESTs. Fragments and 
ESTs were included as components for an assembly when the extent of their identity with 
another component of the assembly was at least 95% over 50 bp. In addition, sequence 
traces were evaluated manually and edited for corrections if appropriate. These procedures 
provide the sequence reported herein. 

6. Physical Clone: 

Exons were predicted by homology and the intron/exon boundaries were 
determined using standard genetic rules. Exons were further selected and refined by means 
of similarity determination using multiple BLAST (for example, tBlastN, BlastX, and 
BlastN) searches, and, in some instances, GeneScan and Grail. Expressed sequences from 
both public and proprietary databases were also added when available to further define and 
complete the gene sequence. The DNA sequence was then manually corrected for apparent 
inconsistencies thereby obtaining the sequences encoding the full-length protein. 

The PCR product derived by exon linking, covering the entire open reading frame, 
was cloned into the pCR2.1 vector from Invitrogen to provide clones used for expression 
and screening purposes. 

Example C: Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines 
and tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an 
Applied Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence 
Detection System. Various collections of samples are assembled on the plates, and 
referred to as Panel 1 (containing normal tissues and cancer cell lines), Panel 2 (containing 
samples derived from tissues from normal and cancer sources), Panel 3 (containing cancer 
cell lines), Panel 4 (containing cells and cell lines from normal tissues and cells related to 
inflammatory conditions), Panel 5D/5I (containing human tissues and cell lines with an 
emphasis on metabolic diseases), AI_comprehensive_panel (containing normal tissue and 
samples from autoimmune diseases), Panel CNSD.01 (containing central nervous system 
samples from normal and diseased brains) and CNS_neurodegeneration_panel (containing 
samples from normal and Alzheimer's diseased brains). 

RNA integrity from all samples is controlled for quality by visual assessment of 
agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio 
as a guide (2:1 to 2.5:1 28S:18S) and the absence of low molecular weight RNAs that 
would be indicative of degradation products. Samples are controlled against genomic 
DNA contamination by RTQ PCR reactions run in the absence of reverse transcriptase 
using probe and primer sets designed to amplify across the span of a single exon. 

First, the RNA samples were normalized to reference nucleic acids such as 
constitutively expressed genes (for example, β-actin and GAPDH). Normalized RNA (5 
ul) was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master 
Mix Reagents (Applied Biosystems; Catalog No. 4309169) and gene-specific primers 
according to the manufacturer's instructions. 

In other cases, non-normalized RNA samples were converted to single strand 
cDNA (sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147) 
and random hexamers according to the manufacturer's instructions. Reactions containing 
up to 10 ug of total RNA were performed in a volume of 20 ul and incubated for 60 
minutes at 42°C. This reaction can be scaled up to 50 ug of total RNA in a final volume of 
100 ul. sscDNA samples are then normalized to reference nucleic acids as described 
previously, using 1X TaqMan® Universal Master mix (Applied Biosystems; catalog No. 
4324020), following the manufacturer's instructions. 

Probes and primers were designed for each assay according to Applied Biosystems 
Primer Express Software package (version I for Apple Computer's Macintosh Power PC) 
or a similar algorithm using the target sequence as input. Default settings were used for 
reaction conditions and the following parameters were set before selecting primers: primer 
concentration = 250 nM, primer melting temperature (Tm) range = 58°-60°C, primer 
optimal Tm = 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe 
Tm must be 10°C greater than primer Tm, amplicon size 75bp to 100bp. The probes and 
primers selected (see below) were synthesized by Synthegen (Houston, TX, USA). Probes 
were double purified by HPLC to remove uncoupled dye and evaluated by mass 
spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' ends of the 
probe, respectively. Their final concentrations were: forward and reverse primers, 900nM 
each, and probe, 200nM. 
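
The selection parameters listed above can be applied as a checklist to a candidate primer and probe set. The Python below is only a sketch of such a check and does not reproduce the Primer Express software: melting temperatures are taken as inputs rather than computed, "10°C greater" is read as "at least 10°C above the higher primer Tm", and the class name, field names and example values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Assay:
    fwd_tm: float       # forward primer Tm, deg C
    rev_tm: float       # reverse primer Tm, deg C
    probe_tm: float     # probe Tm, deg C
    probe_seq: str      # probe sequence, written 5' to 3'
    amplicon_len: int   # amplicon size, bp


def violations(a: Assay) -> list:
    """Return a list of the design parameters that the assay fails."""
    problems = []
    for name, tm in (("forward", a.fwd_tm), ("reverse", a.rev_tm)):
        if not 58.0 <= tm <= 60.0:
            problems.append(f"{name} primer Tm {tm} outside 58-60 C")
    if abs(a.fwd_tm - a.rev_tm) > 2.0:
        problems.append("primer Tm difference exceeds 2 C")
    if a.probe_seq.upper().startswith("G"):
        problems.append("probe has a 5' G")
    if a.probe_tm < max(a.fwd_tm, a.rev_tm) + 10.0:
        problems.append("probe Tm is not 10 C above primer Tm")
    if not 75 <= a.amplicon_len <= 100:
        problems.append("amplicon size outside 75-100 bp")
    return problems


good = Assay(59.0, 58.5, 69.5, "CACCGTGTTCGACATCGCTGTCG", 85)
bad = Assay(57.0, 60.0, 65.0, "GTTCGACATCGCTGTCGACG", 120)
print(violations(good))
print(violations(bad))
```

An empty list indicates that the candidate satisfies every listed parameter; each string in a non-empty list names one parameter that was missed.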

PCR conditions: When working with RNA samples, normalized RNA from each 
tissue and each cell line was spotted in each well of either a 96 well or a 384-well PCR 
plate (Applied Biosystems). PCR cocktails included either a single gene specific probe and 
primers set, or two multiplexed probe and primers sets (a set specific for the target clone 
and another gene-specific set multiplexed with the target probe). PCR reactions were set 
up using TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 
4313803) following manufacturer's instructions. Reverse transcription was performed at 
48°C for 30 minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 
40 cycles of 95°C for 15 seconds, 60°C for 1 minute. Results were recorded as CT values 
(cycle at which a given sample crosses a threshold level of fluorescence) using a log scale, 
with the difference in RNA concentration between a given sample and the sample with the 
lowest CT value being represented as 2 to the power of delta CT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and 
multiplying by 100. 
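
The conversion from CT values to percent relative expression can be written out explicitly. The Python below is a minimal sketch of the calculation as described in the preceding paragraph: delta CT is the difference between a sample's CT and the lowest CT on the plate, the RNA difference is 2 raised to delta CT, and the percent relative expression is 100 divided by that difference. The sample names and CT values are invented for illustration.

```python
def percent_relative_expression(ct_values: dict) -> dict:
    """Map each sample to its expression relative to the sample with the
    lowest CT (which is assigned 100%)."""
    ct_min = min(ct_values.values())
    result = {}
    for sample, ct in ct_values.items():
        delta_ct = ct - ct_min
        rna_difference = 2 ** delta_ct       # fold difference in RNA
        result[sample] = 100.0 / rna_difference
    return result


# Hypothetical CT values for three wells.
example = {"tissue A": 20.0, "tissue B": 21.0, "tissue C": 23.0}
for sample, pct in percent_relative_expression(example).items():
    print(f"{sample}: {pct:.1f}%")
```

Each additional cycle needed to reach threshold halves the relative expression, so a sample three cycles behind the most abundant one is reported at 12.5%.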

When working with sscDNA samples, normalized sscDNA was used as described 
previously for RNA samples. PCR reactions containing one or two sets of probe and 
primers were set up as described previously, using 1X TaqMan® Universal Master mix 
(Applied Biosystems; catalog No. 4324020), following the manufacturer's instructions. 
PCR amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15 
seconds, 60°C for 1 minute. Results were analyzed and processed as described previously. 
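
The two thermal profiles described above differ only in the reverse transcription step. The Python below is a small sketch that encodes the stated hold times and sums them; it assumes that instrument ramp times between temperatures are ignored, so the totals are lower bounds on actual run time. The function name and constant names are illustrative.

```python
RT_STEP_MIN = 30.0         # 48 C reverse transcription (RNA samples only)
INITIAL_DENATURE_MIN = 10.0  # 95 C for 10 min
CYCLES = 40
DENATURE_MIN = 15.0 / 60.0   # 95 C for 15 seconds
ANNEAL_EXTEND_MIN = 1.0      # 60 C for 1 minute


def hold_time_minutes(with_reverse_transcription: bool) -> float:
    """Total programmed hold time, excluding temperature ramps."""
    total = INITIAL_DENATURE_MIN + CYCLES * (DENATURE_MIN + ANNEAL_EXTEND_MIN)
    if with_reverse_transcription:
        total += RT_STEP_MIN
    return total


print("RNA samples (one-step RT-PCR):", hold_time_minutes(True), "min")
print("sscDNA samples:", hold_time_minutes(False), "min")
```

Under this assumption the one-step RT-PCR profile holds for 90 minutes and the sscDNA profile for 60 minutes.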
Panels 1, 1.1, 1.2, and 1.3D 

The plates for Panels 1, 1.1, 1.2 and 1.3D include 2 control wells (genomic DNA 
control and chemistry control) and 94 wells containing cDNA from various samples. The 
samples in these panels are broken into 2 classes: samples derived from cultured cell lines 
and samples derived from primary normal tissues. The cell lines are derived from cancers 
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
gastric cancer and pancreatic cancer. Cell lines used in these panels are widely available 
through the American Type Culture Collection (ATCC), a repository for cultured cell 
lines, and were cultured using the conditions recommended by the ATCC. The normal 
tissues found on these panels are comprised of samples derived from all major organ 
systems from single adult individuals or fetuses. These samples are derived from the 
following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult 
kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various regions of the 
brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, pituitary gland, 
adrenal gland, spinal cord, thymus, stomach, small intestine, colon, bladder, trachea, 
breast, ovary, uterus, placenta, prostate, testis and adipose. 

In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used: 

ca. = carcinoma, 

* = established from metastasis, 

met = metastasis, 

s cell var = small cell variant, 

non-s = non-sm = non-small, 

squam = squamous, 

pl. eff = pl effusion = pleural effusion, 

glio = glioma, 

astro = astrocytoma, and 

neuro = neuroblastoma. 

General_screening_panel_v1.4 and General_screening_panel_v1.5 

The plates for Panels 1.4 and 1.5 include 2 control wells (genomic DNA control 
and chemistry control) and 94 wells containing cDNA from various samples. The samples 
in Panels 1.4 and 1.5 are broken into 2 classes: samples derived from cultured cell lines 
and samples derived from primary normal tissues. The cell lines are derived from cancers 
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
gastric cancer and pancreatic cancer. Cell lines used in Panel 1.4 are widely available 
through the American Type Culture Collection (ATCC), a repository for cultured cell 
lines, and were cultured using the conditions recommended by the ATCC. The normal 
tissues found on Panels 1.4 and 1.5 are comprised of pools of samples derived from all 
major organ systems from 2 to 5 different adult individuals or fetuses. These samples are 
derived from the following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, 
fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various 
regions of the brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, 
pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, colon, 
bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. Abbreviations 
are as described for Panels 1, 1.1, 1.2, and 1.3D. 
Panels 2D and 2.2 

The plates for Panels 2D and 2.2 generally include 2 control wells and 94 test 
samples composed of RNA or cDNA isolated from human tissue procured by surgeons 
working in close cooperation with the National Cancer Institute's Cooperative Human 
Tissue Network (CHTN) or the National Disease Research Interchange (NDRI). The tissues 
are derived from human malignancies and, in cases where indicated, many malignant 
tissues have "matched margins" obtained from noncancerous tissue just adjacent to the 
tumor. These are termed normal adjacent tissues and are denoted "NAT" in the results 
below. The tumor tissue and the "matched margins" are evaluated by two independent 
pathologists (the surgical pathologists and again by a pathologist at NDRI or CHTN). This 
analysis provides a gross histopathological assessment of tumor differentiation grade. 
Moreover, most samples include the original surgical pathology report that provides 
information regarding the clinical stage of the patient. These matched margins are taken 
from the tissue surrounding (i.e. immediately proximal to) the zone of surgery (designated 
"NAT", for normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples 
were obtained from various human tissues derived from autopsies performed on elderly 
people or sudden death victims (accidents, etc.). These tissues were ascertained to be free 
of disease and were purchased from various commercial sources such as Clontech (Palo 
Alto, CA), Research Genetics, and Invitrogen. 
Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA samples and two control 
samples. Specifically, 92 of these samples are derived from cultured human cancer cell 
lines and 2 are samples of human primary cerebellar tissue; the remaining 2 wells are 
controls. The human cell lines 
are generally obtained from ATCC (American Type Culture Collection), NCI or the 
German tumor cell bank and fall into the following tissue groups: squamous cell 
carcinoma of the tongue, breast cancer, prostate cancer, melanoma, epidermoid carcinoma, 
sarcomas, bladder carcinomas, pancreatic cancers, kidney cancers, leukemias/lymphomas, 
ovarian/uterine/cervical, gastric, colon, lung and CNS cancer cell lines. In addition, there 
are two independent samples of cerebellum. These cells are all cultured under standard 
recommended conditions and RNA extracted using the standard procedures. The cell lines 
in panel 3D and 1.3D are among the most common cell lines used in the scientific literature. 
Panels 4D, 4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) 
composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human 
cell lines or tissues related to inflammatory conditions. Total RNA from control normal 
tissues such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney 
(Clontech) was employed. Total RNA from liver tissue from cirrhosis patients and kidney 
from lupus patients was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). 
Intestinal tissue for RNA preparation from patients diagnosed as having Crohn's disease 
and ulcerative colitis was obtained from the National Disease Research Interchange 
(NDRI) (Philadelphia, PA). 

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle 
cells, small airway epithelium, bronchial epithelium, microvascular dermal endothelial 
cells, microvascular lung endothelial cells, human pulmonary aortic endothelial cells and 
human umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, 
MD) and grown in the media supplied for these cell types by Clonetics. These primary cell 
types were activated with various cytokines or combinations of cytokines for 6 and/or 12- 
14 hours, as indicated. The following cytokines were used: IL-1 beta at approximately 1- 
5ng/ml, TNF alpha at approximately 5-10ng/ml, IFN gamma at approximately 20- 
50ng/ml, IL-4 at approximately 5-10ng/ml, IL-9 at approximately 5-10ng/ml, IL-13 at 
approximately 5-10ng/ml. Endothelial cells were sometimes starved for various times by 
culture in the basal media from Clonetics with 0.1% serum. 

Mononuclear cells were prepared from blood of employees at CuraGen 
Corporation, using Ficoll. LAK cells were prepared from these cells by culture in DMEM 
5% FCS (Hyclone), 100uM non essential amino acids (Gibco/Life Technologies, 
Rockville, MD), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), and 
10mM Hepes (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 
10-20ng/ml PMA and 1-2ug/ml ionomycin, IL-12 at 5-10ng/ml, IFN gamma at 20- 
50ng/ml and IL-18 at 5-10ng/ml for 6 hours. In some cases, mononuclear cells were 
cultured for 4-5 days in DMEM 5% FCS (Hyclone), 100uM non essential amino acids 
(Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), and 10mM 
Hepes (Gibco) with PHA (phytohemagglutinin) or PWM (pokeweed mitogen) at 
approximately 5ug/ml. Samples were taken at 24, 48 and 72 hours for RNA preparation. 
MLR (mixed lymphocyte reaction) samples were obtained by taking blood from two 
donors, isolating the mononuclear cells using Ficoll and mixing the isolated mononuclear 
cells 1:1 at a final concentration of approximately 2x10^6 cells/ml in DMEM 5% FCS 
(Hyclone), 100uM non essential amino acids (Gibco), 1mM sodium pyruvate (Gibco), 
mercaptoethanol (5.5x10^-5 M) (Gibco), and 10mM Hepes (Gibco). The MLR was cultured 
and samples taken at various time points ranging from 1-7 days for RNA preparation. 

Monocytes were isolated from mononuclear cells using CD14 Miltenyi Beads, positive 
VS selection columns and a Vario Magnet according to the manufacturer's instructions. 
Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf 
serum (FCS) (Hyclone, Logan, UT), 100uM non essential amino acids (Gibco), 1mM 
sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), and 10mM Hepes (Gibco), 
50ng/ml GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of 
monocytes for 5-7 days in DMEM 5% FCS (Hyclone), 100uM non essential amino acids 
(Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), 10mM 
Hepes (Gibco) and 10% AB Human Serum or MCSF at approximately 50ng/ml. 
Monocytes, macrophages and dendritic cells were stimulated for 6 and 12-14 hours with 
lipopolysaccharide (LPS) at 100ng/ml. Dendritic cells were also stimulated with anti- 
CD40 monoclonal antibody (Pharmingen) at 10ug/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection 
columns and a Vario Magnet according to the manufacturer's instructions. CD45RA and 
CD45RO CD4 lymphocytes were isolated by depleting mononuclear cells of CD8, CD56, 
CD14 and CD19 cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive 
selection. CD45RO beads were then used to isolate the CD45RO CD4 lymphocytes with 
the remaining cells being CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and 
CD8 lymphocytes were placed in DMEM 5% FCS (Hyclone), 100uM non essential amino 
acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), and 
10mM Hepes (Gibco) and plated at 10^6 cells/ml onto Falcon 6 well tissue culture plates 
that had been coated overnight with 0.5ug/ml anti-CD28 (Pharmingen) and 3ug/ml anti- 
CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells were harvested for RNA 
preparation. To prepare chronically activated CD8 lymphocytes, we activated the isolated 
CD8 lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and then harvested 
the cells and expanded them in DMEM 5% FCS (Hyclone), 100uM non essential amino 
acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), and 
10mM Hepes (Gibco) and IL-2. The expanded CD8 cells were then activated again with 
plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was 
isolated 6 and 24 hours after the second activation and after 4 days of the second 
expansion culture. The isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 
100uM non essential amino acids (Gibco), 1mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5x10^-5 M (Gibco), and 10mM Hepes (Gibco) and IL-2 for 4-6 days 
before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with 
sterile dissecting scissors and then passed through a sieve. Tonsil cells were then spun 
down and resuspended at 10^6 cells/ml in DMEM 5% FCS (Hyclone), 100uM non essential 
amino acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), 
and 10mM Hepes (Gibco). To activate the cells, we used PWM at 5ug/ml or anti-CD40 
(Pharmingen) at approximately 10ug/ml and IL-4 at 5-10ng/ml. Cells were harvested for 
RNA preparation at 24, 48 and 72 hours. 

To prepare the primary and secondary Th1/Th2 and Tr1 cells, six-well Falcon 
plates were coated overnight with 10ug/ml anti-CD28 (Pharmingen) and 2ug/ml OKT3 
(ATCC), and then washed twice with PBS. Umbilical cord blood CD4 lymphocytes 
(Poietic Systems, Germantown, MD) were cultured at 10^5-10^6 cells/ml in DMEM 5% 
FCS (Hyclone), 100uM non essential amino acids (Gibco), 1mM sodium pyruvate 
(Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), 10mM Hepes (Gibco) and IL-2 (4ng/ml). 
IL-12 (5ng/ml) and anti-IL4 (1ug/ml) were used to direct to Th1, while IL-4 (5ng/ml) and 
anti-IFN gamma (1ug/ml) were used to direct to Th2 and IL-10 at 5ng/ml was used to 
direct to Tr1. After 4-5 days, the activated Th1, Th2 and Tr1 lymphocytes were washed 
once in DMEM and expanded for 4-7 days in DMEM 5% FCS (Hyclone), 100uM non 
essential amino acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M 
(Gibco), 10mM Hepes (Gibco) and IL-2 (1ng/ml). Following this, the activated Th1, 
Th2 and Tr1 lymphocytes were re-stimulated for 5 days with anti-CD28/OKT3 and 
cytokines as described above, but with the addition of anti-CD95L (1ug/ml) to prevent 
apoptosis. After 4-5 days, the Th1, Th2 and Tr1 lymphocytes were washed and then 
expanded again with IL-2 for 4-7 days. Activated Th1 and Th2 lymphocytes were 
maintained in this way for a maximum of three cycles. RNA was prepared from primary 
and secondary Th1, Th2 and Tr1 after 6 and 24 hours following the second and third 
activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second 
and third expansion cultures in Interleukin 2. 

The following leukocyte cell lines were obtained from the ATCC: Ramos, EOL-1,
KU-812. EOL cells were further differentiated by culture in 0.1mM dbcAMP at
5x10⁵ cells/ml for 8 days, changing the media every 3 days and adjusting the cell
concentration to 5x10⁵ cells/ml. For the culture of these cells, we used DMEM or RPMI (as
recommended by the ATCC), with the addition of 5% FCS (Hyclone), 100µM
non-essential amino acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol
5.5x10⁻⁵ M (Gibco), 10mM Hepes (Gibco). RNA was either prepared from resting cells or
cells activated with PMA at 10ng/ml and ionomycin at 1µg/ml for 6 and 14 hours.
Keratinocyte line CCD1106 and an airway epithelial tumor line NCI-H292 were also
obtained from the ATCC. Both were cultured in DMEM 5% FCS (Hyclone), 100µM
non-essential amino acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol
5.5x10⁻⁵ M (Gibco), and 10mM Hepes (Gibco). CCD1106 cells were activated for 6 and
14 hours with approximately 5ng/ml TNF alpha and 1ng/ml IL-1 beta, while NCI-H292
cells were activated for 6 and 14 hours with the following cytokines: 5ng/ml IL-4,
5ng/ml IL-9, 5ng/ml IL-13 and 25ng/ml IFN gamma.

For these cell lines and blood cells, RNA was prepared by lysing approximately
10⁷ cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane
(Molecular Research Corporation) was added to the RNA sample, vortexed and after 10
minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor.
The aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of
isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down
at 9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet
was redissolved in 300µl of RNAse-free water and 35µl buffer (Promega), 5µl DTT, 7µl
RNAsin and 8µl DNAse were added. The tube was incubated at 37°C for 30 minutes to
remove contaminating genomic DNA, extracted once with phenol chloroform and
re-precipitated with 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol.
The RNA was spun down and placed in RNAse free water. RNA was stored at -80°C.

AI_comprehensive panel_v1.0

The plates for AI_comprehensive panel_v1.0 include two control wells and 89 test
samples comprised of cDNA isolated from surgical and postmortem human tissues
obtained from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was
extracted from tissue samples from the Backus Hospital in the Facility at CuraGen. Total
RNA from other tissues was obtained from Clinomics.

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained
from patients undergoing total knee or hip replacement surgery at the Backus Hospital.
Tissue samples were immediately snap frozen in liquid nitrogen to ensure that isolated
RNA was of optimal quality and not degraded. Additional samples of osteoarthritis and
rheumatoid arthritis joint tissues were obtained from Clinomics. Normal control tissues
were supplied by Clinomics and were obtained during autopsy of trauma victims.

Surgical specimens of psoriatic tissues and adjacent matched tissues were provided 
as total RNA by Clinomics. Two male and two female patients were selected between the 
ages of 25 and 47. None of the patients were taking prescription drugs at the time samples 
were isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and
Crohn's disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue
from three female and three male Crohn's patients between the ages of 41 and 69 was
used. Two patients were not on prescription medication while the others were taking
dexamethasone, phenobarbital, or Tylenol. Ulcerative colitis tissue was from three male
and four female patients. Four of the patients were taking lebvid and two were on
phenobarbital.

Total RNA from post mortem lung tissue from trauma victims with no disease or
with emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients
ranged in age from 40-70 and all were smokers; this age range was chosen to focus on
patients with cigarette-linked emphysema and to avoid those patients with alpha-1
anti-trypsin deficiencies. Asthma patients ranged in age from 36-75, and smokers were
excluded to avoid patients who could also have COPD. COPD patients ranged in age from
35-80 and included both smokers and non-smokers. Most patients were taking
corticosteroids and bronchodilators.

In the labels employed to identify tissues in the AI_comprehensive panel_v1.0
panel, the following abbreviations are used:

AI = Autoimmunity
Syn = Synovial
Normal = No apparent disease
Rep22 / Rep20 = Individual patients
RA = Rheumatoid arthritis
Backus = From Backus Hospital
OA = Osteoarthritis
(SS) (BA) (MF) = Individual patients
Adj = Adjacent tissue
Match control = Adjacent tissues
-M = Male
-F = Female
COPD = Chronic obstructive pulmonary disease

Panels 5D and 5I

The plates for Panel 5D and 5I include two control wells and a variety of cDNAs
isolated from human tissues and cell lines with an emphasis on metabolic diseases.
Metabolic tissues were obtained from patients enrolled in the Gestational Diabetes study.
Cells were obtained during different stages in the differentiation of adipocytes from
human mesenchymal stem cells. Human pancreatic islets were also obtained.

In the Gestational Diabetes study, subjects are young (18-40 years), otherwise
healthy women with and without gestational diabetes undergoing routine (elective)
Caesarean section. After delivery of the infant, when the surgical incisions were being
repaired/closed, the obstetrician removed a small sample (<1 cc) of the exposed metabolic
tissues during the closure of each surgical level. The biopsy material was rinsed in sterile
saline, blotted and fast frozen within 5 minutes from the time of removal. The tissue was
then flash frozen in liquid nitrogen and stored, individually, in sterile screw-top tubes and
kept on dry ice for shipment to or to be picked up by CuraGen. The metabolic tissues of
interest include uterine wall (smooth muscle), visceral adipose, skeletal muscle (rectus)
and subcutaneous adipose. Patient descriptions are as follows:

Patient 2: Diabetic Hispanic, overweight, not on insulin 

Patient 7-9: Nondiabetic Caucasian and obese (BMI>30) 

Patient 10: Diabetic Hispanic, overweight, on insulin 

Patient 11: Nondiabetic African American and overweight

Patient 12: Diabetic Hispanic on insulin 

Adipocyte differentiation was induced in donor progenitor cells obtained from 
Osirus (a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had 
only two replicates. Scientists at Clonetics isolated, grew and differentiated human 
mesenchymal stem cells (HuMSCs) for CuraGen based on the published protocol found in 
Mark F. Pittenger, et al., Multilineage Potential of Adult Human Mesenchymal Stem Cells,
Science, Apr 2 1999: 143-147. Clonetics provided Trizol lysates or frozen pellets suitable
for mRNA isolation and ds cDNA production. A general description of each donor is as 
follows: 

Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 
Donor 2 and 3 AM: Adipose, Adipose Midway Differentiated
Donor 2 and 3 AD: Adipose, Adipose Differentiated 

Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver 
HepG2 cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These 
cells are all cultured under standard recommended conditions and RNA extracted using 
the standard procedures. All samples were processed at CuraGen to produce single 
stranded cDNA. 

Panel 5I contains all samples previously described with the addition of pancreatic
islets from a 58 year old female patient obtained from the Diabetes Research Institute at
the University of Miami School of Medicine. Islet tissue was processed to total RNA at an
outside source and delivered to CuraGen for addition to panel 5I.

In the labels employed to identify tissues in the 5D and 5I panels, the following
abbreviations are used:

GO Adipose = Greater Omentum Adipose 
SK = Skeletal Muscle 
UT = Uterus 
PL = Placenta 

AD = Adipose Differentiated 
AM = Adipose Midway Differentiated
U = Undifferentiated Stem Cells 

Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
comprised of cDNA isolated from postmortem human brain tissue obtained from the 
Harvard Brain Tissue Resource Center. Brains are removed from calvaria of donors 
between 4 and 24 hours after death, sectioned by neuroanatomists, and frozen at -80°C in 
liquid nitrogen vapor. All brains are sectioned and examined by neuropathologists to 
confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains two brains
from each of the following diagnoses: Alzheimer's disease, Parkinson's disease,
Huntington's disease, Progressive Supranuclear Palsy, Depression, and "Normal controls".
Within each of these brains, the following regions are represented: cingulate gyrus,
temporal pole, globus pallidus, substantia nigra, Brodmann Area 4 (primary motor strip),
Brodmann Area 7 (parietal cortex), Brodmann Area 9 (prefrontal cortex), and Brodmann
Area 17 (occipital cortex). Not all brain regions are represented in all cases; e.g.,
Huntington's disease is characterized in part by neurodegeneration in the globus pallidus,
thus this region is impossible to obtain from confirmed Huntington's cases. Likewise
Parkinson's disease is characterized by degeneration of the substantia nigra, making this
region more difficult to obtain. Normal control brains were examined for neuropathology
and found to be free of any pathology consistent with neurodegeneration.

In the labels employed to identify tissues in the CNS panel, the following
abbreviations are used:

PSP = Progressive supranuclear palsy
Sub Nigra = Substantia nigra
Glob Palladus = Globus pallidus
Temp Pole = Temporal pole
Cing Gyr = Cingulate gyrus
BA 4 = Brodmann Area 4

Panel CNS_Neurodegeneration_V1.0

The plates for Panel CNS_Neurodegeneration_V1.0 include two control wells and
47 test samples comprised of cDNA isolated from postmortem human brain tissue
obtained from the Harvard Brain Tissue Resource Center (McLean Hospital) and the
Human Brain and Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare
System). Brains are removed from calvaria of donors between 4 and 24 hours after death,
sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are
sectioned and examined by neuropathologists to confirm diagnoses with clear associated
neuropathology.

Disease diagnoses are taken from patient records. The panel contains six brains
from Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who
showed no evidence of dementia prior to death. The eight normal control brains are
divided into two categories: controls with no dementia and no Alzheimer's-like pathology
(Controls) and controls with no dementia but evidence of severe Alzheimer's-like
pathology (specifically, senile plaque load rated as level 3 on a scale of 0-3; 0 = no
evidence of plaques, 3 = severe AD senile plaque load). Within each of these brains, the
following regions are represented: hippocampus, temporal cortex (Brodmann Area 21),
parietal cortex (Brodmann Area 7), and occipital cortex (Brodmann Area 17). These regions
were chosen to encompass all levels of neurodegeneration in AD. The hippocampus is a
region of early and severe neuronal loss in AD; the temporal cortex is known to show
neurodegeneration in AD after the hippocampus; the parietal cortex shows moderate
neuronal death in the late stages of the disease; the occipital cortex is spared in AD and
therefore acts as a "control" region within AD patients. Not all brain regions are
represented in all cases.

In the labels employed to identify tissues in the CNS_Neurodegeneration_V1.0
panel, the following abbreviations are used:

AD = Alzheimer's disease brain; patient was demented and showed AD-like
pathology upon autopsy
Control = Control brains; patient not demented, showing no neuropathology
Control (Path) = Control brains; patient not demented but showing severe AD-like
pathology
SupTemporal Ctx = Superior Temporal Cortex
Inf Temporal Ctx = Inferior Temporal Cortex

The quantitative expression of various clones was assessed using microtiter plates
containing RNA samples from a variety of normal and pathology-derived cells, cell lines
and tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an
Applied Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence
Detection System. Various collections of samples are assembled on the plates, and
referred to as Panel 1 (containing normal tissues and cancer cell lines), Panel 2 (containing
samples derived from tissues from normal and cancer sources), Panel 3 (containing cancer
cell lines), Panel 4 (containing cells and cell lines from normal tissues and cells related to
inflammatory conditions), Panel 5D/5I (containing human tissues and cell lines with an
emphasis on metabolic diseases), AI_comprehensive_panel (containing normal tissue and
samples from autoinflammatory diseases), Panel CNSD.01 (containing samples from
normal and diseased brains) and CNS_neurodegeneration_panel (containing samples from
normal and Alzheimer's diseased brains).

RNA integrity from all samples is controlled for quality by visual assessment of
agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio
as a guide (2:1 to 2.5:1 28S:18S) and the absence of low molecular weight RNAs that
would be indicative of degradation products. Samples are controlled against genomic
DNA contamination by RTQ PCR reactions run in the absence of reverse transcriptase
using probe and primer sets designed to amplify across the span of a single exon.
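
The integrity criterion described above is a visual assessment, but it can be sketched as a simple check. The following is a minimal illustration only; the function name, its arguments and the idea of scoring the gel numerically are assumptions made for this example and are not part of the described procedure.

```python
def rna_passes_integrity_check(ratio_28s_to_18s, low_mw_rna_present,
                               min_ratio=2.0, max_ratio=2.5):
    """Return True when a sample meets the guide criteria stated in the text.

    ratio_28s_to_18s   -- hypothetical numeric estimate of the 28S:18S
                          staining intensity ratio read from the gel
    low_mw_rna_present -- True if low molecular weight RNA (indicative of
                          degradation products) is visible
    """
    ratio_ok = min_ratio <= ratio_28s_to_18s <= max_ratio
    return ratio_ok and not low_mw_rna_present


print(rna_passes_integrity_check(2.2, low_mw_rna_present=False))
print(rna_passes_integrity_check(1.4, low_mw_rna_present=False))
```

Because the text uses the ratio only "as a guide", the hard cut-offs here should be read as a convenience for illustration rather than as acceptance limits.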

First, the RNA samples were normalized to reference nucleic acids such as
constitutively expressed genes (for example, β-actin and GAPDH). Normalized RNA (5
µl) was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master
Mix Reagents (Applied Biosystems; Catalog No. 4309169) and gene-specific primers
according to the manufacturer's instructions.

In other cases, non-normalized RNA samples were converted to single strand
cDNA (sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147)
and random hexamers according to the manufacturer's instructions. Reactions containing
up to 10 µg of total RNA were performed in a volume of 20 µl and incubated for 60
minutes at 42°C. This reaction can be scaled up to 50 µg of total RNA in a final volume of
100 µl. sscDNA samples are then normalized to reference nucleic acids as described
previously, using 1X TaqMan® Universal Master mix (Applied Biosystems; catalog No.
4324020), following the manufacturer's instructions.
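
Both reaction sizes quoted above (10 µg in 20 µl and 50 µg in 100 µl) correspond to the same maximum loading of 0.5 µg of total RNA per µl. The sketch below shows the scaling arithmetic under that reading; the function name and the decision to treat 20 µl as the smallest reaction are assumptions for illustration.

```python
MAX_RNA_UG_PER_UL = 10.0 / 20.0   # 0.5 ug/ul, from "10 ug in 20 ul"
MIN_VOLUME_UL = 20.0              # assumed smallest reaction volume
MAX_RNA_UG = 50.0                 # upper limit stated in the text


def rt_reaction_volume_ul(total_rna_ug):
    """Smallest reaction volume (ul) that keeps RNA loading at or below 0.5 ug/ul."""
    if total_rna_ug <= 0 or total_rna_ug > MAX_RNA_UG:
        raise ValueError("total RNA must be between 0 and 50 ug")
    return max(MIN_VOLUME_UL, total_rna_ug / MAX_RNA_UG_PER_UL)


print(rt_reaction_volume_ul(10))   # 20.0
print(rt_reaction_volume_ul(50))   # 100.0
```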

Probes and primers were designed for each assay according to Applied Biosystems
Primer Express Software package (version I for Apple Computer's Macintosh Power PC)
or a similar algorithm using the target sequence as input. Default settings were used for
reaction conditions and the following parameters were set before selecting primers: primer
concentration = 250 nM, primer melting temperature (Tm) range = 58°-60°C, primer
optimal Tm = 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe
Tm must be 10°C greater than primer Tm, amplicon size 75bp to 100bp. The probes and
primers selected (see below) were synthesized by Synthegen (Houston, TX, USA). Probes
were double purified by HPLC to remove uncoupled dye and evaluated by mass
spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' ends of the
probe, respectively. Their final concentrations were: forward and reverse primers, 900nM
each, and probe, 200nM.
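
The selection parameters listed above can be expressed as a set of checks on a candidate primer and probe set. This is a minimal sketch, not the Primer Express algorithm: the `AssayDesign` class and `design_violations` function are hypothetical names, Tm values are assumed to be supplied by the design software, and the probe rule is read here as "at least 10°C above the higher primer Tm".

```python
from dataclasses import dataclass


@dataclass
class AssayDesign:
    forward_tm: float   # forward primer Tm, degrees C
    reverse_tm: float   # reverse primer Tm, degrees C
    probe_tm: float     # probe Tm, degrees C
    probe_seq: str      # probe sequence, written 5' to 3'
    amplicon_bp: int    # amplicon length in base pairs


def design_violations(d):
    """Return a list of the stated design parameters that the candidate fails."""
    problems = []
    for name, tm in (("forward", d.forward_tm), ("reverse", d.reverse_tm)):
        if not 58.0 <= tm <= 60.0:
            problems.append(f"{name} primer Tm outside 58-60 C")
    if abs(d.forward_tm - d.reverse_tm) > 2.0:
        problems.append("primer Tm difference exceeds 2 C")
    if d.probe_seq.upper().startswith("G"):
        problems.append("probe has a 5' G")
    if d.probe_tm < max(d.forward_tm, d.reverse_tm) + 10.0:
        problems.append("probe Tm is not 10 C above primer Tm")
    if not 75 <= d.amplicon_bp <= 100:
        problems.append("amplicon size outside 75-100 bp")
    return problems


candidate = AssayDesign(59.0, 58.5, 69.5, "ACCTGGTCAAGTTCGACCTGA", 82)
print(design_violations(candidate))   # []
```

Returning every violation, rather than stopping at the first, makes it easier to see why a candidate set was rejected.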

PCR conditions: When working with RNA samples, normalized RNA from each
tissue and each cell line was spotted in each well of either a 96 well or a 384-well PCR
plate (Applied Biosystems). PCR cocktails included either a single gene specific probe and
primers set, or two multiplexed probe and primers sets (a set specific for the target clone
and another gene-specific set multiplexed with the target probe). PCR reactions were set
up using TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No.
4313803) following manufacturer's instructions. Reverse transcription was performed at
48°C for 30 minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then
40 cycles of 95°C for 15 seconds, 60°C for 1 minute. Results were recorded as CT values
(cycle at which a given sample crosses a threshold level of fluorescence) using a log scale,
with the difference in RNA concentration between a given sample and the sample with the
lowest CT value being represented as 2 to the power of delta CT. The percent relative
expression is then obtained by taking the reciprocal of this RNA difference and
multiplying by 100.
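
The calculation described above can be written out as a short worked example. The sketch assumes CT values are held in a dictionary keyed by sample name; the function name and the sample labels are illustrative only.

```python
def percent_relative_expression(ct_values):
    """Convert CT values to percent relative expression.

    For each sample, delta CT is its CT minus the lowest CT on the plate,
    the RNA difference is 2 ** delta CT, and the percent relative
    expression is the reciprocal of that difference multiplied by 100.
    """
    if not ct_values:
        raise ValueError("no CT values supplied")
    lowest_ct = min(ct_values.values())
    return {
        sample: 100.0 / (2.0 ** (ct - lowest_ct))
        for sample, ct in ct_values.items()
    }


example = {"tissue A": 20.0, "tissue B": 21.0, "tissue C": 23.0}
print(percent_relative_expression(example))
# {'tissue A': 100.0, 'tissue B': 50.0, 'tissue C': 12.5}
```

The sample with the lowest CT always scores 100%, and each additional cycle halves the reported value, which reflects the assumption of one doubling per cycle implicit in "2 to the power of delta CT".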

When working with sscDNA samples, normalized sscDNA was used as described
previously for RNA samples. PCR reactions containing one or two sets of probe and
primers were set up as described previously, using 1X TaqMan® Universal Master mix
(Applied Biosystems; catalog No. 4324020), following the manufacturer's instructions.
PCR amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15
seconds, 60°C for 1 minute. Results were analyzed and processed as described previously.
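
As a rough planning aid, the two thermal programs described above can be written as data and their total hold times summed. The representation below (a list of stages, each with a cycle count and a list of temperature/duration steps) is an assumption made for this sketch, and instrument ramp times are ignored.

```python
# Each stage is (number of cycles, [(temperature in C, hold time in seconds), ...]).
RT_STAGE = (1, [(48, 30 * 60)])            # reverse transcription, one-step protocol only
PCR_STAGES = [
    (1, [(95, 10 * 60)]),                  # 95 C for 10 min
    (40, [(95, 15), (60, 60)]),            # 40 cycles of 95 C 15 s, 60 C 1 min
]


def program_seconds(stages):
    """Total programmed hold time in seconds, excluding temperature ramps."""
    return sum(cycles * sum(seconds for _, seconds in steps)
               for cycles, steps in stages)


print(program_seconds(PCR_STAGES) / 60)                # 60.0 minutes (sscDNA samples)
print(program_seconds([RT_STAGE] + PCR_STAGES) / 60)   # 90.0 minutes (RNA samples)
```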

Panels 1, 1.1, 1.2, and 1.3D

The plates for Panels 1, 1.1, 1.2 and 1.3D include 2 control wells (genomic DNA
control and chemistry control) and 94 wells containing cDNA from various samples. The
samples in these panels are broken into 2 classes: samples derived from cultured cell lines
and samples derived from primary normal tissues. The cell lines are derived from cancers
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer,
gastric cancer and pancreatic cancer. Cell lines used in these panels are widely available
through the American Type Culture Collection (ATCC), a repository for cultured cell
lines, and were cultured using the conditions recommended by the ATCC. The normal
tissues found on these panels are comprised of samples derived from all major organ
systems from single adult individuals or fetuses. These samples are derived from the
following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult
kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various regions of the
brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, pituitary gland,
adrenal gland, spinal cord, thymus, stomach, small intestine, colon, bladder, trachea,
breast, ovary, uterus, placenta, prostate, testis and adipose.

In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used:

ca. = carcinoma,
* = established from metastasis,
met = metastasis,
s cell var = small cell variant,
non-s = non-sm = non-small,
squam = squamous,
pl. eff = pl effusion = pleural effusion,
glio = glioma,
astro = astrocytoma, and
neuro = neuroblastoma.

General_screening_panel_v1.4 and General_screening_panel_v1.5

The plates for Panels 1.4 and 1.5 include 2 control wells (genomic DNA control
and chemistry control) and 94 wells containing cDNA from various samples. The samples
in Panels 1.4 and 1.5 are broken into 2 classes: samples derived from cultured cell lines
and samples derived from primary normal tissues. The cell lines are derived from cancers
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer,
gastric cancer and pancreatic cancer. Cell lines used in Panels 1.4 and 1.5 are widely
available through the American Type Culture Collection (ATCC), a repository for cultured
cell lines, and were cultured using the conditions recommended by the ATCC. The normal
tissues found on Panels 1.4 and 1.5 are comprised of pools of samples derived from all
major organ systems from 2 to 5 different adult individuals or fetuses. These samples are
derived from the following organs: adult skeletal muscle, fetal skeletal muscle, adult heart,
fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various
regions of the brain, the spleen, bone marrow, lymph node, pancreas, salivary gland,
pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, colon,
bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. Abbreviations
are as described for Panels 1, 1.1, 1.2, and 1.3D.
Panels 2D, 2.2, 2.3 and 2.4 

The plates for Panels 2D, 2.2, 2.3 and 2.4 generally include 2 control wells and 94
test samples composed of RNA or cDNA isolated from human tissue procured by
surgeons working in close cooperation with the National Cancer Institute's Cooperative
Human Tissue Network (CHTN) or the National Disease Research Interchange (NDRI), or
from Ardais or Clinomics. The tissues are derived from human malignancies and in cases
where indicated many malignant tissues have "matched margins" obtained from
noncancerous tissue just adjacent to the tumor. These are termed normal adjacent tissues
and are denoted "NAT" in the results below. The tumor tissue and the "matched margins"
are evaluated by two independent pathologists (the surgical pathologists and again by a
pathologist at NDRI/CHTN/Ardais/Clinomics). Unmatched RNA samples from tissues
without malignancy (normal tissues) were also obtained from Ardais or Clinomics. This
analysis provides a gross histopathological assessment of tumor differentiation grade.
Moreover, most samples include the original surgical pathology report that provides
information regarding the clinical stage of the patient. These matched margins are taken
from the tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated
"NAT", for normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples
were obtained from various human tissues derived from autopsies performed on elderly
people or sudden death victims (accidents, etc.). These tissues were ascertained to be free
of disease and were purchased from various commercial sources such as Clontech (Palo
Alto, CA), Research Genetics, and Invitrogen.
HASS Panel v 1.0

The HASS panel v 1.0 plates are comprised of 93 cDNA samples and two controls.
Specifically, 81 of these samples are derived from cultured human cancer cell lines that
had been subjected to serum starvation, acidosis and anoxia for different time periods as
well as controls for these treatments. The remainder are 3 samples of human primary
cells, 9 samples of malignant brain cancer (4 medulloblastomas and 5 glioblastomas) and
2 controls. The human cancer cell lines are obtained from ATCC (American Type Culture
Collection) and fall into the following tissue groups: breast cancer, prostate cancer,
bladder carcinomas, pancreatic cancers and CNS cancer cell lines. These cancer cells are
all cultured under standard recommended conditions. The treatments used (serum
starvation, acidosis and anoxia) have been previously published in the scientific
literature. The primary human cells were obtained from Clonetics (Walkersville, MD) and
were grown in the media and conditions recommended by Clonetics. The malignant brain
cancer samples are obtained as part of a collaboration (Henry Ford Cancer Center) and are
evaluated by a pathologist prior to CuraGen receiving the samples. RNA was prepared
from these samples using the standard procedures. The genomic and chemistry control
wells have been described previously.

Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA samples and two control 
samples. Specifically, 92 of these samples are derived from cultured human cancer cell 
lines, 2 samples of human primary cerebellar tissue and 2 controls. The human cell lines 
are generally obtained from ATCC (American Type Culture Collection), NCI or the 
German tumor cell bank and fall into the following tissue groups: Squamous cell 
carcinoma of the tongue, breast cancer, prostate cancer, melanoma, epidermoid carcinoma, 
sarcomas, bladder carcinomas, pancreatic cancers, kidney cancers, leukemias/lymphomas, 
ovarian/uterine/cervical, gastric, colon, lung and CNS cancer cell lines. In addition, there 
are two independent samples of cerebellum. These cells are all cultured under standard 
recommended conditions and RNA extracted using the standard procedures. The cell lines 
in panel 3D and 1.3D are of the most common cell lines used in the scientific literature. 
Panels 4D, 4R, and 4.1D

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples)
composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human
cell lines or tissues related to inflammatory conditions. Total RNA from control normal
tissues such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney
(Clontech) was employed. Total RNA from liver tissue from cirrhosis patients and kidney
from lupus patients was obtained from BioChain (Biochain Institute, Inc., Hayward, CA).
Intestinal tissue for RNA preparation from patients diagnosed as having Crohn's disease
and ulcerative colitis was obtained from the National Disease Research Interchange
(NDRI) (Philadelphia, PA).

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle
cells, small airway epithelium, bronchial epithelium, microvascular dermal endothelial
cells, microvascular lung endothelial cells, human pulmonary aortic endothelial cells,
human umbilical vein endothelial cells were all purchased from Clonetics (Walkersville,
MD) and grown in the media supplied for these cell types by Clonetics. These primary cell
types were activated with various cytokines or combinations of cytokines for 6 and/or
12-14 hours, as indicated. The following cytokines were used: IL-1 beta at approximately
1-5ng/ml, TNF alpha at approximately 5-10ng/ml, IFN gamma at approximately
20-50ng/ml, IL-4 at approximately 5-10ng/ml, IL-9 at approximately 5-10ng/ml, IL-13 at
approximately 5-10ng/ml. Endothelial cells were sometimes starved for various times by
culture in the basal media from Clonetics with 0.1% serum.

Mononuclear cells were prepared from blood of employees at CuraGen
Corporation, using Ficoll. LAK cells were prepared from these cells by culture in DMEM
5% FCS (Hyclone), 100µM non-essential amino acids (Gibco/Life Technologies,
Rockville, MD), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10⁻⁵ M (Gibco), and
10mM Hepes (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with
10-20ng/ml PMA and 1-2µg/ml ionomycin, IL-12 at 5-10ng/ml, IFN gamma at
20-50ng/ml and IL-18 at 5-10ng/ml for 6 hours. In some cases, mononuclear cells were
cultured for 4-5 days in DMEM 5% FCS (Hyclone), 100µM non-essential amino acids
(Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10⁻⁵ M (Gibco), and 10mM
Hepes (Gibco) with PHA (phytohemagglutinin) or PWM (pokeweed mitogen) at
approximately 5µg/ml. Samples were taken at 24, 48 and 72 hours for RNA preparation.

MLR (mixed lymphocyte reaction) samples were obtained by taking blood from two
donors, isolating the mononuclear cells using Ficoll and mixing the isolated mononuclear
cells 1:1 at a final concentration of approximately 2x10⁶ cells/ml in DMEM 5% FCS
(Hyclone), 100µM non-essential amino acids (Gibco), 1mM sodium pyruvate (Gibco),
mercaptoethanol (5.5x10⁻⁵ M) (Gibco), and 10mM Hepes (Gibco). The MLR was cultured
and samples taken at various time points ranging from 1-7 days for RNA preparation.

Monocytes were isolated from mononuclear cells using CD14 Miltenyi Beads, +ve
VS selection columns and a Vario Magnet according to the manufacturer's instructions.
Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf
serum (FCS) (Hyclone, Logan, UT), 100µM non-essential amino acids (Gibco), 1mM
sodium pyruvate (Gibco), mercaptoethanol 5.5x10⁻⁵ M (Gibco), and 10mM Hepes (Gibco),
50ng/ml GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of
monocytes for 5-7 days in DMEM 5% FCS (Hyclone), 100µM non-essential amino acids
(Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10⁻⁵ M (Gibco), 10mM
Hepes (Gibco) and 10% AB Human Serum or MCSF at approximately 50ng/ml.
Monocytes, macrophages and dendritic cells were stimulated for 6 and 12-14 hours with
lipopolysaccharide (LPS) at 100ng/ml. Dendritic cells were also stimulated with
anti-CD40 monoclonal antibody (Pharmingen) at 10µg/ml for 6 and 12-14 hours.

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection
columns and a Vario Magnet according to the manufacturer's instructions. CD45RA and
CD45RO CD4 lymphocytes were isolated by depleting mononuclear cells of CD8, CD56,
CD14 and CD19 cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive
selection. CD45RO beads were then used to isolate the CD45RO CD4 lymphocytes with
the remaining cells being CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and
CD8 lymphocytes were placed in DMEM 5% FCS (Hyclone), 100µM non-essential amino
acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10⁻⁵ M (Gibco), and
10mM Hepes (Gibco) and plated at 10⁶ cells/ml onto Falcon 6 well tissue culture plates
that had been coated overnight with 0.5µg/ml anti-CD28 (Pharmingen) and 3µg/ml
anti-CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells were harvested for RNA
preparation. To prepare chronically activated CD8 lymphocytes, we activated the isolated
CD8 lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and then harvested
the cells and expanded them in DMEM 5% FCS (Hyclone), 100µM non-essential amino
acids (Gibco), 1mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10⁻⁵ M (Gibco), and
10mM Hepes (Gibco) and IL-2. The expanded CD8 cells were then activated again with
plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was
isolated 6 and 24 hours after the second activation and after 4 days of the second
expansion culture. The isolated NK cells were cultured in DMEM 5% FCS (Hyclone),
100µM non-essential amino acids (Gibco), 1mM sodium pyruvate (Gibco),
mercaptoethanol 5.5x10⁻⁵ M (Gibco), and 10mM Hepes (Gibco) and IL-2 for 4-6 days
before RNA was prepared.

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with
sterile dissecting scissors and then passed through a sieve. Tonsil cells were then spun
down and resuspended at 10^6 cells/ml in DMEM 5% FCS (Hyclone), 100 µM non essential
amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco),
and 10 mM Hepes (Gibco). To activate the cells, we used PWM at 5 µg/ml or anti-CD40
(Pharmingen) at approximately 10 µg/ml and IL-4 at 5-10 ng/ml. Cells were harvested for
RNA preparation at 24, 48 and 72 hours.

To prepare the primary and secondary Th1/Th2 and Tr1 cells, six-well Falcon
plates were coated overnight with 10 µg/ml anti-CD28 (Pharmingen) and 2 µg/ml OKT3
(ATCC), and then washed twice with PBS. Umbilical cord blood CD4 lymphocytes
(Poietic Systems, German Town, MD) were cultured at 10^5-10^6 cells/ml in DMEM 5%
FCS (Hyclone), 100 µM non essential amino acids (Gibco), 1 mM sodium pyruvate
(Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), 10 mM Hepes (Gibco) and IL-2 (4 ng/ml).
IL-12 (5 ng/ml) and anti-IL4 (1 µg/ml) were used to direct to Th1, while IL-4 (5 ng/ml) and
anti-IFN gamma (1 µg/ml) were used to direct to Th2 and IL-10 at 5 ng/ml was used to
direct to Tr1. After 4-5 days, the activated Th1, Th2 and Tr1 lymphocytes were washed
once in DMEM and expanded for 4-7 days in DMEM 5% FCS (Hyclone), 100 µM non
essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol
5.5x10^-5 M (Gibco), 10 mM Hepes (Gibco) and IL-2 (1 ng/ml). Following this, the activated
Th1, Th2 and Tr1 lymphocytes were re-stimulated for 5 days with anti-CD28/OKT3 and
cytokines as described above, but with the addition of anti-CD95L (1 µg/ml) to prevent
apoptosis. After 4-5 days, the Th1, Th2 and Tr1 lymphocytes were washed and then
expanded again with IL-2 for 4-7 days. Activated Th1 and Th2 lymphocytes were
maintained in this way for a maximum of three cycles. RNA was prepared from primary
and secondary Th1, Th2 and Tr1 after 6 and 24 hours following the second and third
activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second
and third expansion cultures in Interleukin 2.

The following leukocyte cell lines were obtained from the ATCC: Ramos, EOL-1,
KU-812. EOL cells were further differentiated by culture in 0.1 mM dbcAMP at
5x10^3 cells/ml for 8 days, changing the media every 3 days and adjusting the cell
concentration to 5x10^3 cells/ml. For the culture of these cells, we used DMEM or RPMI (as
recommended by the ATCC), with the addition of 5% FCS (Hyclone), 100 µM non
essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol
5.5x10^-5 M (Gibco), 10 mM Hepes (Gibco). RNA was either prepared from resting cells or cells
activated with PMA at 10 ng/ml and ionomycin at 1 µg/ml for 6 and 14 hours. Keratinocyte
line CCD1106 and an airway epithelial tumor line NCI-H292 were also obtained from the
ATCC. Both were cultured in DMEM 5% FCS (Hyclone), 100 µM non essential amino
acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5x10^-5 M (Gibco), and
10 mM Hepes (Gibco). CCD1106 cells were activated for 6 and 14 hours with
approximately 5 ng/ml TNF alpha and 1 ng/ml IL-1 beta, while NCI-H292 cells were
activated for 6 and 14 hours with the following cytokines: 5 ng/ml IL-4, 5 ng/ml IL-9,
5 ng/ml IL-13 and 25 ng/ml IFN gamma.

For these cell lines and blood cells, RNA was prepared by lysing approximately
10^7 cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane
(Molecular Research Corporation) was added to the RNA sample, vortexed and after 10
minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor.
The aqueous phase was removed and placed in a 15 ml Falcon Tube. An equal volume of
isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down
at 9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet
was redissolved in 300 µl of RNAse-free water and 35 µl buffer (Promega), 5 µl DTT, 7 µl
RNAsin and 8 µl DNAse were added. The tube was incubated at 37°C for 30 minutes to
remove contaminating genomic DNA, extracted once with phenol chloroform and
re-precipitated with 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol. The
RNA was spun down and placed in RNAse free water. RNA was stored at -80°C.
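
The volume ratios in the re-precipitation step can be sketched as a small helper. This is an illustrative calculation only: it assumes both ratios are taken relative to the volume of the DNase digest and ignores any volume change during the phenol chloroform extraction. The function name and dictionary keys are hypothetical.

```python
def precipitation_volumes(sample_ul):
    """Reagent volumes (in µl) for an ethanol re-precipitation.

    Assumption: "1/10 volume" and "2 volumes" are both measured
    against the sample volume before any salt is added.
    """
    return {
        "sodium_acetate_3M_ul": sample_ul / 10.0,
        "ethanol_100pct_ul": sample_ul * 2.0,
    }


# DNase digest as described in the text:
# 300 µl RNA + 35 µl buffer + 5 µl DTT + 7 µl RNAsin + 8 µl DNAse
digest_ul = 300 + 35 + 5 + 7 + 8
volumes = precipitation_volumes(digest_ul)
```

Under these assumptions, a 355 µl digest calls for 35.5 µl of 3M sodium acetate and 710 µl of 100% ethanol.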

AI_comprehensive panel_v1.0

The plates for AI_comprehensive panel_v1.0 include two control wells and 89 test
samples comprised of cDNA isolated from surgical and postmortem human tissues
obtained from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was
extracted from tissue samples from the Backus Hospital in the Facility at CuraGen. Total
RNA from other tissues was obtained from Clinomics.

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained
from patients undergoing total knee or hip replacement surgery at the Backus Hospital.
Tissue samples were immediately snap frozen in liquid nitrogen to ensure that isolated
RNA was of optimal quality and not degraded. Additional samples of osteoarthritis and
rheumatoid arthritis joint tissues were obtained from Clinomics. Normal control tissues
were supplied by Clinomics and were obtained during autopsy of trauma victims.

Surgical specimens of psoriatic tissues and adjacent matched tissues were provided
as total RNA by Clinomics. Two male and two female patients were selected between the
ages of 25 and 47. None of the patients were taking prescription drugs at the time samples
were isolated.

Surgical specimens of diseased colon from patients with ulcerative colitis and
Crohn's disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue
from three female and three male Crohn's patients between the ages of 41 and 69 was used.
Two patients were not on prescription medication while the others were taking
dexamethasone, phenobarbital, or Tylenol. Ulcerative colitis tissue was from three male
and four female patients. Four of the patients were taking lebvid and two were on
phenobarbital.

Total RNA from post mortem lung tissue from trauma victims with no disease or
with emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients
ranged in age from 40-70 and all were smokers; this age range was chosen to focus on
patients with cigarette-linked emphysema and to avoid those patients with alpha-1
antitrypsin deficiencies. Asthma patients ranged in age from 36-75, and smokers were
excluded to avoid patients who could also have COPD. COPD patients ranged in age from
35-80 and included both smokers and non-smokers. Most patients were taking
corticosteroids and bronchodilators.

In the labels employed to identify tissues in the AI_comprehensive panel_v1.0
panel, the following abbreviations are used:
AI = Autoimmunity
Syn = Synovial
Normal = No apparent disease
Rep22/Rep20 = Individual patients
RA = Rheumatoid arthritis
Backus = From Backus Hospital
OA = Osteoarthritis
(SS) (BA) (MF) = Individual patients
Adj = Adjacent tissue
Match control = Adjacent tissues
-M = Male
-F = Female
COPD = Chronic obstructive pulmonary disease

Panels 5D and 5I

The plates for Panel 5D and 5I include two control wells and a variety of cDNAs
isolated from human tissues and cell lines with an emphasis on metabolic diseases.
Metabolic tissues were obtained from patients enrolled in the Gestational Diabetes study.
Cells were obtained during different stages in the differentiation of adipocytes from
human mesenchymal stem cells. Human pancreatic islets were also obtained.

In the Gestational Diabetes study, subjects are young (18-40 years), otherwise
healthy women with and without gestational diabetes undergoing routine (elective)
Caesarean section. After delivery of the infant, when the surgical incisions were being
repaired/closed, the obstetrician removed a small sample (<1 cc) of the exposed metabolic
tissues during the closure of each surgical level. The biopsy material was rinsed in sterile
saline, blotted and fast frozen within 5 minutes from the time of removal. The tissue was
then flash frozen in liquid nitrogen and stored, individually, in sterile screw-top tubes and
kept on dry ice for shipment to, or pickup by, CuraGen. The metabolic tissues of
interest include uterine wall (smooth muscle), visceral adipose, skeletal muscle (rectus)
and subcutaneous adipose. Patient descriptions are as follows:

Patient 2: Diabetic Hispanic, overweight, not on insulin
Patient 7-9: Nondiabetic Caucasian and obese (BMI>30)
Patient 10: Diabetic Hispanic, overweight, on insulin
Patient 11: Nondiabetic African American and overweight
Patient 12: Diabetic Hispanic on insulin

Adipocyte differentiation was induced in donor progenitor cells obtained from
Osirus (a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had
only two replicates. Scientists at Clonetics isolated, grew and differentiated human
mesenchymal stem cells (HuMSCs) for CuraGen based on the published protocol found in
Mark F. Pittenger, et al., Multilineage Potential of Adult Human Mesenchymal Stem Cells,
Science Apr 2 1999: 143-147. Clonetics provided Trizol lysates or frozen pellets suitable
for mRNA isolation and ds cDNA production. A general description of each donor is as
follows:

Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose
Donor 2 and 3 AM: Adipose, Adipose Midway Differentiated
Donor 2 and 3 AD: Adipose, Adipose Differentiated

Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver 
HepG2 cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These 
cells are all cultured under standard recommended conditions and RNA extracted using 
the standard procedures. All samples were processed at CuraGen to produce single 
stranded cDNA. 

Panel 5I contains all samples previously described with the addition of pancreatic
islets from a 58 year old female patient obtained from the Diabetes Research Institute at
the University of Miami School of Medicine. Islet tissue was processed to total RNA at an
outside source and delivered to CuraGen for addition to panel 5I.

In the labels employed to identify tissues in the 5D and 5I panels, the following
abbreviations are used:
GO Adipose = Greater Omentum Adipose
SK = Skeletal Muscle
UT = Uterus
PL = Placenta
AD = Adipose Differentiated
AM = Adipose Midway Differentiated
U = Undifferentiated Stem Cells

Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples
comprised of cDNA isolated from postmortem human brain tissue obtained from the
Harvard Brain Tissue Resource Center. Brains are removed from calvaria of donors
between 4 and 24 hours after death, sectioned by neuroanatomists, and frozen at -80°C in
liquid nitrogen vapor. All brains are sectioned and examined by neuropathologists to
confirm diagnoses with clear associated neuropathology.

Disease diagnoses are taken from patient records. The panel contains two brains
from each of the following diagnoses: Alzheimer's disease, Parkinson's disease,
Huntington's disease, Progressive Supranuclear Palsy, Depression, and "Normal controls".
Within each of these brains, the following regions are represented: cingulate gyrus,
temporal pole, globus pallidus, substantia nigra, Brodmann Area 4 (primary motor strip),
Brodmann Area 7 (parietal cortex), Brodmann Area 9 (prefrontal cortex), and Brodmann
Area 17 (occipital cortex). Not all brain regions are represented in all cases; e.g.,
Huntington's disease is characterized in part by neurodegeneration in the globus pallidus,
so this region is impossible to obtain from confirmed Huntington's cases. Likewise,
Parkinson's disease is characterized by degeneration of the substantia nigra, making this
region more difficult to obtain. Normal control brains were examined for neuropathology
and found to be free of any pathology consistent with neurodegeneration.
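
The sample counts given for Panel CNSD.01 can be cross-checked with a short calculation. This is a bookkeeping sketch only; it assumes each test sample corresponds to one region from one brain, which the text does not state explicitly, and the variable names are illustrative.

```python
# Figures taken from the panel description.
CONTROL_WELLS = 2
TEST_SAMPLES = 94

DIAGNOSES = [
    "Alzheimer's disease",
    "Parkinson's disease",
    "Huntington's disease",
    "Progressive Supranuclear Palsy",
    "Depression",
    "Normal controls",
]
BRAINS_PER_DIAGNOSIS = 2
REGIONS = [
    "cingulate gyrus",
    "temporal pole",
    "globus pallidus",
    "substantia nigra",
    "Brodmann Area 4",
    "Brodmann Area 7",
    "Brodmann Area 9",
    "Brodmann Area 17",
]

# Upper bound on brain-region samples if every region were available.
max_region_samples = len(DIAGNOSES) * BRAINS_PER_DIAGNOSIS * len(REGIONS)

# Wells occupied on the plate.
wells_used = CONTROL_WELLS + TEST_SAMPLES

# Assumption: one test sample per brain region actually obtained.
regions_not_represented = max_region_samples - TEST_SAMPLES
```

Under that assumption, the 94 test samples fall two short of the 96 possible brain-region combinations, which is consistent with the statement that not all regions are represented in all cases.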

In the labels employed to identify tissues in the CNS panel, the following
abbreviations are used:
PSP = Progressive supranuclear palsy
Sub Nigra = Substantia nigra
Glob Palladus = Globus pallidus
Temp Pole = Temporal pole
Cing Gyr = Cingulate gyrus
BA 4 = Brodmann Area 4

Panel CNS_Neurodegeneration_V1.0

The plates for Panel CNS_Neurodegeneration_V1.0 include two control wells and
47 test samples comprised of cDNA isolated from postmortem human brain tissue
obtained from the Harvard Brain Tissue Resource Center (McLean Hospital) and the
Human Brain and Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare
System). Brains are removed from calvaria of donors between 4 and 24 hours after death,
sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are
sectioned and examined by neuropathologists to confirm diagnoses with clear associated
neuropathology.

Disease diagnoses are taken from patient records. The panel contains six brains
from Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who
showed no evidence of dementia prior to death. The eight normal control brains are
divided into two categories: controls with no dementia and no Alzheimer's-like pathology
(Controls) and controls with no dementia but evidence of severe Alzheimer's-like
pathology (specifically, senile plaque load rated as level 3 on a scale of 0-3; 0 = no
evidence of plaques, 3 = severe AD senile plaque load). Within each of these brains, the
following regions are represented: hippocampus, temporal cortex (Brodmann Area 21),
parietal cortex (Brodmann Area 7), and occipital cortex (Brodmann Area 17). These regions
were chosen to encompass all levels of neurodegeneration in AD. The hippocampus is a
region of early and severe neuronal loss in AD; the temporal cortex is known to show
neurodegeneration in AD after the hippocampus; the parietal cortex shows moderate
neuronal death in the late stages of the disease; the occipital cortex is spared in AD and
therefore acts as a "control" region within AD patients. Not all brain regions are
represented in all cases.

In the labels employed to identify tissues in the CNS_Neurodegeneration_V1.0
panel, the following abbreviations are used:
AD = Alzheimer's disease brain; patient was demented and showed AD-like
pathology upon autopsy
Control = Control brains; patient not demented, showing no neuropathology
Control (Path) = Control brains; patient not demented but showing severe AD-like
pathology
SupTemporal Ctx = Superior Temporal Cortex
Inf Temporal Ctx = Inferior Temporal Cortex

A. CG100126-01: Keratin Associated Protein

Expression of gene CG100126-01 was assessed using the primer-probe set
Ag4163, described in Table AA. Results of the RTQ-PCR runs are shown in Tables AB,
AC and AD.

Table AA. Probe Name Ag4163

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-cagtgctgccagtctgtgt-3' | 19 | 217 | 203
Probe | TET-5'-agctgcagcatctccagctgctg-3'-TAMRA | 23 | 517 | 204
Reverse | 5'-agctggattcacagcaagag-3' | 20 | 546 | 205

Table AB. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4163, Run 221000456 | Tissue Name | Rel. Exp.(%) Ag4163, Run 221000456
Adipose | 0.0 | Renal ca. TK-10 | 0.0
Melanoma* Hs688(A).T | 0.0 | Bladder | 5.3
Melanoma* Hs688(B).T | 2.4 | Gastric ca. (liver met.) NCI-N87 | 3.4
Melanoma* M14 | 0.0 | Gastric ca. KATO III | 0.0
Melanoma* LOXIMVI | 100.0 | Colon ca. SW-948 | 0.0
Melanoma* SK-MEL-5 | 0.0 | Colon ca. SW480 | 8.1
Squamous cell carcinoma SCC-4 | 0.0 | Colon ca.* (SW480 met) SW620 |
Testis Pool | 0.0 | Colon ca. HT29 | 0.0
Prostate ca.* (bone met) PC-3 | 0.0 | Colon ca. HCT-116 | 95.9
Prostate Pool | 0.0 | Colon ca. CaCo-2 | 2.9
Placenta | 0.0 | Colon cancer tissue | 0.0
Uterus Pool | 0.0 | Colon ca. SW1116 | 0.0
Ovarian ca. OVCAR-3 | 0.0 | Colon ca. Colo-205 | 0.0
Ovarian ca. SK-OV-3 | 5.8 | Colon ca. SW-48 | 0.0
Ovarian ca. OVCAR-4 | 0.0 | Colon Pool | 8.1
Ovarian ca. OVCAR-5 | 26.1 | Small Intestine Pool | 17.4
Ovarian ca. IGROV-1 | 31.2 | Stomach Pool | 3.5
Ovarian ca. OVCAR-8 | 0.0 | Bone Marrow Pool | 0.0
Ovary | 0.0 | Fetal Heart | 0.0
Breast ca. MCF-7 | 0.0 | Heart Pool | 4.2
Breast ca. MDA-MB-231 | 0.0 | Lymph Node Pool | 3.0
Breast ca. BT 549 | 1.0 | Fetal Skeletal Muscle | 3.0
Breast ca. T47D | 81.8 | Skeletal Muscle Pool | 5.1
Breast ca. MDA-N | 0.0 | Spleen Pool | 0.0
Breast Pool | 2.6 | Thymus Pool | 1.6
Trachea | 0.0 | CNS cancer (glio/astro) U87-MG | 0.0
Lung | 5.7 | CNS cancer (glio/astro) U-118-MG | 10.2
Fetal Lung | 0.0 | CNS cancer (neuro;met) SK-N-AS | 0.0
Lung ca. NCI-N417 | 0.0 | CNS cancer (astro) SF-539 | 0.0
Lung ca. LX-1 | 8.2 | CNS cancer (astro) SNB-75 | 0.0
Lung ca. NCI-H146 | 0.0 | CNS cancer (glio) SNB-19 | 31.9
Lung ca. SHP-77 | 0.0 | CNS cancer (glio) SF-295 | 0.0
Lung ca. A549 | 0.0 | Brain (Amygdala) Pool | 0.0
Lung ca. NCI-H526 | 0.0 | Brain (cerebellum) | 0.0
Lung ca. NCI-H23 | 26.6 | Brain (fetal) | 0.0
Lung ca. NCI-H460 | 0.0 | Brain (Hippocampus) Pool | 0.0
Lung ca. HOP-62 | 0.0 | Cerebral Cortex Pool | 0.0
Lung ca. NCI-H522 | 0.0 | Brain (Substantia nigra) Pool | 0.0
Liver | 0.0 | Brain (Thalamus) Pool | 0.0
Fetal Liver | 0.0 | Brain (whole) | 0.0
Liver ca. HepG2 | 0.0 | Spinal Cord Pool | 0.0
Kidney Pool | 9.0 | Adrenal Gland | 0.0
Fetal Kidney | 0.0 | Pituitary gland Pool | 0.0
Renal ca. 786-0 | 3.2 | Salivary Gland | 0.0
Renal ca. A498 | | Thyroid (female) | 0.0
Renal ca. ACHN | 0.0 | Pancreatic ca. CAPAN2 | 0.0
Renal ca. UO-31 | 0.0 | Pancreas Pool | 5.6



Table AC. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4163, Run 173333942 | Tissue Name | Rel. Exp.(%) Ag4163, Run 173333942
Secondary Th1 act | 0.0 | HUVEC IL-1beta | 0.1
Secondary Th2 act | 0.0 | HUVEC IFN gamma | 0.0
Secondary Tr1 act | 0.0 | HUVEC TNF alpha + IFN gamma | 0.3
Secondary Th1 rest | 0.0 | HUVEC TNF alpha + IL4 | 0.0
Secondary Th2 rest | 0.0 | HUVEC IL-11 | 0.1
Secondary Tr1 rest | 0.0 | Lung Microvascular EC none | 0.0
Primary Th1 act | | Lung Microvascular EC TNFalpha + IL-1beta | 0.0
Primary Th2 act | 0.0 | Microvascular Dermal EC none | 26.4
Primary Tr1 act | 0.0 | Microvascular Dermal EC TNFalpha + IL-1beta | 0.0
Primary Th1 rest | 0.0 | Bronchial epithelium TNFalpha + IL1beta | 0.0
Primary Th2 rest | 0.0 | Small airway epithelium none | 0.0
Primary Tr1 rest | 0.0 | Small airway epithelium TNFalpha + IL-1beta | 0.0
CD45RA CD4 lymphocyte act | 0.0 | Coronary artery SMC rest | 0.0
CD45RO CD4 lymphocyte act | 0.0 | Coronary artery SMC TNFalpha + IL-1beta | 0.0
CD8 lymphocyte act | 0.0 | Astrocytes rest | 0.3
Secondary CD8 lymphocyte rest | | Astrocytes TNFalpha + IL-1beta | 0.2
Secondary CD8 lymphocyte act | 0.0 | KU-812 (Basophil) rest | 0.0
CD4 lymphocyte none | 0.0 | KU-812 (Basophil) PMA/ionomycin |
2ry Th1/Th2/Tr1 anti-CD95 CH11 | 0.0 | CCD1106 (Keratinocytes) none |
LAK cells rest | 0.0 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 0.3
LAK cells IL-2 | 0.0 | Liver cirrhosis | 0.0
LAK cells IL-2+IL-12 | 0.0 | NCI-H292 none | 0.1
LAK cells IL-2+IFN gamma | 0.0 | NCI-H292 IL-4 | 0.0
LAK cells IL-2+IL-18 | 0.0 | NCI-H292 IL-9 |
LAK cells PMA/ionomycin | 0.0 | NCI-H292 IL-13 | 0.2
NK Cells IL-2 rest | | NCI-H292 IFN gamma |
Two Way MLR 3 day | | HPAEC none |
Two Way MLR 5 day | | HPAEC TNF alpha + IL-1 beta |
Two Way MLR 7 day | | Lung fibroblast none | 0.0
PBMC rest | | Lung fibroblast TNF alpha + IL-1 beta |
PBMC PWM | | Lung fibroblast IL-4 |
PBMC PHA-L | | Lung fibroblast IL-9 |
Ramos (B cell) none | 0.0 | Lung fibroblast IL-13 |
Ramos (B cell) ionomycin | | Lung fibroblast IFN gamma | 0.0
B lymphocytes PWM | 0.1 | Dermal fibroblast CCD1070 rest | 0.1
B lymphocytes CD40L and IL-4 | | Dermal fibroblast CCD1070 TNF alpha | 0.0
EOL-1 dbcAMP | 0.0 | Dermal fibroblast CCD1070 IL-1 beta | 0.0
EOL-1 dbcAMP PMA/ionomycin | 0.0 | Dermal fibroblast IFN gamma | 0.0
Dendritic cells none | 0.0 | Dermal fibroblast IL-4 | 0.5
Dendritic cells LPS | 0.0 | | 0.9
Dendritic cells anti-CD40 | 0.1 | Neutrophils TNFa+LPS | 0.7
Monocytes rest | 0.0 | Neutrophils rest | 0.6
Monocytes LPS | 0.0 | Colon | 0.8
Macrophages rest | 0.1 | Lung | 3.6
Macrophages LPS | 0.0 | Thymus | 15.5
HUVEC none | 0.0 | Kidney | 100.0
HUVEC starved | 0.0 | |



Table AD. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4163, Run 268624200 | Tissue Name | Rel. Exp.(%) Ag4163, Run 268624200
Colon cancer 1 | 0.0 | Bladder cancer NAT 2 | 0.0
Colon NAT 1 | 0.0 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 0.0 | Bladder cancer NAT 4 | 0.0
Colon cancer NAT 2 | 0.0 | Adenocarcinoma of the prostate 1 | 0.6
Colon cancer 3 | 0.0 | Adenocarcinoma of the prostate 2 | 0.0
Colon cancer NAT 3 | 0.0 | Adenocarcinoma of the prostate 3 | 0.2
Colon malignant cancer 4 | 0.3 | Adenocarcinoma of the prostate 4 | 0.5
Colon normal adjacent tissue 4 | 0.0 | Prostate cancer NAT 5 | 0.0
Lung cancer 1 | 0.3 | Adenocarcinoma of the prostate 6 | 0.0
Lung NAT 1 | 0.1 | Adenocarcinoma of the prostate 7 | 0.0
Lung cancer 2 | 0.0 | Adenocarcinoma of the prostate 8 | 0.7
Lung NAT 2 | 0.3 | Adenocarcinoma of the prostate 9 | 0.0
Squamous cell carcinoma 3 | 0.2 | Prostate cancer NAT 10 | 0.2
Lung NAT 3 | 0.0 | Kidney cancer 1 | 0.0
metastatic melanoma 1 | 0.3 | Kidney NAT 1 | 0.0
Melanoma 2 | 100.0 | Kidney cancer 2 | 1.8
Melanoma 3 | 0.3 | Kidney NAT 2 | 0.5
metastatic melanoma 4 | 0.2 | Kidney cancer 3 | 0.0
metastatic melanoma 5 | 0.5 | Kidney NAT 3 | 0.5
Bladder cancer 1 | 0.0 | Kidney cancer 4 | 0.0
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 0.0
Bladder cancer 2 | 0.0 | |







CNS_neurodegeneration_v1.0 Summary: Ag4163 Results from one experiment
with the CG100126-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.

General_screening_panel_v1.4 Summary: Ag4163 Highest expression of the
CG100126-01 gene is detected in melanoma LOXIMVI and colon cancer HCT-116 cell
lines (CTs=33.8). In addition, significant expression of this gene is also detected in a
breast cancer T47D cell line. Therefore, expression of this gene can be used to distinguish
these samples from other samples used in this panel and also as a diagnostic marker for
melanoma, colon and breast cancer. Furthermore, therapeutic modulation of this gene
product can be useful in the treatment of these cancers.
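
The tables report relative expression as a percentage of the highest-expressing sample, while the summaries quote the corresponding CT values. The sketch below shows one common way such percentages can be derived from CT values. It assumes, which this section does not state, that each PCR cycle corresponds to a two-fold difference in starting template; the function name, sample names and CT values are hypothetical.

```python
def relative_expression(ct_by_sample):
    """Express each sample as a percentage of the lowest-CT sample.

    Assumption: perfect amplification efficiency, so a difference of
    one CT corresponds to a two-fold difference in template amount.
    """
    lowest_ct = min(ct_by_sample.values())
    return {
        name: 100.0 * 2.0 ** (lowest_ct - ct)
        for name, ct in ct_by_sample.items()
    }


# Hypothetical CT values, for illustration only.
example_cts = {"sample_a": 30.0, "sample_b": 31.0, "sample_c": 33.0}
percentages = relative_expression(example_cts)
```

Under this convention the sample with the lowest CT is always reported as 100.0, and a sample three cycles later is reported as 12.5.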

Panel 4.1D Summary: Ag4163 Highest expression of the CG100126-01 gene is
detected in kidney (CT=29). Moderate expression of this gene is also seen in lung, thymus
and microvascular dermal endothelial cells. Therefore, expression of this gene can be used
to distinguish these samples from other samples in this panel. In addition, therapeutic
modulation of this gene can be useful in the treatment of autoimmune and inflammatory
diseases that affect lung and kidney.

General oncology screening panel_v_2.4 Summary: Ag4163 Highest
expression of the CG100126-01 gene is detected exclusively in a melanoma
sample (CT=29.8). Therefore, expression of this gene may be used as a diagnostic marker
for detection of melanoma and therapeutic modulation of this gene product may be
beneficial in the treatment of melanoma.

B. CG100146-01: UDP-Glucuronosyl Transferase

Expression of gene CG100146-01 was assessed using the primer-probe set
Ag4165, described in Table BA. Results of the RTQ-PCR runs are shown in Tables BB
and BC.

Table BA. Probe Name Ag4165

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-agaccaggatgtctctgaaatg-3' | 22 | 193 | 206
Probe | TET-5'-cgtcagtctttctgctgatacagctca-3'-TAMRA | 27 | 217 | 207
Reverse | 5'-ccacaactcccagagctaaag-3' | 21 | 251 | 208



Table BB. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4165, Run 221000458 | Tissue Name | Rel. Exp.(%) Ag4165, Run 221000458
Adipose | 7.4 | Renal ca. TK-10 | 0.0
Melanoma* Hs688(A).T | 0.0 | Bladder | 16.4
Melanoma* Hs688(B).T | 0.0 | Gastric ca. (liver met.) NCI-N87 | 0.0
Melanoma* M14 | 0.0 | Gastric ca. KATO III | 0.0
Melanoma* LOXIMVI | 0.0 | Colon ca. SW-948 | 2.6
Melanoma* SK-MEL-5 | 0.0 | Colon ca. SW480 | 0.0
Squamous cell carcinoma SCC-4 | 2.0 | Colon ca.* (SW480 met) SW620 | 0.0
Testis Pool | 1.6 | Colon ca. HT29 | 0.0
Prostate ca.* (bone met) PC-3 | 0.0 | Colon ca. HCT-116 | 0.0
Prostate Pool | 0.0 | Colon ca. CaCo-2 | 17.1
Placenta | 0.0 | Colon cancer tissue | 4.4
Uterus Pool | 0.0 | Colon ca. SW1116 | 0.0
Ovarian ca. OVCAR-3 | 0.0 | Colon ca. Colo-205 | 0.0
Ovarian ca. SK-OV-3 | 0.0 | Colon ca. SW-48 | 0.0
Ovarian ca. OVCAR-4 | 0.0 | Colon Pool | 1.9
Ovarian ca. OVCAR-5 | 0.0 | Small Intestine Pool | 100.0
Ovarian ca. IGROV-1 | 0.0 | Stomach Pool | 0.6
Ovarian ca. OVCAR-8 | 0.0 | Bone Marrow Pool | 0.0
Ovary | 4.0 | Fetal Heart | 0.0
Breast ca. MCF-7 | 7.4 | Heart Pool | 0.0
Breast ca. MDA-MB-231 | 0.0 | Lymph Node Pool | 0.0
Breast ca. BT 549 | 0.0 | Fetal Skeletal Muscle |
Breast ca. T47D | 0.0 | Skeletal Muscle Pool | 0.0
Breast ca. MDA-N | 0.0 | Spleen Pool | 1.7
Breast Pool | 0.8 | Thymus Pool | 2.8
Trachea | 0.9 | CNS cancer (glio/astro) U87-MG | 0.0
Lung | 0.0 | CNS cancer (glio/astro) U-118-MG | 0.0
Fetal Lung | 0.8 | CNS cancer (neuro;met) SK-N-AS | 0.0
Lung ca. NCI-N417 | 0.0 | CNS cancer (astro) SF-539 | 0.0
Lung ca. LX-1 | 0.0 | CNS cancer (astro) SNB-75 | 0.0
Lung ca. NCI-H146 | 0.0 | CNS cancer (glio) SNB-19 |
Lung ca. SHP-77 | 0.0 | CNS cancer (glio) SF-295 | 0.0
Lung ca. A549 | 0.0 | Brain (Amygdala) Pool | 0.0
Lung ca. NCI-H526 | 0.0 | Brain (cerebellum) | 0.0
Lung ca. NCI-H23 | 0.0 | Brain (fetal) | 0.0
Lung ca. NCI-H460 | 3.0 | Brain (Hippocampus) Pool | 0.0
Lung ca. HOP-62 | | Cerebral Cortex Pool |
Lung ca. NCI-H522 | 0.0 | Brain (Substantia nigra) Pool | 3.0
Liver | 3.2 | Brain (Thalamus) Pool | 0.0
Fetal Liver | | Brain (whole) |
Liver ca. HepG2 | 0.0 | Spinal Cord Pool | 0.0
Kidney Pool | | Adrenal Gland |
Fetal Kidney | | Pituitary gland Pool |
Renal ca. 786-0 | 0.0 | Salivary Gland | 0.0
Renal ca. A498 | 0.0 | Thyroid (female) | 0.0
Renal ca. ACHN | 0.0 | Pancreatic ca. CAPAN2 | 0.0
Renal ca. UO-31 | 0.0 | Pancreas Pool | 1.6



Table BC. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4165, Run 268624228 | Tissue Name | Rel. Exp.(%) Ag4165, Run 268624228
Colon cancer 1 | 84.1 | Bladder cancer NAT 2 | 0.0
Colon NAT 1 | 100.0 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 0.4 | Bladder cancer NAT 4 | 0.0
Colon cancer NAT 2 | 49.0 | Adenocarcinoma of the prostate 1 | 0.0
Colon cancer 3 | 2.9 | Adenocarcinoma of the prostate 2 | 0.0
Colon cancer NAT 3 | 28.3 | Adenocarcinoma of the prostate 3 | 0.0
Colon malignant cancer 4 | 19.3 | Adenocarcinoma of the prostate 4 | 0.0
Colon normal adjacent tissue 4 | 59.9 | Prostate cancer NAT 5 | 0.0
Lung cancer 1 | 0.0 | Adenocarcinoma of the prostate 6 | 0.0
Lung NAT 1 | 0.0 | Adenocarcinoma of the prostate 7 | 0.0
Lung cancer 2 | 0.0 | Adenocarcinoma of the prostate 8 | 0.0
Lung NAT 2 | 0.0 | Adenocarcinoma of the prostate 9 | 0.0
Squamous cell carcinoma 3 | 0.2 | Prostate cancer NAT 10 | 0.0
Lung NAT 3 | 1.0 | Kidney cancer 1 | 0.0
metastatic melanoma 1 | 0.1 | Kidney NAT 1 | 3.0
Melanoma 2 | 0.0 | Kidney cancer 2 | 3.0
Melanoma 3 | 3.0 | Kidney NAT 2 | 3.0
metastatic melanoma 4 | 3.0 | Kidney cancer 3 | 3.1
metastatic melanoma 5 | 0.1 | Kidney NAT 3 | 3.0
Bladder cancer 1 | 0.0 | Kidney cancer 4 | 0.0
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 0.0
Bladder cancer 2 | 0.0 | |







CNS_neurodegeneration_vl.O Summary: Ag4165 Expression of the 



CG100146-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

General_screening_panel_v1.4 Summary: Ag4165 Expression of the
CG100146-01 gene is detected almost exclusively in small intestine (CT=32.7), with low
expression also detected in liver. Thus, expression of this gene can be used
to distinguish these samples from other samples used in this panel. Furthermore,
therapeutic modulation of this gene product could be useful in the treatment of small
intestine and liver related disorders. The CG100146-01 gene codes for
UDP-glucuronosyltransferase. UDP-glucuronosyltransferases (UGTs) are glycoproteins that
catalyze the conjugation of a broad variety of lipophilic aglycon substrates with
glucuronic acid, using UDP-glucuronic acid (UDP-GlcUA) as the sugar donor. The major
function of glucuronidation is to change hydrophobic compounds into hydrophilic
derivatives, a process which facilitates their detoxification and excretion
(Radominska-Pandya et al., 2001, Curr Drug Metab 2(3):283-98, PMID: 11513331). Mutations
in the UGT1A1 gene, a member of the UGT family, are implicated in type I and
type II Crigler-Najjar syndromes and the more common mild hyperbilirubinemia known as
Gilbert syndrome (Kadakol et al., 2000, Hum. Mutat. 16: 297-306, PubMed
ID: 11013440). Thus, the CG100146-01 gene may also play a role in the pathogenesis of
Crigler-Najjar syndromes and Gilbert syndrome, and therapeutic modulation of this gene
could be beneficial in the treatment of these diseases.

Panel 4.1D Summary: Ag4165 Results from one experiment with the
CG100146-01 gene are not included. The amp plot indicates that there were experimental
difficulties with this run.

General oncology screening panel_v_2.4 Summary: Ag4165 Highest
expression of the CG100146-01 gene is detected in a colon sample (CT=27.8). Expression
of this gene is exclusive to colon samples, with lower expression in cancer samples
compared to adjacent normal tissue. Therefore, this gene may play a role as a tumor
suppressor, and therapeutic modulation to increase the activity of the gene product may be
beneficial in the treatment of colon cancer.
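The comparison between cancer samples and adjacent normal tissue drawn in this summary can be checked against the colon entries of Table BC. The following is a minimal sketch in Python, assuming that samples are paired by their trailing index (cancer 1 with NAT 1, and so on); the name `tumor_to_nat_ratios` is illustrative and is not part of the original disclosure.

```python
# Relative expression (%) for Ag4165, transcribed from the colon rows of Table BC.
# Pairing samples by index is an assumption made for illustration.
COLON_PAIRS = {
    1: (84.1, 100.0),  # (cancer, normal adjacent tissue)
    2: (0.4, 49.0),
    3: (2.9, 28.3),
    4: (19.3, 59.9),
}


def tumor_to_nat_ratios(pairs):
    """Return {sample index: cancer / NAT ratio}, skipping pairs where NAT is 0."""
    return {
        idx: cancer / nat
        for idx, (cancer, nat) in pairs.items()
        if nat > 0
    }


if __name__ == "__main__":
    for idx, ratio in sorted(tumor_to_nat_ratios(COLON_PAIRS).items()):
        print(f"Colon pair {idx}: cancer/NAT = {ratio:.3f}")
```

A ratio below 1 for every pair corresponds to what the summary describes as lower expression in cancer than in adjacent normal tissue.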
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C. CG100179-01: Cyclophilin Like 

Expression of gene CG100179-01 was assessed using the primer-probe set
Ag4166, described in Table CA. Results of the RTQ-PCR runs are shown in Tables CB,
CC, CD and CE.

Table CA. Probe Name Ag4166

| Primers | Sequences | Length | Start Position | SEQ ID No |
|---------|-----------|--------|----------------|-----------|
| Forward | 5'-tttggacgagtgactaaaggaa-3' | 22 | 2122 | 209 |
| Probe | TET-5'-cagaggatctccaacgrcaaagtcaa-3'-TAMRA | 26 | 2155 | 210 |
| Reverse | 5'-tcatagggcttatctgtrttgg-3' | 22 | 2183 | 211 |



Table CB. CNS_neurodegeneration_v1.0



Tissue Name 


Rel. Exp.(%)Ag4166, Run 
215538908 


Tissue Name 


Rel. Exp.(%) 
Ag4166, Run 
215538908 


AD 1 Hippo 


12.2 


Control (Path) 3 Temporal 
Ctx 


7.6 


AD 2 Hippo 


24.3 


Control (Path) 4 Temporal 
Ctx 


52.5 


AD 3 Hippo 


10.2 


AD 1 Occipital Ctx 


27.2 


AD 4 Hippo 


10.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


97.3 


AD 3 Occipital Ctx




AD 6 Hippo 


61.1 


AD 4 Occipital Ctx 


21.8 


Control 2 Hippo 


18.9 


AD 5 Occipital Ctx 


37.6 


Control 4 Hippo 


11.3 


AD 6 Occipital Ctx 


28.9 


Control (Path) 3 
Hippo 


11.3 


Control 1 Occipital Ctx 


8.2 


AD 1 Temporal 
Ctx 


23.0 


Control 2 Occipital Ctx 


41.2 


AD 2 Temporal 
Ctx 


29.3 


Control 3 Occipital Ctx 


30.4 


AD 3 Temporal 
Ctx 


11.7 


Control 4 Occipital Ctx 


9.7 


AD 4 Temporal 
Ctx 


26.8 


Control (Path) 1 Occipital 
Ctx 


76.3 


AD 5 Inf 
Temporal Ctx 


100.0 


Control (Path) 2 Occipital 
Ctx 


18.7 


AD 5 Sup 
Temporal Ctx 


49.3 


Control (Path) 3 Occipital 
Ctx 


6.2 


AD 6 Inf 
Temporal Ctx 


68.8 


Control (Path) 4 Occipital 
Ctx 


25.9 
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AD 6 Sup 
Temporal Ctx 


55.9 


Control 1 Parietal Ctx 


10.6 


Control 1 
Temporal Ctx 


8.4 


Control 2 Parietal Ctx 


48.6 


Control 2 
Temporal Ctx 


27.5 


Control 3 Parietal Ctx 


19.3 


Control 3 
Temporal Ctx 


16.8 


Control (Path) 1 Parietal 
Ctx 


63.7 


Control 4
Temporal Ctx 


12.9 


Control (Path) 2 Parietal 
Ctx 


30.6 


Control (Path) 1 
Temporal Ctx 


51.8 


Control (Path) 3 Parietal 
Ctx 


8.6 


Control (Path) 2 
Temporal Ctx 


34.4 


Control (Path) 4 Parietal 
Ctx 


48.0 



Table CC. General_screening_panel_v1.4



Tissue Name 


Rel.Exp.(%) Ag4166, Run 
221034286 


Tissue Name 


Rel. Exp.(%) 
Ag4166,Run 
221034286 


Adipose 


9.7 


Renal ca. TK-10 


19.9 


Melanoma* 
Hs688(A).T 


9.3 


Bladder 


22.5 


Melanoma* 
Hs688(B).T 


8.0 


Gastric ca. (liver met.) 
NCI-N87 


44.8 


Melanoma* M14


7.5 


Gastric ca. KATO III 


23.8 


Melanoma* 
LOXIMVI 


14.6 


Colon ca. SW-948 


5.8 


Melanoma* SK- 
MEL-5 


20.3 


Colon ca. SW480 


28.1 


Squamous cell 
carcinoma SCC-4 


7.5 


Colon ca.* (SW480 met) 
SW620 


29.7 


Testis Pool 


0.1 


Colon ca. HT29 


14.3 


Prostate ca.* 
(bone met) PC-3 


7.0 


Colon ca. HCT-116 


29.1 


Prostate Pool 


8.4 


Colon ca. CaCo-2 


50.1 


Placenta 


1.7 


Colon cancer tissue 


13.6 


Uterus Pool 


9.6 


Colon ca. SW1116 


3.0 


Ovarian ca. 
OVCAR-3 


16.4 


Colon ca. Colo-205 


5.3 


Ovarian ca. SK- 
OV-3 


28.9 


Colon ca. SW-48 


4.9 


Ovarian ca. 
OVCAR-4 


4.0 


Colon Pool 


20.6 
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Ovarian ca. 
OVCAR-5 


45.4 


Small Intestine Pool 


26.2 


Ovarian ca. 
IGROV-1 


11.1 


Stomach Pool 


14.8 


Ovarian ca. 
OVCAR-8 


15.2 


Bone Marrow Pool 


9.9 


Ovary 


12.0 


Fetal Heart 


12.2 


Breast ca. MCF-7 


19.8 


Heart Pool 


8.5 


Breast ca. MDA- 
MB-231 


42.3 


Lymph Node Pool 


19.3 


Breast ca. BT 549 


40.9 


Fetal Skeletal Muscle 


8.2 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


9.5 


Breast ca. MDA-N


14.3 


Spleen Pool 


18.4 


Breast Pool 


18.0 


Thymus Pool 


22.4 


Trachea 


5.6 


CNS cancer (glio/astro) 
U87-MG 


36.1 


Lung 


19.5 


CNS cancer (glio/astro) U- 
118-MG 


27.9 


Fetal Lung 


26.2 


CNS cancer (neuro;met) 
SK-N-AS 


26.6 


Lung ca. NCI- 
N417 


3.9 


CNS cancer (astro) SF-539 


8.1 


Lung ca. LX- 1 


19.8 


CNS cancer (astro) SNB- 
75 


49.0 


Lung ca. NCI- 
H146 


5.1 


CNS cancer (glio) SNB- 19 


10.9 


Lung ca. SHP-77 


17.3 


CNS cancer (glio) SF-295 


46.0 


Lung ca. A549 


20.0 


Brain (Amygdala) Pool 


8.6 


Lung ca. NCI- 
H526 


3.3 


Brain (cerebellum) 


9.7 


Lung ca. NCI- 
H23 


20.9 


Brain (fetal) 


12.9 


Lung ca. NCI- 
H460 


15.7 


Brain (Hippocampus) Pool


6.8 


Lung ca. HOP-62


5.0 


Cerebral Cortex Pool 


13.5


Lung ca. NCI-
H522 


26.8 


Brain (Substantia nigra) 
Pool 


7.9 


Liver 


0.7 


Brain (Thalamus) Pool 


14.0 


Fetal Liver 


14.5 


Brain (whole) 


4.7 


Liver ca. HepG2 


11.4 


Spinal Cord Pool 


10.2 


Kidney Pool 


45.1 


Adrenal Gland 


3.8 
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Fetal Kidney 


27.5 


Pituitary gland Pool 


3.6 


Renal ca. 786-0 


9.5 


Salivary Gland 


0.8 


Renal ca. A498 


6.0 


Thyroid (female) 


6.2 


Renal ca. ACHN 


7 


Pancreatic ca. CAPAN2 


25.2 


Renal ca. UO-31 


11.7 


Pancreas Pool 


21.0 



Table CD. Panel 4.1D



Tissue Name 


Rel. Exp.(%) Ag4166, Run
173334537


Tissue Name


Rel. Exp.(%)
Ag4166, Run
173334537


Secondary Thl act 


77.9 


HUVEC IL-lbeta 


49.0 


Secondary Th2 act 


74.2 


HUVEC IFN gamma 




Secondary Trl act 


66.0 


HUVEC TNF alpha + IFN 


29.3 


Secondary Th 1 
rest 


39.8 


HUVEC TNF alpha + IL4 


33.4 


Secondary Th2 
rest 


55.5 


HUVEC IL- 11 


32.5 


Secondary Trl rest 


58.2 


Lung Microvascular EC 


57.8 


Primary Thl act 


68.8 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


39.8 


Primary Th2 act 


94.0 


Microvascular Dermal EC 


33.7 


Primary Trl act 


81.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


23.0 


Primary Thl rest 


55.5 


Bronchial epithelium 
TNFalpha + IL1 beta 


20.9 


Primary Th2 rest 




Small airway epithelium 
none 


7.7 


Primary Trl rest 


66.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


18.7 


CD45RA CD4
lymphocyte act


48.6 


Coronery artery SMC rest 


17.8 


CD45RO CD4 
lymphocyte act 


87.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


22.1 


CD8 lymphocyte
act 


66.4 


Astrocytes rest 


14.1 


Secondary CD8 
lymphocyte rest 


62.9 


Astrocytes TNFalpha + IL- 
lbeta 


12.5 


Secondary CD8 
lymphocyte act 


33.4 


KU-812 (Basophil) rest 


37.6 


CD4 lymphocyte 


63.3


KU-812 (Basophil)


45.7 
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none 




PMA/ionomycin 




2ry 

Thl/Th2/Trl anti 
CD95 CH11 


100.0


CCD1106(Keratinocytes) 
none 


43.5 


LAK cells rest 


33.7 


CCD1106 (Keratinocytes) 
TNFalpha+IL-lbeta 


27.2 


LAK cells IL-2 


62.4 


Liver cirrhosis 


20.4 


LAK cells IL- 
2+IL-12 


61.1 


NCI-H292 none 


29.5 


LAK cells IL- 
2+IFN gamma 


55.9 


NCI-H292 IL-4 


34.4 


LAK cells IL-2+ 
IL-18 


67.4 


NCI-H292 IL-9 


54.7 


LAK cells 
PMA/ionomycin 


18.2 


NCI-H292 IL-13 


59.9 


NK Cells IL-2 rest 


91.4 


NCI-H292 IFN gamma 


33.9 


Two Way MLR 3 
day 




HPAEC none 


27.5 


Two Way MLR 5 


52.5 


HPAEC TNF alpha + 1L-1 
beta 


34.4 


Two Way MLR 7 


35.4 


Lung fibroblast none 




PBMC rest 


17.0 


Lung fibroblast TNF alpha
+ IL-1 beta


— 


PBMC PWM 


53.6 


Lung fibroblast IL-4


28.9 


PBMC PHA-L 


50.0 


Lung fibroblast IL-9


25.9 


Ramos (B cell) 
none 


61.1 


Lung fibroblast IL-13


24.1 


Ramos (B cell) 
ionomycin 


39.8 


Lung fibroblast IFN
gamma


24.1 


B lymphocytes 
PWM 


35.4 


Dermal fibroblast
CCD1070 rest


49.7 


B lymphocytes 
CD40L and IL-4 


71.2 


Dermal fibroblast 
CCD 1070 TNF alpha 


71.7 


EOL-1 dbcAMP ( 


0.3 


Dermal fibroblast 
CCD1070IL-1 beta 


5.8 


EOL-1 dbcAMP

PMA/ionomycin 


0.3 


Dermal fibroblast IFN 
gamma 1 


9.5 


Dendritic cells ^ 


6.6 


Dermal fibroblast IL-4 4 


0.1 


Dendritic cells 
LPS 


0.0 


Dermal Fibroblasts rest 2 


7.9 


Dendritic cells 2 


1.5 


Neutrophils TNFa+LPS 5 


.3 
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|anti-CD40 
[Monocytes rest 



[Monocytes LPS 



(Macrophages rest 3 1 .2 



Neutrophils rest 



(Macrophages LPS 114.2 
(hUVEC none hnTT 



jHUVEC starved {59.0 



Lung 



Thymus 



Kidney 



16.4 

R9~~ 



Table CE. General oncology screening panel_v_2.4



Tissue Name 


Rel. Exp.(%) Ag4166, Run
268624305 


Tissue Name 


Rel. Exp.(%) 
Ag4166, Run
268624305 


Colon cancer 1 


20.0 


Bladder cancer NAT 2 


3.5 


Colon cancer 
NAT 1 


7.5 


Bladder cancer NAT 3 


0.3 


Colon cancer 2 


12.9 


Bladder cancer NAT 4 


0.0 


Colon cancer
NAT 2


5.3 


Adenocarcinoma of the prostate 
1 




Colon cancer 3 



34.4 


Adenocarcinoma of the prostate 
2 


3.5 


Colon cancer 
NAT 3 


17.4 


Adenocarcinoma of the prostate 
3 


8.1 


Colon malignant 
cancer 4 


39.2 


Adenocarcinoma of the prostate 
4 


19.2 


Colon normal 
adjacent tissue 4 


2.7 


Prostate cancer NAT 5 


9.2 


Lung cancer 1 


15.5 


Adenocarcinoma of the prostate 
6 


2.4 


Lung NAT 1 


3.9 


Adenocarcinoma of the prostate 
7 


4.5 


Lung cancer 2 


55.5 


Adenocarcinoma of the prostate 
8 


1.3 


Lung NAT 2 


5.8 


Adenocarcinoma of the prostate 
9 


16.4 


Squamous cell 
carcinoma 3 


13.7 


Prostate cancer NAT 10 


0.9 


Lung NAT 3 


0.5 


Kidney cancer 1 


33.0 


metastatic 
melanoma 1 


23.2 


Kidney NAT 1


12.5 


Melanoma 2 


2.9 


Kidney cancer 2 


100.0


Melanoma 3 


3.2 


Kidney NAT 2


26.4 


metastatic 
melanoma 4 


13.5 


Kidney cancer 3


10.9 
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metastatic 
melanoma 5 


66.0 


Kidney NAT 3 


11.3 


Bladder cancer 1 


7.9 


Kidney cancer 4 


10.9 


Bladder cancer 
NAT 1 


0.0 


Kidney NAT 4 


4.0 


Bladder cancer 2 


5.0 







CNS_neurodegeneration_v1.0 Summary: Ag4166 This panel confirms the
expression of the CG100179-01 gene at low levels in the brains of an independent group
of individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4166 Highest expression of the
CG100179-01 gene is detected in the breast cancer T47D cell line (CT=26). High
expression of this gene is seen in a cluster of breast, ovarian, pancreatic, CNS, colon,
gastric, renal and lung cancer cell lines and melanoma cell lines. Thus, therapeutic modulation
of this gene could be beneficial in the treatment of these cancers.
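The relative expression percentages in Table CC appear to be scaled to the sample with the highest expression (here T47D, CT=26, listed as 100.0). A minimal sketch of such a scaling in Python follows, assuming the conventional relationship in which each additional PCR cycle corresponds to a two-fold lower amount of template, that is, Rel. Exp.(%) = 100 x 2^(CT_min - CT). Both that formula and the function names are assumptions made for illustration; this section does not itself spell out the normalization.

```python
import math


def relative_expression(ct, ct_min):
    """Percent expression relative to the sample with the lowest CT.

    Assumes perfect doubling per PCR cycle, which is an idealization.
    """
    return 100.0 * 2.0 ** (ct_min - ct)


def implied_ct(rel_exp_percent, ct_min):
    """Invert relative_expression: the CT implied by a table percentage."""
    if rel_exp_percent <= 0:
        raise ValueError("a CT cannot be inferred from a zero or negative percentage")
    return ct_min - math.log2(rel_exp_percent / 100.0)


if __name__ == "__main__":
    ct_t47d = 26.0  # highest-expressing sample in Table CC (100.0 %)
    print(relative_expression(27.0, ct_t47d))  # one cycle later -> 50.0
    print(implied_ct(45.1, ct_t47d))           # Kidney Pool, 45.1 % in Table CC
```

Under this assumption a sample listed at 50% would have crossed threshold one cycle after the T47D sample, and percentages near zero correspond to CT values several cycles higher.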

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

In addition, this gene is expressed at high levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4166 Highest expression of the CG100179-01 gene is
detected in anti-CD95 CH11 treated secondary Th1/Th2/Tr1 cells (CT=28.5). This gene is
expressed at high to moderate levels in a wide range of cell types of significance in the
immune response in health and disease. These cells include members of the T-cell, B-cell,
endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell families, as
well as epithelial and fibroblast cell types from lung and skin, and normal tissues
represented by colon, lung, thymus and kidney. This ubiquitous pattern of expression
suggests that this gene product may be involved in homeostatic processes for these and
other cell types and tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4166 Highest
expression of the CG100179-01 gene is detected in kidney cancer (CT=28.5), with
significant expression also seen in melanoma, colon, lung, bladder, prostate and kidney
cancers. In addition, expression of this gene is higher in the cancers than in the normal
adjacent tissue. Therefore, expression of this gene could be used as a marker to detect the
presence of these cancers. Furthermore, therapeutic modulation of the expression or
function of this gene may be effective in the treatment of these cancers.



D. CG100212-01 and CG100212-02: Novel Gene Belonging to Zinc-Containing 
Alcohol Dehydrogenase Superfamily 

Expression of gene CG100212-01 and full length physical clone CG100212-02
was assessed using the primer-probe set Ag4167, described in Table DA. Results of the
RTQ-PCR runs are shown in Tables DB, DC, DD and DE. Please note that CG100212-02
represents a full-length physical clone of the CG100212-01 gene, validating the prediction
of the gene sequence.

Table DA. Probe Name Ag4167

| Primers | Sequences | Length | Start Position | SEQ ID No |
|---------|-----------|--------|----------------|-----------|
| Forward | 5'-ggatccttacatgcgttgtaga-3' | 22 | 146 | 212 |
| Probe | TET-5'-tggcactgattatataacaccttggca-3'-TAMRA | 27 | 182 | 213 |
| Reverse | 5'-tccatcaacgacttgagatagc-3' | 22 | 209 | 214 |
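The start positions and lengths listed for a primer-probe set can be cross-checked against one another. The sketch below, in Python, assumes that each start position is the lowest-numbered coordinate of the oligonucleotide's footprint on the target sequence and that coordinates are 1-based. Under that assumption the Ag4167 probe (27 nucleotides, counted from the listed sequence) would end at position 208, immediately before the reverse primer footprint that starts at 209. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Oligo:
    name: str
    sequence: str  # written 5' to 3', as listed in the table
    start: int     # assumed: 1-based, lowest coordinate of the footprint

    @property
    def end(self) -> int:
        return self.start + len(self.sequence) - 1


# Ag4167 primer-probe set as listed in Table DA.
AG4167 = [
    Oligo("Forward", "ggatccttacatgcgttgtaga", 146),
    Oligo("Probe", "tggcactgattatataacaccttggca", 182),
    Oligo("Reverse", "tccatcaacgacttgagatagc", 209),
]


def amplicon_span(oligos):
    """Return (first, last, length) of the region covered by the set."""
    first = min(o.start for o in oligos)
    last = max(o.end for o in oligos)
    return first, last, last - first + 1


def footprints_overlap(oligos):
    """True if any two adjacent footprints (sorted by start) share a position."""
    ordered = sorted(oligos, key=lambda o: o.start)
    return any(a.end >= b.start for a, b in zip(ordered, ordered[1:]))


if __name__ == "__main__":
    for o in AG4167:
        print(f"{o.name}: {o.start}-{o.end} ({len(o.sequence)} nt)")
    print(amplicon_span(AG4167))
    print(footprints_overlap(AG4167))
```

The same check can be applied to the other primer-probe tables by substituting their sequences and start positions.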



Table DB. CNS_neurodegeneration_v1.0



Tissue Name 


Rel. Exp.(%) Ag4167, 
Run 215538909 


Tissue Name 


Rel. Exp.(%) 
Ag4167, Run 
215538909 


AD 1 Hippo 


18.0 


Control (Path) 3 Temporal
Ctx


9.9




AD 2 Hippo 


44.4 


Control (Path) 4 Temporal 
Ctx 


47.3 


AD 3 Hippo 


10.7 


AD 1 Occipital Ctx 


21.9 


AD 4 Hippo 


12.9 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo


56.3 


AD 3 Occipital Ctx 


9.3 


AD 6 Hippo 


67.8 


AD 4 Occipital Ctx 


24.5 


Control 2 Hippo 


31.0 


AD 5 Occipital Ctx 


15.9 


Control 4 Hippo 


25.3 


AD 6 Occipital Ctx 


34.6 


Control (Path) 3 
Hippo 


13.3 


Control 1 Occipital Ctx 


9.2 


AD 1 Temporal 
Ctx 


34.2 


Control 2 Occipital Ctx 


39.8 


AD 2 Temporal 
Ctx 


100.0 


Control 3 Occipital Ctx 


22.2 


AD 3 Temporal 
Ctx 


9.4 


Control 4 Occipital Ctx 


13.8 


AD 4 Temporal 
Ctx 


15.1 


Control (Path) 1 Occipital 
Ctx 


87.7 


AD 5 Inf 
Temporal Ctx 


79.6 


Control (Path) 2 Occipital 

Ctx 


17.6 


AD 5 Sup
Temporal Ctx


51.1 


Control (Path) 3 Occipital 
Ctx 


5.6 


AD 6 Inf 
Temporal Ctx 


71.7 


Control (Path) 4 Occipital 
Ctx 


16.8 


AD 6 Sup 
Temporal Ctx 


60.3 


Control 1 Parietal Ctx




Control 1 
Temporal Ctx 


10.2 


Control 2 Parietal Ctx


Control 2 
Temporal Ctx 


34.9 


Control 3 Parietal Ctx




Control 3 
Temporal Ctx 


29.5 


Control (Path) 1 Parietal Ctx 


75.3 


Control 4 
Temporal Ctx 


16.6 


Control (Path) 2 Parietal Ctx 


26.8 


Control (Path) 1 
Temporal Ctx 


51.6 


Control (Path) 3 Parietal Ctx 


10.2 


Control (Path) 2 
Temporal Ctx 


36.9 


Control (Path) 4 Parietal Ctx 


42.0 



Table DC. General_screening_panel_v1.4



Tissue Name 


Rel.Exp.(%)Ag4167, 
Run 221035838 


Tissue Name 


Rel. Exp.(%) 
Ag4167,Run 
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221035838 


Adipose


3.6


Renal ca. TK-10


22.5


Melanoma*
Hs688(A).T


5.3


Bladder


5.4


Melanoma*
Hs688(B).T


5.8


Gastric ca. (liver met.) NCI-
N87


18.6


Melanoma* M14


1.6


Gastric ca. KATO III


31.9


Melanoma*
LOXIMVI


5.0 


Colon ca. SW-948 


7.3 


Melanoma* SK- 
MEL-5 


12.5 


Colon ca. SW480 


22.7 


Squamous cell 
carcinoma SCC-4 


17.3 



Colon ca.* (SW480 met) 
SW620 


14.0 


Testis Pool 


7.0 


Colon ca. HT29 


4.7 


Prostate ca.* (bone 
met) PC-3 


27.0 


Colon ca. HCT-116 


25.7 


Prostate Pool 


6.8 


Colon ca. CaCo-2 


24.1 


Placenta 


0.7 


Colon cancer tissue 


7.4 


Uterus Pool 


2.7 


Colon ca. SW1116


2.8 


Ovarian ca. 
OVCAR-3 


28.3 


Colon ca. Colo-205




Ovarian ca. SK-
OV-3


30.4 


Colon ca. SW-48 




Ovarian ca.
OVCAR-4


9.5 


Colon Pool 


5.8 


Ovarian ca. 
OVCAR-5 


59.5 


Small Intestine Pool 


8.8 


Ovarian ca. 
IGROV-1 


8.7 


Stomach Pool 


5.3 


Ovarian ca. 
OVCAR-8


3.9 


Bone Marrow Pool 


2.8 


Ovary 


4.4 


Fetal Heart 


13.7 


Breast ca. MCF-7 


27.2 


Heart Pool


j 5 


Breast ca. MDA- 
MB-231 


7.7 


Lymph Node Pool


5.7 


Breast ca. BT 549 


54.7 


Fetal Skeletal Muscle


5.2 


Breast ca. T47D 


100.0


Skeletal Muscle Pool 


>2.2 


Breast ca. MDA-N


■3 


Spleen Pool


\.2 


Breast Pool


.9 


Thymus Pool


1.5 


Trachea


.6


CNS cancer (glio/astro)
U87-MG


.4.8 


Lung


• 7 |CNS cancer (glio/astro) U- 


7.4 
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118-MG 




Fetal Lung 


10.7 


CNS cancer (neuro;met) SK- 
N-AS 


9.2 


Lung ca. NCI- 
N417 


4.1 


CNS cancer (astro) SF-539 


5.6 


Lung ca. LX-1 


27.4 


CNS cancer (astro) SNB-75 


16.0 


Lung ca. NCI- 
H146 


1.2 


CNS cancer (glio) SNB-19


7.9 


Lung ca. SHP-77 


19.5 


CNS cancer (glio) SF-295 


15.2 


Lung ca. A549 


20.6 


Brain (Amygdala) Pool 


6.7 


Lung ca. NCI- 
H526 


1.0 


Brain (cerebellum)


6.3


Lung ca. NCI-H23


33.9 


Brain (fetal)


7.8


Lung ca. NCI- 
H460 


8.0 


Brain (Hippocampus) Pool 


8.3 


Lung ca. HOP-62 


8.5 



Cerebral Cortex Pool 


8.1 


Lung ca. NCI- 
H522 


27.7 


Brain (Substantia nigra) 
Pool 


7.0 


Liver 


1.3 


Brain (Thalamus) Pool 


12.8 


Fetal Liver 


8.4 


Brain (whole) 


6.8 


Liver ca. HepG2 


7.9 


Spinal Cord Pool 


10.6 


Kidney Pool


10.0 


Adrenal Gland 


10.5 


Fetal Kidney 


11.8 


Pituitary gland Pool 


2.8 


Renal ca. 786-0 


9.8 


Salivary Gland 


3.0 


Renal ca. A498 


2.8 


Thyroid (female) 


5.4 


Renal ca. ACHN


9.4


Pancreatic ca. CAPAN2 


24.3 


Renal ca. UO-31 


15.7 


Pancreas Pool 


8.0 



Table DD. Panel 4.1D



Tissue Name 


Rel.Exp.(%)Ag4167, 
Run 173350762 


Tissue Name 


Rel. Exp.(%) 
Ag4167, Run 
173350762 


Secondary Thl act 


8.5 


HUVEC IL-1beta


9.4 


Secondary Th2 act 


10.7 


HUVEC IFN gamma


14.0 


Secondary Trl act 


5.3 


HUVEC TNF alpha + IFN 
gamma 


4.3 


Secondary Th 1 
rest 


6.2 


HUVEC TNF alpha + IL4 


6.1 


Secondary Th2 
rest 


11.5 


HUVEC IL-11 


6.9 


Secondary Trl rest 


7.4 


Lung Microvascular EC 
none 


15.3 
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Lung Microvascular EC 
TNFalpha + IL-lbeta 


10.2 






Microvascular Dermal EC 
none 


17.1 




9.2 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


6.3 






Bronchial epithelium 
TNFalpha + ILlbeta 


28.1 


Primary Th2 rest 




Small airway epithelium 
none 


9.9 


Primary Trl rest 




Small airway epithelium 
TNFalpha + IL-lbeta 


31.4 


CD45RA CD4 
lymphocyte act 




Coronery artery SMC rest 


20.4 


CD45RO CD4 
lymphocyte act 




Coronery artery SMC 
TNFalpha + IL-lbeta 


14.1 


CD8 lymphocyte 
act 




Astrocytes rest 


21.2 


Secondary CD8 
lymphocyte rest 


12.4 


Astrocytes TNFalpha + IL- 
lbeta 


8.6 


Secondary CD8 
lymphocyte act 


12.3 


KU-812 (Basophil) rest 


22.1 


CD4 lymphocyte 
none 




KU-812 (Basophil) 
PMA/ionomycin 


31.4 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


3.4 


CCD 1 106 (Keratinocytes) 
none 


38.4 


LAK cells rest 


21.2 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


30.6 


LAK cells IL-2 


7.0 


Liver cirrhosis 


23.7 


LAK cells IL- 
2+IL-12 




NCI-H292 none 


66.0 


LAK cells IL- 
2+IFN gamma 


4.6 


NCI-H292 IL-4 


61.6 


LAK cells IL-2+ 
IL-18 


11.7 


NCI-H292 IL-9 


100.0 


LAK cells 
PMA/ionomycin 


17.7 


NCI-H292 IL-13


59.5 


NK Cells IL-2 rest 


9.0 


NCI-H292 IFN gamma


68.3 


Two Way MLR 3 
day 


14.8 


HPAEC none 


12.2 


Two Way MLR 5 
day 


14.5 


HPAEC TNF alpha + IL-1 
beta 


23.0 


Two Way MLR 7 


13.6 


Lung fibroblast none 


21.5 
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day 








PBMC rest 


4.5 


Lung fibroblast TNF alpha 
+ IL-1 beta 


29.9 


PBMC PWM


13.2 


Lung fibroblast IL-4 


15.2 


PBMC PHA-L 


20.6 


Lung fibroblast IL-9 


19.2 


Ramos (B cell) 




Lung fibroblast IL-13


21.9 


Ramos (B cell) 
ionomycin 


1.1 


Lung fibroblast IFN 
gamma 


13.1 


B lymphocytes 
PWM 


22.5 


Dermal fibroblast 
CCD 1070 rest 


9.8 


B lymphocytes 
CD40L and IL-4 


6.7 


Dermal fibroblast 
CCD1070 TNF alpha 


12.8 


EOL-1 dbcAMP 


5.5 


Dermal fibroblast 
CCD 1070 IL-1 beta 


7.0 


EOL-1 dbcAMP 
PMA/ionomycin 


1.4 


Dermal fibroblast IFN 
gamma 


8.4 


Dendritic cells 
none 


26.8 


Dermal fibroblast IL-4 


28.1 


Dendritic cells 
LPS 


27.2 


Dermal Fibroblasts rest 


16.3 


Dendritic cells 
anti-CD40 


25.7 


Neutrophils TNFa+LPS 



1.8 


Monocytes rest 


7.9 


Neutrophils rest 


1.0 


Monocytes LPS 


8.4 


Colon 


12.0 


Macrophages rest 


52.5 


Lung 


10.4 


Macrophages LPS 


14.6 


Thymus 


22.2 


HUVEC none 


6.5 


Kidney 


77.9 


HUVEC starved 


11.0 







Table DE. General oncology screening panel_v_2.4



Tissue Name 


Rel. Exp.(%) Ag4167, 
Run 268624306 


Tissue Name 


Rel. Exp.(%) 
Ag4167, Run 
268624306 


Colon cancer 1 


10.4 


Bladder cancer NAT 2 


o 7 


Colon NAT 1 


4.7 


Bladder cancer NAT 3 


0.2 


Colon cancer 2 


18.3 


Bladder cancer NAT 4 


2.7 


Colon cancer 
NAT 2 


10.5 


Adenocarcinoma of the 
prostate 1 


21.3 


Colon cancer 3 


22.8 


Adenocarcinoma of the 
prostate 2 


2.8 


Colon cancer 
NAT 3 


15.3 


Adenocarcinoma of the 
prostate 3 


19.2 
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Colon malignant 
cancer 4 


28.1 


Adenocarcinoma of the 
prostate 4 




Colon normal 
adjacent tissue 4 


5.6 






Lung cancer 1 


13.0 


Adenocarcinoma of the 
prostate 6 




Lung NAT 1 



1.0 


Adenocarcinoma of the 
prostate 7 


7.0 


Lung cancer 2 


41.2 


Adenocarcinoma of the 
prostate 8 


3.5 


Lung NAT 2 


29 jAdenocarcinoma of the 
jprostate 9 


31.6 


Squamous cell 
carcinoma 3 


20.0


Prostate cancer NAT 10


3.1 


Lung NAT 3 


L3 (Kidney cancer 1 


21.8 


metastatic 
melanoma 1 


16.8 


KidneyNAT 1 


11.4


Melanoma 2 


'•» 


Kidney cancer 2 


100.0 


Melanoma 3 


4.8


Kidney NAT 2 


43.5 


metastatic 
melanoma 4 


24.7 


Kidney cancer 3 


44.4 


metastatic 
melanoma 5 


35.8 


Kidney NAT 3 


14.3 


Bladder cancer 1 


2.9 


Kidney cancer 4 


17.6 


Bladder cancer 
NAT 1 


10 


Kidney NAT 4 


9.9 


Bladder cancer 2 


3.7 







CNS_neurodegeneration_v1.0 Summary: Ag4167 This panel confirms the
expression of the CG100212-01 gene at low levels in the brains of an independent group
of individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4167 Highest expression of the
CG100212-01 gene is detected in a breast cancer T47D cell line (CT=26). High expression
of this gene is seen in a cluster of breast, ovarian, colon, gastric, renal, lung, pancreatic,
CNS, hepatic and prostate cancer cell lines and melanoma cell lines. Thus, therapeutic
modulation of this gene product could be beneficial in the treatment of these cancers.
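The ranking that underlies a statement such as "highest expression is detected in T47D" can be reproduced by sorting the table entries. The following is a minimal sketch in Python using a handful of values transcribed from Table DC. The subset chosen and the helper name `top_samples` are illustrative assumptions, and the T47D value is taken as 100.0 on the basis of the summary, since the table entry is partly illegible.

```python
# Subset of Ag4167 relative expression values (%) transcribed from Table DC.
# Only a few rows are included; this is not the full panel.
TABLE_DC_SUBSET = {
    "Breast ca. T47D": 100.0,  # assumed from the summary (highest expression)
    "Ovarian ca. OVCAR-5": 59.5,
    "Breast ca. BT 549": 54.7,
    "Gastric ca. KATO III": 31.9,
    "Ovarian ca. OVCAR-3": 28.3,
    "Lung ca. LX-1": 27.4,
    "Liver": 1.3,
    "Placenta": 0.7,
}


def top_samples(rel_exp, n=3):
    """Return the n (sample, value) pairs with the highest relative expression."""
    return sorted(rel_exp.items(), key=lambda item: item[1], reverse=True)[:n]


if __name__ == "__main__":
    for name, value in top_samples(TABLE_DC_SUBSET):
        print(f"{name}: {value:.1f}")
```

Sorting the full table in the same way would show which cancer cell lines form the cluster described in the summary.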
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Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

In addition, this gene is expressed at high levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4167 Highest expression of the CG100212-01 gene is
detected in IL-9 treated NCI-H292 cell line (CT=31). This gene is expressed at high to
moderate levels in a wide range of cell types of significance in the immune response in
health and disease. These cells include members of the T-cell, B-cell, endothelial cell,
macrophage/monocyte, and peripheral blood mononuclear cell families, as well as epithelial
and fibroblast cell types from lung and skin, and normal tissues represented by colon,
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene
product may be involved in homeostatic processes for these and other cell types and
tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4167 Highest
expression of the CG100212-01 gene is detected in kidney cancer (CT=28.9), with
significant expression also seen in metastatic melanoma, colon, lung, bladder, prostate and
kidney cancers. In addition, expression of this gene is higher in the cancers than in the
normal adjacent tissue. Therefore, expression of this gene could be used as a marker to detect
the presence of these cancers. Furthermore, therapeutic modulation of the expression or
function of this gene may be effective in the treatment of these cancers.
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E. CG100222-01 : NADP-Dependent Leukotriene B4 

Expression of gene CG100222-01 was assessed using the primer-probe set
Ag4169, described in Table EA. Results of the RTQ-PCR runs are shown in Tables EB
and EC.



Table EA. Probe Name Ag4169

| Primers | Sequences | Length | Start Position | SEQ ID No |
|---------|-----------|--------|----------------|-----------|
| Forward | 5'-gagcttcacatggaagggtt-3' | 20 | 783 | 215 |
| Probe | TET-5'-attgtgacctgctggccaggagat-3'-TAMRA | 24 | 804 | 216 |
| Reverse | 5'-ctctgagacccatttcagca-3' | 20 | 853 | 217 |



Table EB. General_screening_panel_v1.4

| Tissue Name | Rel. Exp.(%) Ag4169, Run 221035855 | Tissue Name | Rel. Exp.(%) Ag4169, Run 221035855 |
|---|---|---|---|
| Adipose | 0.1 | Renal ca. TK-10 | 0.0 |
| Melanoma* Hs688(A).T | 0.0 | Bladder | 0.7 |
| Melanoma* Hs688(B).T | 0.4 | Gastric ca. (liver met.) NCI-N87 | 2.7 |
| Melanoma* M14 | 0.0 | Gastric ca. KATO III | 0.2 |
| Melanoma* LOXIMVI | 0.2 | Colon ca. SW-948 | 0.0 |
| Melanoma* SK-MEL-5 | 1.4 | Colon ca. SW480 | 1.1 |
| Squamous cell carcinoma SCC-4 | 0.3 | Colon ca.* (SW480 met) SW620 | 0.6 |
| Testis Pool | 2.4 | Colon ca. HT29 | 0.6 |
| Prostate ca.* (bone met) PC-3 | 0.0 | Colon ca. HCT-116 | 0.2 |
| Prostate Pool | 0.0 | Colon ca. CaCo-2 | 4.6 |
| Placenta | 0.1 | Colon cancer tissue | 0.0 |
| Uterus Pool | 0.0 | Colon ca. SW1116 | 0.2 |
| Ovarian ca. OVCAR-3 | 1.6 | Colon ca. Colo-205 | 0.0 |
| Ovarian ca. SK-OV-3 | 100.0 | Colon ca. SW-48 | 0.0 |
| Ovarian ca. OVCAR-4 | 0.0 | Colon Pool | 0.0 |
| Ovarian ca. OVCAR-5 | 0.8 | Small Intestine Pool | 0.2 |
| Ovarian ca. IGROV-1 | 0.4 | Stomach Pool | 0.3 |
| Ovarian ca. OVCAR-8 | 0.2 | Bone Marrow Pool | 0.2 |
| Ovary | 0.0 | Fetal Heart | 0.1 |
| Breast ca. MCF-7 | 0.3 | Heart Pool | 0.3 |
| Breast ca. MDA-MB-231 | 0.0 | Lymph Node Pool | 0.4 |
| Breast ca. BT 549 | 0.3 | Fetal Skeletal Muscle | 0.0 |
| Breast ca. T47D | 0.7 | Skeletal Muscle Pool | 0.0 |
| Breast ca. MDA-N | 0.2 | Spleen Pool | 0.0 |
| Breast Pool | 0.1 | Thymus Pool | 0.3 |
| Trachea | 0.0 | CNS cancer (glio/astro) U87-MG | 0.0 |
| Lung | 0.1 | CNS cancer (glio/astro) U-118-MG | 0.3 |
| Fetal Lung | 0.3 | CNS cancer (neuro;met) SK-N-AS | 12.3 |
| Lung ca. NCI-N417 | 0.2 | CNS cancer (astro) SF-539 | 0.0 |
| Lung ca. LX-1 | 0.5 | CNS cancer (astro) SNB-75 | 0.4 |
| Lung ca. NCI-H146 | 0.2 | CNS cancer (glio) SNB-19 | 0.0 |
| Lung ca. SHP-77 | 1.6 | CNS cancer (glio) SF-295 | 0.0 |
| Lung ca. A549 | 0.0 | Brain (Amygdala) Pool | 0.0 |
| Lung ca. NCI-H526 | 0.4 | Brain (cerebellum) | 0.0 |
| Lung ca. NCI-H23 | 3.7 | Brain (fetal) | 0.3 |
| Lung ca. NCI-H460 | 10.5 | Brain (Hippocampus) Pool | 0.0 |
| Lung ca. HOP-62 | 0.2 | Cerebral Cortex Pool | 0.3 |
| Lung ca. NCI-H522 | 0.4 | Brain (Substantia nigra) Pool | 0.0 |
| Liver | 0.0 | Brain (Thalamus) Pool | 0.0 |
| Fetal Liver | 0.1 | Brain (whole) | 0.2 |
| Liver ca. HepG2 | 0.6 | Spinal Cord Pool | 0.0 |
| Kidney Pool | 0.6 | Adrenal Gland | 0.0 |
| Fetal Kidney | 1.1 | Pituitary gland Pool | 0.0 |
| Renal ca. 786-0 | 0.0 | Salivary Gland | 0.0 |
| Renal ca. A498 | 0.2 | Thyroid (female) | 0.2 |
| Renal ca. ACHN | 0.0 | Pancreatic ca. CAPAN2 | 0.7 |
| Renal ca. UO-31 | 0.0 | Pancreas Pool | 3.0 |



Table EC. Panel 4.1D



Tissue Name


Rel. Exp.(%)
Ag4169, Run
173333241


Tissue Name


Rel. Exp.(%)
Ag4169, Run
173333241






Secondary Thl act 


0.0 


HUVEC IL-1beta


0.4 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 




Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN gamma 


™ 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL- 11 


0.0 


Secondary Tr1 rest


0.0


Lung Microvascular EC none 


0.4 


Primary Th1 act


0.0


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


0.0 


Primary Th2 act


0.0


Microvascular Dermal EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th 1 rest 


0.4 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.0 


Primary Th2 rest 


0.5 


Small airway epithelium none 


0.0 


Primary Trl rest 


0.3 


Small airway epithelium TNFalpha 
+ IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.5 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.7 


Coronery artery SMC TNFalpha + 
IL-lbeta 


0.0 


CD8 lymphocyte act 


0.4 


Astrocytes rest 


0.0 


Secondary CD 8 
lymphocyte rest 


0.8 


Astrocytes TNFalpha + IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest


0.3 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) PMA/ionomycin


3.4 


2ry Th1/Th2/Tr1 anti-
CD95 CH11 


0.0 


CCD1106 (Keratinocytes) none


0.0 


LAK cells rest 


0.0 


CCD1106(Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis


J.O 


LAK cells IL-2+IL-12


0.0 


NCI-H292 none 1 


.2 


LAK cells IL-2+IFN 
gamma 


0.0 


VCI-H292 IL-4 | 


.3 


LAK cells IL-2+ IL- 
18 


3.7 


NCI-H292 IL-9


1.1 


LAK cells 
PMA/ionomycin 


10 1 


■4CI-H292 IL-1 3 ( 


).8 


NK Cells IL-2 rest 


).0 1 


^JCI-H292 IFN gamma ( 


.8 


Two Way MLR 3 day 


).0 1 


HPAEC none


0.3


Two Way MLR 5 day


0.0 


HPAEC TNF alpha + IL-1 beta




Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.4 


Lung fibroblast TNF alpha + IL-1 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13


0.4 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN gamma


0.4 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest


0.8 


B lymphocytes 
CD40L and IL-4 


0.0 


Dermal fibroblast CCD1070 TNF
alpha 


0.0 


EOL-1 dbcAMP 


0.4 


Dermal fibroblast CCD1070 IL-1
beta 




0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


,.5 


Dendritic cells none 


0.4


Dermal fibroblast IL-4


0.9


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


1.5


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


1.3 


Monocytes rest 


0.0 


Neutrophils rest 


0.4


Monocytes LPS 


0.0 


Colon 


1.3 


Macrophages rest 


0.0 


Lung 


3.0 


Macrophages LPS 


0.0 


Thymus 


7.6 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved


0.0 







CNS_neurodegeneration_v1.0 Summary: Ag4169 Expression of the
CG100222-01 gene is low/undetectable (CTs > 34.5) across all of the samples on this
panel (data not shown).



General_screening_panel_v1.4 Summary: Ag4169 Highest expression of the
CG100222-01 gene is detected in the ovarian cancer SK-OV-3 cell line (CT=28). In addition,
low but significant expression of this gene is seen in CNS cancer, colon cancer, lung
cancer, ovarian cancer and melanoma cell lines. Thus, expression of this gene could be
used as a diagnostic marker for the detection of these cancers. Therapeutic modulation of this
gene product, through the use of antibodies or small molecule drugs, may be beneficial in
the treatment of these cancers. [mpattu, 25-Mar-02]

Panel 4.1D Summary: Ag4169 Highest expression of the CG100222-01 gene is
detected in kidney (CT=29). Thus, expression of this gene can be used to distinguish the
kidney sample from the other samples used in this panel. In addition, low but significant
expression of this gene is also seen in lung and thymus. Therefore, therapeutic modulation
of this gene may be beneficial in the treatment of autoimmune and inflammatory diseases
that affect the lung and kidney.

Panel 5 Islet Summary: Ag4169 Expression of the CG100222-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

General oncology screening panel_v_2.4 Summary: Ag4169 Results from one
experiment with the CG100222-01 gene are not included. The amp plot indicates that
there were experimental difficulties with this run.

F. CG100266-01 and CG100266-02: Cyclophilin A Like Gene 

Expression of gene CG100266-01 and full length clone CG100266-02 was
assessed using the primer-probe set Ag4174, described in Table FA. Results of the
RTQ-PCR runs are shown in Tables FB, FC and FD.



Table FA. Probe Name Ag4174

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-gggagaaatttgatgacaagaa-3' | 22 | 371 | 218
Probe | TET-5'-cttcatcctaaagcacgcaggtcctg-3'-TAMRA | 26 | 393 | 219
Reverse | 5'-actgtttgtgttgggtccagta-3' | 22 | 438 | 220
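The Length column of Table FA can be cross-checked against the listed sequences. The following is a minimal sketch, not part of the original disclosure; the function names are hypothetical, and the span calculation assumes that all start positions are 1-based coordinates on the same strand of the target sequence, which the table does not state.

```python
# Oligonucleotides transcribed from Table FA (primer-probe set Ag4174).
# Each entry is (sequence, start position as listed in the table).
PRIMERS = {
    "Forward": ("gggagaaatttgatgacaagaa", 371),
    "Probe": ("cttcatcctaaagcacgcaggtcctg", 393),
    "Reverse": ("actgtttgtgttgggtccagta", 438),
}


def oligo_lengths(primers):
    """Return the length in bases of each oligonucleotide."""
    return {name: len(seq) for name, (seq, _start) in primers.items()}


def amplicon_span(primers):
    """Return (first base, last base, length) of the amplified region.

    Assumption: start positions are 1-based and refer to the leftmost
    base of each oligonucleotide on the same strand.
    """
    _fwd_seq, fwd_start = primers["Forward"]
    rev_seq, rev_start = primers["Reverse"]
    last_base = rev_start + len(rev_seq) - 1
    return fwd_start, last_base, last_base - fwd_start + 1


if __name__ == "__main__":
    print(oligo_lengths(PRIMERS))
    print(amplicon_span(PRIMERS))
```

Under the stated assumption the probe begins immediately after the forward primer (371 + 22 = 393), which is consistent with the positions given in the table.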



Table FB. General_screening_panel_v1.4



Tissue Name 


Rel. Exp.(%) 
Ag4174, Run 
221034287 


Tissue Name 


Rel.Exp.(%) Ag4174, 
Run 221034287 


Adipose 


1.6 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI-N87 


3.2 


Melanoma* M14


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


2.4 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.0 


Testis Pool 


14.0 


Colon ca. HT29 


0.0 
Prostate ca.* (bone
met) PC-3 


2.9 


Colon ca. HCT-116 


5.3 






Colon ca. CaCo-2 


0.0 


Placenta


0.0


Colon cancer tissue


0.0


Uterus Pool


0.0


Colon ca. SW1116


0.0


Ovarian ca.
OVCAR-3




Colon ca. Colo-205


0.0


Ovarian ca. SK-
OV-3




Colon ca. SW-48


0.0


Ovarian ca.
OVCAR-4




Colon Pool


0.0


Ovarian ca.
OVCAR-5


2.3


Small Intestine Pool


0.0


Ovarian ca.
IGROV-1


0.0


Stomach Pool


2.3


Ovarian ca.
OVCAR-8


0.0


Bone Marrow Pool


1.6


Ovary


0.0


Fetal Heart


0.0


Breast ca. MCF-7


0.0


Heart Pool


0.0


Breast ca. MDA-
MB-231




Lymph Node Pool


0.0


Breast ca. BT 549


0.0


Fetal Skeletal Muscle 


3.7 


Breast ca. T47D 


5.5 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


3.5 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


25.3


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U-118-
MG


5.5 


Fetal Lung 


62.0 


CNS cancer (neuro;met) SK-N- 
AS 


0.0 


Lung ca. NCI- 
N417 


0.0 


CNS cancer (astro) SF-539 


3.3 


Lung ca. LX-1 


0.0 


CNS cancer (astro) SNB-75 


0.0 


Lung ca. NCI- 
H146 


100.0 


CNS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295




Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


5.2 


Lung ca. NCI- 
H526 


0.0 


Brain (cerebellum)


0.0 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


4.0 


Lung ca. NCI- 
H460 


0.0 


Brain (Hippocampus) Pool 


14.8 
Lung ca. HOP-62


2.5 


Cerebral Cortex Pool 


■ 8 


Lung ca. NCI- 
H522 




Brain (Substantia nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


1.2 


Fetal Liver 


0.0 


Brain (whole)


4.4


Liver ca. HepG2


2.2


Spinal Cord Pool


3.6


Kidney Pool


0.9


Adrenal Gland


0.0


Fetal Kidney


18.6


Pituitary gland Pool


2.3


Renal ca. 786-0


0.0


Salivary Gland


0.0


Renal ca. A498


10.0


Thyroid (female)


0.0


Renal ca. ACHN


0.0


Pancreatic ca. CAPAN2


0.0


Renal ca. UO-31


0.0


Pancreas Pool


2.7


Table FC. Panel 4.1D


Tissue Name


Rel. Exp.(%)
Ag4174, Run
173507624


Tissue Name


Rel. Exp.(%)
Ag4174, Run
173507624


Secondary Th1 act


0.0


HUVEC IL-1beta


0.0


Secondary Th2 act


0.0


HUVEC IFN gamma


0.0


Secondary Tr1 act


0.0


HUVEC TNF alpha + IFN gamma


0.0


Secondary Th1 rest


0.0


HUVEC TNF alpha + IL4


0.0


Secondary Th2 rest


0.0


HUVEC IL-11


0.0


Secondary Tr1 rest


0.0


Lung Microvascular EC none 


0.0 


Primary Th 1 act 


0.0 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


10 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.5 


Primary Th 1 rest 


3.0 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.0 


Primary Th2 rest


0.0


Small airway epithelium none


0.0


Primary Tr1 rest


0.0


Small airway epithelium TNFalpha
+ IL-1beta


0.0


CD45RA CD4
lymphocyte act


0.0


Coronary artery SMC rest


0.0


CD45RO CD4
lymphocyte act


0.0


Coronary artery SMC TNFalpha +
IL-1beta


0.0


CD8 lymphocyte
act


0.0


Astrocytes rest


0.0


Secondary CD8
lymphocyte rest


0.0


Astrocytes TNFalpha + IL-1 beta


3.0


Secondary CD8
lymphocyte act


0.0


KU-812 (Basophil) rest


0.0




CD4 lymphocyte 
none 


0.6 




KU-812 (Basophil)
PMA/ionomycin


0.0


2ry
Th1/Th2/Tr1 anti-
CD95 CH11


1


CCD1106 (Keratinocytes) none


0.0


LAK cells rest


0.0


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


0.0


LAK cells IL-2


0.0


Liver cirrhosis


0.0


LAK cells IL-
2+IL-12


0.0 


NCI-H292 none 


0.0 


LAK cells IL- 
2+IFN gamma 


0.0 


NCI-H292 IL-4 


1.0 


LAK cells IL-2+ 
IL-18 


0.0 


NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-13


0.0 


NK Cells IL-2 rest 


- 


NCI-H292 IFN gamma


0.6


Two Way MLR 3 
day 


0.0 


HPAEC none


0.0


Two Way MLR 5 
day 


0.0 


HPAEC TNF alpha + IL-1 beta 


0.0 


Two Way MLR 7 
day 


0.6 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1 
beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 




Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
none 


0.0 


Lung fibroblast IL-13


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes 
PWM 


0.0 


Dermal fibroblast CCD 1 070 rest 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


Dermal fibroblast CCD 1070 TNF 
alpha 


0.2 


EOL-1 dbcAMP 


3.0 


Dermal fibroblast CCD 1070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


3.4 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells 
none 


3.0 


Dermal fibroblast IL-4 


1.4 


Dendritic cells LPS 


3.3 


Dermal Fibroblasts rest 


2.7 
Dendritic cells
anti-CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


1.0 


Monocytes LPS 


0.0


Colon 


1.8 


Macrophages rest 


0.5 


Lung 


4.0 


Macrophages LPS 


0.0


Thymus 


21.9 


HUVEC none


0.0 


Kidney 


100.0 


HUVEC starved


0.0




Table FD. General oncology screening panel_v_2.4


Tissue Name 


Rel. Exp.(%) 
Ag4174, Run 
268624376 


Tissue Name 


Rel. Exp.(%) 
Ag4174, Run 
268624376 


Colon cancer 1 


0.0 


Bladder cancer NAT 2 


4.3 


Colon cancer NAT 
1 


0.0 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


0.0 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 
2 


0.0 


Adenocarcinoma of the prostate 1 


4.2 


Colon cancer 3 


0.0


Adenocarcinoma of the prostate 2


0.0


Colon cancer NAT 3


0.0 


Adenocarcinoma of the prostate 3 


5.6 


Colon malignant 
cancer 4 


0.0 


Adenocarcinoma of the prostate 4 


0.0 


Colon normal 
adjacent tissue 4 


0.0 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


4.9 


Adenocarcinoma of the prostate 6 


0.0 


Lung NAT 1 


14.2 


Adenocarcinoma of the prostate 7 


4.0 


Lung cancer 2 


5.2 


Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 


8.8 


Adenocarcinoma of the prostate 9 


0.0 


Squamous cell 
carcinoma 3 


5.2 


Prostate cancer NAT 10 


0.0 


Lung NAT 3 


9.7 


Kidney cancer 1 


0.0 


metastatic 
melanoma 1 


100.0 


Kidney NAT 1


0.0 


Melanoma 2 


4.2 


Kidney cancer 2 


0.0 


Melanoma 3 


0.0 


Kidney NAT 2 


0.0 


metastatic 
melanoma 4 


0.0 


Kidney cancer 3 


0.0 


metastatic 
melanoma 5 


18.6 


Kidney NAT 3 


0.0 


Bladder cancer 1 


0.0 


Kidney cancer 4


0.0 
Bladder cancer
NAT 1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 


0.0 







CNS_neurodegeneration_v1.0 Summary: Ag4174 Results from one experiment
with the CG100266-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.



General_screening_panel_v1.4 Summary: Ag4174 Highest expression of the
CG100266-01 gene is seen in a lung cancer cell line (CT=30.2). Significant levels of
expression are also seen in fetal lung tissue, which is more proliferative than its adult
counterpart. Thus, expression of this gene, a cyclophilin A homolog, could be used to
differentiate between fetal lung and adult lung and as a marker of lung cancer. This is in
agreement with published reports that a novel cyclophilin-like molecule is co-expressed in
small cell lung cancer cell lines (Kim JO, Oncogene 1998 Aug 27;17(8):1019-26).
Furthermore, therapeutic modulation of the expression or function of this gene may be
useful in the treatment of lung cancer.

Low but significant levels of expression are also seen in the amygdala,
hippocampus and whole and fetal brain. Cyclophilin A has been implicated in neuronal
differentiation and human embryonic brain cells (Nahreini P, Cell Mol Neurobiol 2001
Feb;21(1):65-79). Based on the expression of this cyclophilin homolog in the CNS,
modulation of the expression of this gene may be useful in the treatment of
neurodegenerative disorders, including Alzheimer's disease, Parkinson's disease,
schizophrenia, multiple sclerosis, stroke and epilepsy.

Panel 4.1D Summary: Ag4174 Expression of the CG100266-01 gene is restricted 
to normal kidney, lung and thymus (CTs=30-34). This pattern of expression suggests that 
this gene product may be involved in the normal homeostasis of these tissues. Therapeutic 
modulation of the expression or function of this gene may be useful in maintaining or 
restoring function to these organs during inflammation. 
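The relationship between the CT values quoted in these summaries and the Rel. Exp.(%) values in the tables can be illustrated with a short calculation. This is a sketch under stated assumptions rather than the method used to generate the tables: it assumes that each PCR cycle doubles the product (100% amplification efficiency) and that the sample with the lowest CT on a panel is assigned 100. The function names are hypothetical.

```python
import math


def relative_expression(ct, ct_highest):
    """Percent expression relative to the highest-expressing sample.

    Assumption: one cycle corresponds to a two-fold difference in
    template, so a sample detected one cycle later scores 50.
    """
    return 100.0 * 2.0 ** (ct_highest - ct)


def cycles_behind(percent):
    """Number of cycles by which a sample lags the 100% sample."""
    return math.log2(100.0 / percent)


if __name__ == "__main__":
    # Values taken from Table FC: thymus 21.9, lung 4.0 (kidney = 100.0).
    for tissue, pct in (("Thymus", 21.9), ("Lung", 4.0)):
        print(tissue, round(cycles_behind(pct), 1))
```

Under these assumptions the thymus and lung values in Table FC would correspond to roughly 2.2 and 4.6 cycles behind kidney, which is in line with the CT range of 30-34 reported for this panel.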

General oncology screening panel_v_2.4 Summary: Ag4174 Expression of the
CG100266-01 gene is restricted to a melanoma (CT=33.8). Thus, expression of this gene
could be used to differentiate between this sample and other samples on this panel and as a
marker to detect the presence of melanoma. Furthermore, therapeutic modulation of the
expression or function of this gene may be effective in the treatment of melanoma.
G. CG100456-01: CoA Transferase-Like

Expression of gene CG100456-01 was assessed using the primer-probe set
Ag4179, described in Table GA. Results of the RTQ-PCR runs are shown in Tables GB,
GC and GD.

Table GA. Probe Name Ag4179

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-ccaggtgtggaaatgaagaag-3' | 21 | 1030 | 221
Probe | TET-5'-atcactgacacccacacgccgtct-3'-TAMRA | 24 | 1051 | 222
Reverse | 5'-gcatgtaatagagcgtcaggtt-3' | 22 | 1087 | 223



Table GB. General_screening_panel_v1.4



Tissue Name


Rel. Exp.(%)
Ag4179, Run 
221117873 


Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
221117873 


Adipose


2.4 


Renal ca. TK-10 


43.2 


Melanoma* 
Hs688(A).T 


18.8 


Bladder 


8.4 


Melanoma* 
Hs688(B).T


20.2 


Gastric ca. (liver met.) NCI-N87


23.5


Melanoma* M14


23.0


Gastric ca. KATO III


55.5


Melanoma* 
LOXIMVI 


8.9 


Colon ca. SW-948


19.8


Melanoma* SK- 
MEL-5 


22.7 


Colon ca. SW480


28.5 


Squamous cell 
carcinoma SCC-4 


22.1 


Colon ca.* (SW480 met)
SW620 


15.3 


Testis Pool 


9.2 


Colon ca. HT29 


18.0 


Prostate ca.* (bone 
met) PC-3 


24.0 


Colon ca. HCT-116 


38.2 


Prostate Pool p. 3 (Colon ca. CaCo-2 


24.1 


Placenta J 1 5.6 


Colon cancer tissue 


13.1 


Uterus Pool 


1.6 


Colon ca. SW1116


12.0 


Ovarian ca. 
OVCAR-3 


28.7 


Colon ca. Colo-205 


16.4 


Ovarian ca. SK- 
OV-3 


36.3 


Colon ca. SW-48 


12.0 


Ovarian ca. 
OVCAR-4 


20.6 


Colon Pool 


8.0 


Ovarian ca. 
OVCAR-5 


20.3 


Small Intestine Pool 


6.0 
Ovarian ca.
IGROV-1


17.3


Stomach Pool 


4.1 


Ovarian ca. 
OVCAR-8 


12.7 


Bone Marrow Pool 


2.6 






Fetal Heart 


5.6 


Breast ca. MCF-7


~j 


Heart Pool 


4.1 


Breast ca. MDA- 
MB-231


39.5 


Lymph Node Pool 


9.2 


Breast ca. BT 549 


100.0 


Fetal Skeletal Muscle 


4.8 


Breast ca. T47D 


36.6 


Skeletal Muscle Pool 


M 5 


Breast ca. MDA-N 


16.6 


Spleen Pool 


5.1 


Breast Pool 


7.5 


Thymus Pool 


6.6 


Trachea 


10.6 


CNS cancer (glio/astro) U87-
MG


16.0


Lung


0.8


CNS cancer (glio/astro) U-118-
MG


34.6


Fetal Lung


9.6


CNS cancer (neuro;met) SK-N-
AS


24.7


Lung ca. NCI-
N417


15.2


CNS cancer (astro) SF-539


28.3


Lung ca. LX-1


21.0


CNS cancer (astro) SNB-75


53.6


Lung ca. NCI-
H146


7.5


CNS cancer (glio) SNB-19


13.6


Lung ca. SHP-77


17.8


CNS cancer (glio) SF-295


40.6


Lung ca. A549


14.6


Brain (Amygdala) Pool


5.1


Lung ca. NCI-
H526


4.8


Brain (cerebellum)


14.2


Lung ca. NCI-H23


18.0


Brain (fetal)


6.7


Lung ca. NCI-
H460


8.4


Brain (Hippocampus) Pool


4.9


Lung ca. HOP-62


14.3


Cerebral Cortex Pool


5.1


Lung ca. NCI-
H522


12.1


Brain (Substantia nigra) Pool


6.0


Liver


5.0


Brain (Thalamus) Pool


7.0


Fetal Liver


10.2


Brain (whole)


7.5


Liver ca. HepG2


70.7


Spinal Cord Pool


7.6


Kidney Pool


11.5


Adrenal Gland


9.3


Fetal Kidney


4.9


Pituitary gland Pool


3.3


Renal ca. 786-0


20.3


Salivary Gland


0.1


Renal ca. A498


5.1


Thyroid (female)


5.2


Renal ca. ACHN


5.7


Pancreatic ca. CAPAN2


U.9 \ 
Renal ca. UO-31


13.9


Pancreas Pool



Table GC. Panel 4.1D



Tissue Name 


Rel.Exp.(%) 
Ag4179, Run 
173507602 


Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
173507602 


Secondary Th 1 act 


56.3 


HUVEC IL-1beta


S 


Secondary Th2 act 


50.0 


HUVEC IFN gamma


61.6 


Secondary Trl act 


3.8 


HUVEC TNF alpha + IFN
gamma 


43.8 


Secondary Th 1 j , 
rest [ 4-J 


HUVEC TNF alpha + IL4


50.0 


Secondary Th2
rest 


24.1 


HUVEC IL-11 


41.5 


Secondary Trl rest 


25.5 


Lung Microvascular EC none


76.8






Lung Microvascular EC
TNFalpha + IL-1beta


58.2


Primary Th2 act


39.8


Microvascular Dermal EC none


54.3


Primary Tr1 act


33.7


Microvascular Dermal EC
TNFalpha + IL-1beta


39.8


Primary Th 1 rest 


19.5 


Bronchial epithelium TNFalpha 
+ IL-1beta


40.6 


Primary Th2 rest 


18.8


Small airway epithelium none 


24.1 


Primary Trl rest 


24.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


43.5 


CD45RA CD4 
lymphocyte act 


55.5 


Coronery artery SMC rest 


49.7 


CD45RO CD4 
lymphocyte act 


35.4 


Coronery artery SMC TNFalpha 
+ IL-1beta


51.1 


CD8 lymphocyte
act 


40.9 


Astrocytes rest 


46.7 


Secondary CD8
lymphocyte rest 


35.8 


Astrocytes TNFalpha + IL-1 beta 


24.0 


Secondary CD8 
lymphocyte act 


29.1 


KU-812 (Basophil) rest 


39.2 


CD4 lymphocyte 
none 


12.9 


KU-812 (Basophil)
PMA/ionomycin 


54.7 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


34.9 


CCD1106 (Keratinocytes) none


50.0 


LAK cells rest 


46.3 


CCD1106 (Keratinocytes)
TNFalpha + IL-1 beta


41.5 


LAK cells IL-2 


40.9 


Liver cirrhosis 


6.8 
LAK cells IL-
2+IL-12


43.5


NCI-H292 none


17.4


LAK cells IL-
2+IFN gamma


43.8


NCI-H292 IL-4


23.2


LAK cells IL-2+
IL-18


40.3


NCI-H292 IL-9


28.1


LAK cells
PMA/ionomycin


10.6


NCI-H292 IL-13


30.6


NK Cells IL-2 rest


45.4


NCI-H292 IFN gamma


25.0


Two Way MLR 3
day


44.1


HPAEC none


44.1


Two Way MLR 5
day


38.2


HPAEC TNF alpha + IL-1 beta


49.7


Two Way MLR 7
day


40.6


Lung fibroblast none


39.8


PBMC rest


22.2


Lung fibroblast TNF alpha + 1L- i 
1 beta ; 3lJ 


PBMC PWM


34.6 


Lung fibroblast IL-4


47.6


PBMC PHA-L


42.0 


Lung fibroblast IL-9 


50.7


Ramos (B cell)
none


- 


Lung fibroblast IL-13 


40.6 


Ramos (B cell)
ionomycin




Lung fibroblast IFN gamma


55.1


B lymphocytes 
PWM 




Dermal fibroblast CCD 1 070 rest 


56.3 


B lymphocytes 
CD40L and IL-4 




Dermal fibroblast CCD 1 070 
TNF alpha 


59.5 


EOL-1 dbcAMP


... ^ 


Dermal fibroblast CCD1070 IL-
1 beta


45.1 


EOL-1 dbcAMP
PMA/ionomycin




Dermal fibroblast IFN gamma


31.6 


Dendritic cells
none


54.7 


Dermal fibroblast IL-4 


40.1 


Dendritic cells
LPS


9.2 


Dermal Fibroblasts rest 


30.6 


Dendritic cells
anti-CD40


1.1 


Neutrophils TNFa+LPS 


2.2 


Monocytes rest


88.9


Neutrophils rest


36.1


Monocytes LPS


40.3


Colon


12.6


Macrophages rest


50.3


Lung


2.8


Macrophages LPS


37.1


Thymus


7.4


HUVEC none


41.5


Kidney


8.2


HUVEC starved


53.2

Table GD. General oncology screening panel_v_2.4



Tissue Name


Rel. Exp.(%)
Ag4179, Run 
268695203 


Tissue Name 


Rel. Exp.(%) 
Ag4179, Run 
268695203 


Colon cancer 1 


34.4 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


18.9 


Bladder cancer NAT 3 


0.2 


Colon cancer 2 


36.1 


Bladder cancer NAT 4 


Colon cancer NAT 
2 


17.9 


Adenocarcinoma of the prostate 
1 


16.4 


Colon cancer 3 


76.3 


Adenocarcinoma of the prostate 
2 


1.3 


Colon cancer NAT 
3 


34.2 


Adenocarcinoma of the prostate
3


6.3


Colon malignant 


100.0 


Adenocarcinoma of the prostate j 

4 


Colon normal 
adjacent tissue 4 


12.9 


Prostate cancer NAT 5


3.0


Lung cancer 1 


15.4 


Adenocarcinoma of the prostate i, _ 

6 i J -' 


Lung NAT 1 


1.1 


Adenocarcinoma of the prostate 


Lung cancer 2 


25.0 


Adenocarcinoma of the prostate 
8 


1.2 


Lung NAT 2 


1.5 


Adenocarcinoma of the prostate 
9 


21.8 


Squamous cell 
carcinoma 3 


24.1 


Prostate cancer NAT 10 


0.7 


Lung NAT 3 


0.4 


Kidney cancer 1 


18.9 


metastatic 
melanoma 1 


15.6 


Kidney NAT 1


6.8 


Melanoma 2 


3.8 


Kidney cancer 2 


51.1 


Melanoma 3 


1.6 


Kidney NAT 2 


19.1 


metastatic 
melanoma 4 


39.2 


Kidney cancer 3 


16.6 


metastatic 
melanoma 5 


36.6 


Kidney NAT 3 


3.8 


Bladder cancer 1 


0.5 


Kidney cancer 4 


21.9 


Bladder cancer 
NAT 1 


0.0 


Kidney NAT 4 


21.5 


Bladder cancer 2 


2.5 
General_screening_panel_v1.4 Summary: Ag4179 Highest expression of the
CG100456-01 gene is detected in the breast cancer BT 549 cell line (CT=24). High
expression of this gene is seen in a cluster of breast, ovarian, colon, gastric, renal, lung,
pancreatic, CNS, hepatic and prostate cancer cell lines and melanoma cell lines. Thus,
therapeutic modulation of this gene product could be beneficial in the treatment of these
cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

In addition, this gene is expressed at high levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4179 Highest expression of the CG100456-01 gene is
detected in ionomycin treated Ramos B cells (CT=28). This gene is expressed at high to
moderate levels in a wide range of cell types of significance in the immune response in
health and disease. These cells include members of the T-cell, B-cell, endothelial cell,
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial
and fibroblast cell types from lung and skin, and normal tissues represented by colon,
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene
product may be involved in homeostatic processes for these and other cell types and
tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may alter functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.
General oncology screening panel_v_2.4 Summary: Ag4179 Highest
expression of the CG100456-01 gene is detected in malignant colon cancer (CT=26), with
significant expression also seen in metastatic melanoma, colon, lung, bladder, prostate and
kidney cancers. In addition, expression of this gene is higher in the cancers than in the
normal adjacent tissue. Therefore, expression of this gene could be used as a marker to detect
the presence of these cancers. Furthermore, therapeutic modulation of the expression or
function of this gene may be effective in the treatment of these cancers.
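The comparison between cancers and normal adjacent tissue (NAT) can be made explicit by computing tumor-to-NAT ratios from Table GD. The sketch below is illustrative only. The values are transcribed from the colon, lung and kidney rows of Table GD, and pairing a tumor with the NAT sample of the same number is an assumption, as are the dictionary and function names.

```python
# (tumor Rel. Exp. %, matched NAT Rel. Exp. %) transcribed from Table GD.
# Pairing by sample number is an assumption made for illustration.
PAIRS = {
    "Colon 1": (34.4, 18.9),
    "Colon 2": (36.1, 17.9),
    "Colon 3": (76.3, 34.2),
    "Colon 4": (100.0, 12.9),
    "Lung 1": (15.4, 1.1),
    "Lung 2": (25.0, 1.5),
    "Lung 3": (24.1, 0.4),
    "Kidney 1": (18.9, 6.8),
    "Kidney 2": (51.1, 19.1),
    "Kidney 3": (16.6, 3.8),
    "Kidney 4": (21.9, 21.5),
}


def fold_changes(pairs):
    """Return the tumor/NAT expression ratio for each matched pair."""
    return {name: tumor / nat for name, (tumor, nat) in pairs.items()}


def tumor_higher(pairs):
    """Return the names of pairs in which the tumor exceeds its NAT."""
    return [name for name, (tumor, nat) in pairs.items() if tumor > nat]


if __name__ == "__main__":
    for name, ratio in fold_changes(PAIRS).items():
        print(f"{name}: {ratio:.1f}-fold")
```

For the pairs listed, the tumor value exceeds the NAT value in every case, although the margin for the fourth kidney pair is small.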



H. CG100466-01: Adenine Nucleotide Translocator 2 (ANT 2) (ADP/ATP 
Translocase 2) 

Expression of gene CG100466-01 was assessed using the primer-probe set
Ag4177, described in Table HA. Results of the RTQ-PCR runs are shown in Tables HB,
HC and HD.

Table HA. Probe Name Ag4177

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tgtgggtaaagctgaagctg-3' | 20 | 452 | 224
Probe | TET-5'-aggcctctgtgactgcctggttaaga-3'-TAMRA | 26 | 485 | 225
Reverse | 5'-ttggtacaggcccttaatcc-3' | 20 | 526 | 226



Table HB. CNS_neurodegeneration_v1.0



Tissue Name


Rel. Exp.(%) Ag4177,
Run 215539616


Tissue Name 


Rel. Exp.(%) 
Ag4177, Run 
215539616 


AD 1 Hippo 


11.4 


Control (Path) 3 Temporal Ctx 


9.3 


AD 2 Hippo 


31.2 


Control (Path) 4 Temporal Ctx 


30.1 


AD 3 Hippo 


4.1 


AD 1 Occipital Ctx 


18.2 


AD 4 Hippo 


5.4 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


96.6 


AD 3 Occipital Ctx 


4.6 


AD 6 Hippo 


60.3 


AD 4 Occipital Ctx 


20.2 


Control 2 Hippo 


42.6 


AD 5 Occipital Ctx 


59.0 


Control 4 Hippo 


11.0 


AD 6 Occipital Ctx 


47.6 


Control (Path) 3 
Hippo 


5.4 


Control 1 Occipital Ctx 


3.7 


AD 1 Temporal 
Ctx 


11.2 


Control 2 Occipital Ctx 


100.0 


AD 2 Temporal 
Ctx 


28.1 


Control 3 Occipital Ctx 


16.7 
AD 3 Temporal
Ctx 


3.2 


Control 4 Occipital Ctx 


7.2 


AD 4 Temporal 
Ctx 


27.5 


Control (Path) 1 Occipital Ctx 


95.3 


AD 5 Inf 
Temporal Ctx 


93.3 


Control (Path) 2 Occipital Ctx 


8.6 


AD 5 Sup 
Temporal Ctx 


34.4 


Control (Path) 3 Occipital Ctx 


6.5 


AD 6 Inf 
Temporal Ctx 


42.9 


Control (Path) 4 Occipital Ctx 


15.3 


AD 6 Sup 
Temporal Ctx 


49.0 


Control 1 Parietal Ctx 


5.8 


Control 1 
Temporal Ctx 


5.5 


Control 2 Parietal Ctx 


31.6 


Control 2 
Temporal Ctx 


47.6 


Control 3 Parietal Ctx 


18.9 


Control 3 
Temporal Ctx 


16.6 


Control (Path) 1 Parietal Ctx 


81.2 


Control 3 
Temporal Ctx 


15.1


Control (Path) 2 Parietal Ctx


32.8 


Control (Path) 1 
Temporal Ctx 


44.8


Control (Path) 3 Parietal Ctx


5.0 


Control (Path) 2 
Temporal Ctx 


42.9


Control (Path) 4 Parietal Ctx


48.3



Table HC. General_screening_panel_v1.4



Tissue Name 


Rel. Exp.(%) Ag4177, 
Run 221036658 


Tissue Name 


Rel. Exp.(%) 
Ag4177, Run 
221036658 


Adipose 


2.4 


Renal ca. TK-10 


18.7 


Melanoma* 
Hs688(A).T 


5.2 


Bladder 


12.9 


Melanoma* 
Hs688(B).T 


6.5 


Gastric ca. (liver met.) NCI- 
N87 


17.8 


Melanoma* M14 


53.2 


Gastric ca. KATO III


100.0 


Melanoma* 
LOXIMVI 


33.7 


Colon ca. SW-948 


19.5 


Melanoma* SK-
MEL-5 


58.6 


Colon ca. SW480 


81.2 


Squamous cell 
carcinoma SCC-4 


21.5 


Colon ca.* (SW480 met) 
SW620 


57.0 


Testis Pool 


2.3 


Colon ca. HT29 


35.4 


Prostate ca.* 
(bone met) PC-3 


37.1 


Colon ca. HCT-116 


52.9 
Prostate Pool


1.5 


Colon ca. CaCo-2 


23.7 


Placenta 


3.4 


Colon cancer tissue 


27.0 


Uterus Pool 


0.6 


Colon ca. SW1116 


8.5 


Ovarian ca. 
OVCAR-3 


34.2 


Colon ca. Colo-205 


17.7 


Ovarian ca. SK-
OV-3


27.0 


Colon ca. SW-48 


19.8 


Ovarian ca.
OVCAR-4 


32.1 


Colon Pool 


3.3 


Ovarian ca. 
OVCAR-5 


19.3 


Small Intestine Pool 


1.9


Ovarian ca.
IGROV-1


19.2


Stomach Pool


2.8


Ovarian ca. 
OVCAR-8 


13.7 


Bone Marrow Pool 


1.0 


Ovary 


3.4 


Fetal Heart 


5.2 


Breast ca. MCF-7 


39.5 


Heart Pool 


1.7 


Breast ca. MDA- 
MB-231 


37.6 


Lymph Node Pool 


!! j 


Breast ca. BT 549


60.7 


Fetal Skeletal Muscle 


1.3


Breast ca. T47D


34.4 


Skeletal Muscle Pool 


1.2


Breast ca. MDA-
N 


65.1 


Spleen Pool 


2.5 


Breast Pool 


3.2 


Thymus Pool 


3.5 


Trachea 


4.7 


CNS cancer (glio/astro) U87- 
MG 


25.2 


Lung 


0.9 


CNS cancer (glio/astro) U-118-
MG 


26.2 


Fetal Lung 


4.8 


CNS cancer (neuro;met) SK-
N-AS


29.1 


Lung ca. NCI- 
N417 


13.8 


CNS cancer (astro) SF-539 


28.3 


Lung ca. LX-1 


46.3 


CNS cancer (astro) SNB-75 


43.8 


Lung ca. NCI- 
H146 


13.1 


CNS cancer (glio) SNB-19


15.7 


Lung ca. SHP-77 


28.5 


CNS cancer (glio) SF-295 


18.8 


Lung ca. A549 


48.3 


Brain (Amygdala) Pool 


3.7 


Lung ca. NCI- 
H526 


0.2 


Brain (cerebellum) 


7.3 


Lung ca. NCI- 
H23 


24.7 


Brain (fetal) 


3.2 


Lung ca. NCI-
H460


3/7


Brain (Hippocampus) Pool


3.5

Lung ca. HOP-62 


8.9 


Cerebral Cortex Pool 


5.1 


Lung ca. NCI- 
H522 


25.7 


Brain (Substantia nigra) Pool 


4.9 




2.0 


Brain (Thalamus) Pool 


6.4 


Fetal Liver 


10.2 


Brain (whole) 


4.1 


Liver ca. HepG2 


23.0 


Spinal Cord Pool 


2.9 


Kidney Pool 


3.2 


Adrenal Gland 


;■■> . . . 


Fetal Kidney 


4.5 


Pituitary gland Pool 


0.8 


Renal ca. 786-0 


17.8 


Salivary Gland 


4.0 


Renal ca. A498 


6.5 


Thyroid (female) 


2.1 


Renal ca. ACHN 


14.5 


Pancreatic ca. CAPAN2 


29.3 


Renal ca UO-31 


11.9 


Pancreas Pool 


4.8 



Table HD. Panel 4.1D



Tissue Name


Rel. Exp.(%) Ag4177,
Run 173333376


Tissue Name


Rel. Exp.(%)
Ag4177, Run
173333376


Secondary Th1 act


72.2 


HUVEC IL-1beta


54.3 


Secondary Th2 act 


51.8 


HUVEC IFN gamma


34.6 


Secondary Tr1 act


47.6 


HUVEC TNF alpha + IFN
gamma 


23.5 


Secondary Th1
rest 


17.2 


HUVEC TNF alpha + IL4


30.6 


Secondary Th2 
rest 


17.4 


HUVEC IL-11 


22.4 


Secondary Tr1 rest


17.8 


Lung Microvascular EC none 


39.0 


Primary Th1 act


52.1 


Lung Microvascular EC 
TNFalpha + IL-1beta


25.0 


Primary Th2 act 


60.3 


Microvascular Dermal EC 
none 


25.2 


Primary Tr1 act


80.1 


Microvascular Dermal EC
TNFalpha + IL-1beta


24.0 


Primary Th1 rest


25.5 


Bronchial epithelium 
TNFalpha + IL1beta


40.6 


Primary Th2 rest 


19.6 


Small airway epithelium none 


16.3 


Primary Tr1 rest


37.6 


Small airway epithelium 
TNFalpha + IL-1beta


44.8 


CD45RA CD4 
lymphocyte act 


61.1 


Coronary artery SMC rest


16.5 


CD45RO CD4 
lymphocyte act 


74.7 


Coronary artery SMC
TNFalpha + IL-1beta


16.6


CD8 lymphocyte
act 


96.6 


Astrocytes rest 


11.2 


Secondary CD8 
lymphocyte rest 


57.0 


Astrocytes TNFalpha + IL-1beta


11.6 


Secondary CD8 
lymphocyte act 


38.4 


KU-812 (Basophil) rest 


27.2 


CD4 lymphocyte 
none 


6.2 


KU-812 (Basophil) 
PMA/ionomycin 


41.2 


2ry
Th1/Th2/Tr1 anti-
CD95 CH11


25.0 


CCD1106 (Keratinocytes)
none 


52.1 


LAK cells rest 


31.6 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1beta


57.0 


LAK cells IL-2


38.4 


Liver cirrhosis 


7.0 


LAK cells IL-
2+IL-12


25.9 


NCI-H292 none 


56.6 


LAK cells IL- 
2+IFN gamma


32.8 


NCI-H292 IL-4 


61.1 


LAK cells IL-2+ 
IL-18 


36.3 


NCI-H292 IL-9 


100.0 


LAK cells 
PMA/ionomycin 


56.3 


NCI-H292 IL-13


51.1 


NK Cells IL-2 rest


45.4


NCI-H292 IFN gamma 


4.7 


Two Way MLR 3 
day 


25.0 


HPAEC none 


24.8 


Two Way MLR 5 
day 


49.3 


HPAEC TNF alpha + IL-1 beta 


29.7 


Two Way MLR 7 
day 


42.6 


Lung fibroblast none 


14.8 


PBMC rest 


11.0 


Lung fibroblast TNF alpha + 
IL-1 beta 


11.4 


PBMC PWM 


50.0 


Lung fibroblast IL-4 


17.2 


PBMC PHA-L 


54.3 


Lung fibroblast IL-9 


19.5 


Ramos (B cell) 
none 


67.4 


Lung fibroblast IL- 13 


15.3 


Ramos (B cell) 
ionomycin 


82.4 


Lung fibroblast IFN gamma 


21.2 


B lymphocytes 
PWM 


48.6 


Dermal fibroblast CCD 1070 
rest 


33.7 


B lymphocytes 
CD40L and IL-4 


54.0 


Dermal fibroblast CCD1070
TNF alpha 


57.4 


EOL-1 dbcAMP 


44.1 


Dermal fibroblast CCD 1070 
IL-1 beta 


25.2


EOL-1 dbcAMP
PMA/ionomycin 


31.9 


Dermal fibroblast IFN gamma 


20.3 


Dendritic cells 
none 


45.1 


Dermal fibroblast IL-4 


28.3 


Dendritic cells 
LPS 


31.2 


Dermal Fibroblasts rest 


15.1 


Dendritic cells 
anti-CD40 


36.1 


Neutrophils TNFa+LPS 


3.0 


Monocytes rest 


24.1 


Neutrophils rest 


4.3 


Monocytes LPS 


13.4 


Colon 


29.9 


Macrophages rest 


50.3 


Lung 


12.9 


Macrophages LPS 


15.4 


Thymus 


29.1 


HUVEC none 


32.8 


Kidney


51.4 


HUVEC starved 


37.9





expression of the CG100466-01 gene at low levels in the brains of an independent group
of individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
the treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4177 Highest expression of the
CG100466-01 gene is detected in a gastric cancer KATO III cell line (CT=22). High
expression of this gene is seen in a cluster of breast, ovarian, colon, gastric, renal, lung,
pancreatic, CNS, hepatic and prostate cancer cell lines and in melanoma cell lines. Thus,
therapeutic modulation of this gene product could be beneficial in the treatment of these
cancers.
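
As a rough illustration of how a CT value such as CT=22 can relate to the Rel. Exp.(%) columns in these tables, the sketch below assumes the common convention of normalizing to the sample with the lowest CT and assuming a perfect doubling of product per cycle. That convention, the function name `relative_expression`, and the two hypothetical samples are assumptions made for illustration; only the KATO III value of CT=22 is taken from the summary above.

```python
def relative_expression(ct_by_sample):
    """Convert CT values to percent expression relative to the lowest-CT sample.

    Assumes 100% amplification efficiency, so one cycle of difference
    corresponds to a two-fold difference in starting template.
    """
    if not ct_by_sample:
        raise ValueError("at least one sample is required")
    min_ct = min(ct_by_sample.values())
    return {
        name: 100.0 * 2.0 ** (min_ct - ct)
        for name, ct in ct_by_sample.items()
    }


# Only the KATO III CT comes from the text; the other two samples are hypothetical.
example_cts = {
    "Gastric ca. KATO III": 22.0,
    "hypothetical sample A": 23.0,
    "hypothetical sample B": 27.0,
}
example_rel = relative_expression(example_cts)

for sample, percent in example_rel.items():
    print(f"{sample}: {percent:.1f}")
```

Under these assumptions a sample one cycle later than the top sample reads as half its expression, which is why small CT differences translate into large differences in the percentage columns.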

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

The CG100466-01 gene codes for an adenine nucleotide translocator (ANT2)
homologue. Dysfunction of ANT2 has been shown to induce myopathies in
mouse and in humans (Fiore et al., 2001, Clin Chim Acta 311(2):125-35, PMID:
11566172). ANT has a role in mtDNA maintenance, and mutation in ANT has been
implicated in autosomal dominant progressive external ophthalmoplegia and other
mitochondrial diseases (Kaukonen et al., 2000, Science 289(5480):782-5, PMID:
10926541). Mice deficient in the heart/muscle-specific isoform ANT1 exhibit many
of the hallmarks of human oxidative phosphorylation (OXPHOS) disease, including a
dramatic proliferation of skeletal muscle mitochondria (Murdoch et al., 1999, J Biol Chem
274(20):14429-33, PMID: 10318868). Therefore, therapeutic modulation of the ANT
protein encoded by the CG100466-01 gene through the use of small molecule drugs could
be useful in the treatment of mitochondria-related diseases, including autosomal dominant
progressive external ophthalmoplegia, and cancers.
In addition, this gene is expressed at high levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4177 Highest expression of the CG100466-01 gene is
detected in the IL-9 treated NCI-H292 cell line (CT=25). This gene is expressed at high to
moderate levels in a wide range of cell types of significance in the immune response in
health and disease. These cells include members of the T-cell, B-cell, endothelial cell,
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial
and fibroblast cell types from lung and skin, and normal tissues represented by colon,
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene
product may be involved in homeostatic processes for these and other cell types and
tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4177 Results from one
experiment with the CG100466-01 gene are not included. The amp plot indicates that
there were experimental difficulties with this run.

I. CG100609-01: Glutathione S-Transferase

Expression of gene CG100609-01 was assessed using the primer-probe set
Ag4182, described in Table IA. Results of the RTQ-PCR runs are shown in Tables IB, IC
and ID.



Table IA. Probe Name Ag4182

Primers   Sequences                                     Length   Start Position   SEQ ID No
Forward   5'-ctctacatggacctgctgtca-3'                   21       73               227
Probe     TET-5'-ccgtgccgtctacatcttctcgaag-3'-TAMRA     25       102              228
Reverse   5'-agttgaactggatgtcatgctt-3'                  22       127              229
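
A simple consistency check on Table IA can be sketched as follows: each sequence should match its stated length, and the forward primer, probe and reverse primer should not overlap if the start positions are read along one strand. The helper name `check_oligos` and the reading of "Start Position" as a coordinate on a single strand are assumptions made for illustration, not something the document states.

```python
# Rows transcribed from Table IA:
# (name, sequence written 5'->3', stated length, stated start position).
TABLE_IA = [
    ("Forward", "ctctacatggacctgctgtca", 21, 73),
    ("Probe", "ccgtgccgtctacatcttctcgaag", 25, 102),
    ("Reverse", "agttgaactggatgtcatgctt", 22, 127),
]


def check_oligos(rows):
    """Return a list of problems found; an empty list means the rows are consistent."""
    problems = []
    previous_end = None  # first position after the previous oligo
    for name, seq, stated_length, start in rows:
        if set(seq) - set("acgt"):
            problems.append(f"{name}: unexpected characters in sequence")
        if len(seq) != stated_length:
            problems.append(f"{name}: length {len(seq)} != stated {stated_length}")
        if previous_end is not None and start < previous_end:
            problems.append(
                f"{name}: starts at {start}, before previous oligo ends at {previous_end}"
            )
        previous_end = start + stated_length
    return problems


print(check_oligos(TABLE_IA))
```

The same function can be pointed at the other primer-probe tables by swapping in their rows; a non-empty result would flag a transcription error in a sequence, length or position.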



Table IB. CNS_neurodegeneration_v1.0


Tissue Name


Rel. Exp.(%) Ag4182,
Run 215539692


Tissue Name


Rel. Exp.(%)
Ag4182, Run
215539692


AD 1 Hippo


0.0 


Control (Path) 3 Temporal 
Ctx 


0.0 


AD 2 Hippo 


0.0 


Control (Path) 4 Temporal
Ctx 


23.8 


AD 3 Hippo 


0.0 


AD 1 Occipital Ctx 


iO.O 


AD 4 Hippo 


0.0 


AD 2 Occipital Ctx 
(Missing) 




AD 5 hippo 


0.0 


AD 3 Occipital Ctx 


0.0 


AD 6 Hippo 


32.3 


AD 4 Occipital Ctx 


0.0 


Control 2 Hippo 


0.0 


AD 5 Occipital Ctx 


0.0 


Control 4 Hippo 


22.7 


AD 6 Occipital Ctx 


0.0 


Control (Path) 3 
Hippo 


0.0 


Control 1 Occipital Ctx 


0.0 


AD 1 Temporal 
Ctx 


78.5 


Control 2 Occipital Ctx 


0.0 


AD 2 Temporal 
Ctx 


0.0 


Control 3 Occipital Ctx 


0.0 


AD 3 Temporal 
Ctx 


0.0 


Control 4 Occipital Ctx 


22.2 


AD 4 Temporal 

Ctx 


63.3 


Control (Path) 1 Occipital 
Ctx 


25.0 


AD 5 Inf 
Temporal Ctx 


100.0


Control (Path) 2 Occipital 
Ctx 


0.0 


AD 5 Sup
Temporal Ctx


).0 


Control (Path) 3 Occipital 

Ctx


3.0


AD 6 Inf
Temporal Ctx 




Control (Path) 4 Occipital 
Ctx 


0.0 


AD 6 Sup 
Temporal Ctx 




Control 1 Parietal Ctx 


21.2 


Control 1 
Temporal Ctx 




Control 2 Parietal Ctx 


0.0 


Control 2 
Temporal Ctx 





Control 3 Parietal Ctx 


0.0 


Control 3 
Temporal Ctx 


0.0 


Control (Path) 1 Parietal 
Ctx 


0.0 


Control 4 
Temporal Ctx 


0.0 


Control (Path) 2 Parietal 
Ctx 


0.0 


Control (Path) 1 
Temporal Ctx 


0.0 


Control (Path) 3 Parietal 
Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


0.0 


Control (Path) 4 Parietal 
Ctx 


0.0 


Table IC. General_screening_panel_v1.4


Tissue Name


Rel. Exp.(%) Ag4182,
Run 221118504


Tissue Name


Rel. Exp.(%)
Ag4182, Run
221118504


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.6 


Melanoma*
Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI- 
N87 


0.8 


Melanoma* M14


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


1.5 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.*(SW480 met) 
SW620 


0.0 


Testis Pool 


100.0 


Colon ca. HT29 


0.0 


Prostate ca.* 
(bone met) PC-3 


0.0 


Colon ca. HCT-116


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


2.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116


0.0 


Ovarian ca. 
OVCAR-3 


9.1 


Colon ca. Colo-205


0.0 


Ovarian ca. SK- 
OV-3 


0.6 


Colon ca. SW-48 


0.0 






WO 03/010327 



PCT/US02/14199 



Ovarian ca. 
OVCAR-4 


0.6 


Colon Pool 


0.4 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.7 


Ovarian ca. 
IGROV-1


1.0 


Stomach Pool 


0.5 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.4 


Ovary 


0.0 


Fetal Heart 


0.0 




Breast ca. MCF-7 


0.6 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 




Breast ca. T47D 


1.0 


Skeletal Muscle Pool 


0.0 




Breast ca. MDA- 
N 


0.0 


Spleen Pool 


0.4 




Breast Pool 


0.5 


Thymus Pool 


0.0 




Trachea 


1.6


CNS cancer (glio/astro)
U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U- 
118-MG 


0.0 




Fetal Lung 


0.0 


CNS cancer (neuro;met)
SK-N-AS 


0.0 




Lung ca. NCI- 
N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


5.4 


CNS cancer (astro) SNB-75 


0.8 




Lung ca. NCI- 
H146 


0.0 


CNS cancer (glio) SNB-19


0.6 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295 


2.3 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI- 
H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI-
H23


0.0 


Brain (fetal) 


0.3 


Lung ca. NCI- 
H460 


0.0 


Brain (Hippocampus) Pool 


0.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0.7 


Lung ca. NCI- 

H522 


0.0


Brain (Substantia nigra) 
Pool 


0.8 


Liver 


3.0 


Brain (Thalamus) Pool


0.0 


Fetal Liver 


3.0 


Brain (whole) 


0.0


Liver ca. HepG2 


3.7 


Spinal Cord Pool 


0.0


Kidney Pool


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.7 


Salivary Gland 


0.8 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN


0.0 


Pancreatic ca. CAPAN2 


™ 


Renal ca. UO-31


0.0 


Pancreas Pool 





Table ID. Panel 4.1D



Tissue Name 


Rel. Exp.(%) Ag4182,
Run 173607859


Tissue Name 


Rel. Exp.(%) 
Ag4182, Run 
173607859 


Secondary Th1 act


0.0


HUVEC IL-1beta


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma


0.0 


Secondary Tr1 act


0.0 


HUVEC TNF alpha + IFN
gamma 


0.0 


Secondary Th1
rest 


0.0 


HUVEC TNF alpha + IL4


0.0


Secondary Th2 
rest 


0.0 


HUVEC IL-11


0.0


Secondary Tr1 rest


0.9


Lung Microvascular EC none 


0.0 


Primary Th1 act


0.9


Lung Microvascular EC 
TNFalpha + IL-1beta


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none


0.0


Primary Tr1 act


0.0 


Microvascular Dermal EC
TNFalpha + IL-1beta


0.0 


Primary Th1 rest


0.0 


Bronchial epithelium TNFalpha 
+ IL1beta


0.6 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Tr1 rest


0.0 


Small airway epithelium 
TNFalpha + IL-1beta


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronary artery SMC rest


0.0 


CD45RO CD4 
lymphocyte act 


0.0


Coronary artery SMC TNFalpha
+ IL-1beta


0.0 


CD8 lymphocyte 
act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL-1beta


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte 
none 


0.0 


KU-812 (Basophil)
PMA/ionomycin 


0.0 


2ry
Th1/Th2/Tr1 anti-
CD95 CH11


0.0


CCD1106 (Keratinocytes) none


0.0








LAK cells rest 


0.0 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


1.7 


LAK cells IL- 
2+IL-12 


0.0 


NCI-H292 none 


0.0 


LAK cells IL- 
2+IFN gamma


0.0 


NCI-H292 IL-4


0.0 


LAK cells IL-2+
IL-18 


0.0 


NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


0.5 


NCI-H292 IFN gamma


0.0 


Two Way MLR 3 
day 


0.0 


HPAEC none


0.0


Two Way MLR 5
day


0.0 


HPAEC TNF alpha + IL-1 beta 


0.0 


Two Way MLR 7
day


0.0 


Lung fibroblast none 


0.0 


PBMC rest


1.0 


Lung fibroblast TNF alpha + IL-
1 beta 


0.0 


PBMC PWM


0.8 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
none 


0.0 


Lung fibroblast IL-13 


1.5 


Ramos (B cell) 
ionomycin 




Lung fibroblast IFN gamma


0.0 


B lymphocytes 
PWM 




Dermal fibroblast CCD 1070 rest 


1.5 


B lymphocytes 
CD40L and IL-4 


1.0 


Dermal fibroblast CCD 1070 
TNF alpha 


0.0 


EOL-1 dbcAMP




Dermal fibroblast CCD 1070 IL- 
1 beta 


0.0 


EOL-1 dbcAMP
PMA/ionomycin 


3.0 


Dermal fibroblast IFN gamma 


3.0 


Dendritic cells
none 


J.0 


Dermal fibroblast IL-4 


3.0 


Dendritic cells
LPS 


).0 


Dermal Fibroblasts rest


).7 


Dendritic cells 
anti-CD40


•o J; 


Neutrophils TNFa+LPS 


.1 


Monocytes rest


.0


Neutrophils rest


.6


Macrophages rest



Macrophages LPS 



Lung 



Kidney 



W 

TooT" 



HUVEC starved 



CNS_neurodegeneration_v1.0 Summary: Ag4182 This panel confirms the
expression of the CG100609-01 gene at very low levels in the brain in an independent
group of individuals. This gene is found to be upregulated in the temporal cortex of
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or
function of this gene may decrease neuronal death and be of use in the treatment of this
disease.

General_screening_panel_v1.4 Summary: Ag4182 Highest expression of the
CG100609-01 gene is detected in testis (CT=29). Thus, expression of this gene can be
used to distinguish testis from other samples in this panel. In addition, therapeutic
modulation of this gene can be useful in the treatment of testis-related conditions such as
infertility and hypogonadism.
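
To make the phrase "distinguish testis from other samples" concrete, the sketch below ranks a handful of values transcribed from Table IC and reports how far the top sample stands above the runner-up. The choice of samples, the function name `top_sample_margin`, and the use of a simple ratio as the measure are assumptions made for illustration.

```python
# A subset of Rel. Exp.(%) values transcribed from Table IC (Ag4182).
TABLE_IC_SUBSET = {
    "Testis Pool": 100.0,
    "Ovarian ca. OVCAR-3": 9.1,
    "Lung ca. LX-1": 5.4,
    "Liver ca. HepG2": 3.7,
    "Colon ca. CaCo-2": 2.0,
}


def top_sample_margin(rel_exp):
    """Return (top sample name, ratio of top value to second-highest value)."""
    if len(rel_exp) < 2:
        raise ValueError("need at least two samples to compare")
    ranked = sorted(rel_exp.items(), key=lambda item: item[1], reverse=True)
    (top_name, top_value), (_, second_value) = ranked[0], ranked[1]
    # Guard against division by zero when every other sample is undetectable.
    ratio = float("inf") if second_value == 0 else top_value / second_value
    return top_name, ratio


name, ratio = top_sample_margin(TABLE_IC_SUBSET)
print(f"{name} exceeds the next sample by a factor of {ratio:.1f}")
```

A large ratio is one way to express that a single sample dominates the panel; it says nothing about absolute abundance, which the CT value carries.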

Low levels of expression of this gene are also associated with CNS cancer, colon
cancer, lung cancer, and ovarian cancer cell lines. Therefore, therapeutic modulation of
this gene can be beneficial in the treatment of these cancers.

Panel 4.1D Summary: Ag4182 Highest expression of the CG100609-01 gene is 
detected exclusively in kidney (CT=30.6). In addition low expression of this gene is also 
seen in thymus. Thus, expression of this gene can be used to distinguish these two samples 
from other samples in this panel. Furthermore, small molecule therapies designed with the 
protein encoded for by this gene could modulate kidney function and be important in the 
treatment of inflammatory or autoimmune diseases that affect the kidney, including lupus 
and glomerulonephritis.

Panel CNS_1 Summary: Ag4182 Expression of the CG100609-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4182 Expression of the
CG100609-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel
(data not shown).



J. CG100710-01: AAA (ATPase Associated with Various Activities)

Expression of gene CG100710-01 was assessed using the primer-probe set
Ag4249, described in Table JA.

Table JA. Probe Name Ag4249

Primers   Sequences                                 Length   Start Position   SEQ ID No
Forward   5'-cacaggagcggctgtca-3'                   17       213              230
Probe     TET-5'-tgccagccctgagcaagtgcc-3'-TAMRA     21       235              231
Reverse   5'-ctgcagagcacagcactca-3'                 19       262



CNS_neurodegeneration_v1.0 Summary: Ag4249 Expression of the
CG100710-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel
(data not shown).

General_screening_panel_v1.4 Summary: Ag4249 Expression of the
CG100710-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel
(data not shown).

Panel 4.1D Summary: Ag4249 Expression of the CG100710-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

General oncology screening panel_v_2.4 Summary: Ag4249 Expression of the
CG100710-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel
(data not shown).

K. CG100730-01: Exoribonuclease 

Expression of gene CG100730-01 was assessed using the primer-probe set 
Ag4187, described in Table KA. Results of the RTQ-PCR runs are shown in Tables KB, 
KC, KD and KE. 

Table KA. Probe Name Ag4187

Primers   Sequences                                      Length   Start Position   SEQ ID No
Forward   5'-gcaaatggctctgctgtaatac-3'                   22       201              233
Probe     TET-5'-taatggccacagccgtcagtaaaaca-3'-TAMRA     26       241              234
Reverse   5'-taaactgggaaggggaaggt-3'                     20       269              235



Table KB. CNS_neurodegeneration_v1.0



Tissue Name 


Rel. Exp.(%) Ag4187,
Run 215539701 


Tissue Name 


Rel. Exp.(%) 
Ag4187, Run 
215539701 


AD 1 Hippo 


0.0 


Control (Path) 3 Temporal 
Ctx 


7.5


AD 2 Hippo


0.0 


Control (Path) 4 Temporal 
Ctx 


0.4 


AD 3 Hippo 


0.0 


AD 1 Occipital Ctx 


0.0 


AD 4 Hippo 




AD 2 Occipital Ctx (Missing) 


100.0 


AD 5 Hippo 


1.9 


AD 3 Occipital Ctx 


0.0 


AD 6 Hippo 


3.0 


AD 4 Occipital Ctx 


1.3 


Control 2 Hippo 


0.0 


AD 5 Occipital Ctx 


1.1 


Control 4 Hippo 


0.0 


AD 6 Occipital Ctx 


2.6 


Control (Path) 3 
Hippo 


0.3 


Control 1 Occipital Ctx 


0.0 


AD 1 Temporal
Ctx


0.0 


Control 2 Occipital Ctx 


0.0 


AD 2 Temporal
Ctx


0.0 


Control 3 Occipital Ctx 


0.0 


AD 3 Temporal
Ctx


0.0 


Control 4 Occipital Ctx 


0.0 


AD 4 Temporal
Ctx


0.9 


Control (Path) 1 Occipital Ctx 


1.2 


AD 5 Inf 
Temporal Ctx 


2.9 


Control (Path) 2 Occipital Ctx 


0.0 


AD 5 Sup 
Temporal Ctx 


2.1 


Control (Path) 3 Occipital Ctx 


0.0 


AD 6 Inf 
Temporal Ctx 


9.9 


Control (Path) 4 Occipital Ctx 


1.0 


AD 6 Sup 
Temporal Ctx 


7.5 


Control 1 Parietal Ctx 


0.0 


Control 1 
Temporal Ctx 


0.0 


Control 2 Parietal Ctx 


12.8 


Control 2 
Temporal Ctx 


0.0 


Control 3 Parietal Ctx 


0.0 


Control 3 
Temporal Ctx 


0.0 


Control (Path) 1 Parietal Ctx 


0.1 


Control 4
Temporal Ctx 


0.0 


Control (Path) 2 Parietal Ctx 


5.8 


Control (Path) 1 
Temporal Ctx 


0.0 


Control (Path) 3 Parietal Ctx 


6.7 


Control (Path) 2 
Temporal Ctx 


0.0 


Control (Path) 4 Parietal Ctx 


0.4 


Table KC. General_screening_panel_v1.4


Tissue Name 


Rel. Exp.(%) Ag4187, 
Run 221154082 


Tissue Name 


Rel. Exp.(%) 
Ag4187, Run 
221154082


Adipose


13.2 


Renal ca. TK-10 


41.5 


Melanoma* 
Hs688(A).T 


2.2 


Bladder 


16.4 


Melanoma* 
Hs688(B).T 


5.7 


Gastric ca. (liver met.) NCI- 
N87 


0.0 


Melanoma* M14


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5


4.4 


Colon ca. SW480


0.0 


Squamous cell
carcinoma SCC-4


0.0 


Colon ca.* (SW480 met)
SW620 


9.7 


Testis Pool 


10.7 


Colon ca. HT29 


: o 


Prostate ca.* 
(bone met) PC-3 


9.9 


Colon ca. HCT-116 


5.2 


Prostate Pool 


19.8 


Colon ca. CaCo-2 


0.0 


Placenta


4.1 


Colon cancer tissue 


18.6 


Uterus Pool


6.1 


Colon ca. SW1116 


1.0


Ovarian ca.
OVCAR-3


6.9 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-
OV-3 


31.0 


Colon ca. SW-48 


0.0


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


9.6 


Ovarian ca. 
OVCAR-5 


55.1 


Small Intestine Pool 




Ovarian ca. 
IGROV-1 


1.6 


Stomach Pool 


21.9 


Ovarian ca. 
OVCAR-8 


12.9 


Bone Marrow Pool 


0.8 


Ovary 


10.5 


Fetal Heart 


0.0 


Breast ca. MCF-7


17.3 


Heart Pool 


14.7 


Breast ca. MDA- 
MB-231 


9.1 


Lymph Node Pool 


16.3 


Breast ca. BT 549 


24.8 


Fetal Skeletal Muscle 


7.2 


Breast ca. T47D


75.8 


Skeletal Muscle Pool 


24.3 


Breast ca. MDA- 
N 


0.8 


Spleen Pool 


26.4 


Breast Pool 


23.3 


Thymus Pool 


26.2 


Trachea 


25.2 


CNS cancer (glio/astro) U87- 
MG 


8.5 


Lung 


3.4 


CNS cancer (glio/astro) U-
118-MG


3.7




Fetal Lung 


0.0 


CNS cancer (neuro;met) SK- 
N-AS 


0.7 


Lung ca. NCI- 
N417


0.0 


CNS cancer (astro) SF-539 


4.9 


Lung ca. LX-1 


29.3 


CNS cancer (astro) SNB-75


100.0 


Lung ca. NCI-
H146 


0.0 


CNS cancer (glio) SNB-19 


5.8 


Lung ca. SHP-77 


2.9 


CNS cancer (glio) SF-295 


76.3 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


13.8 


Lung ca. NCI- 
H526 


1.0 


Brain (cerebellum) 


0.0


Lung ca. NCI- 
H23 


0.0 


Brain (fetal) 


49.7


Lung ca. NCI- 
H460 


41.8 


Brain (Hippocampus) Pool 


36.9 


Lung ca. HOP-62


2 „_ - 


Cerebral Cortex Pool 


18.9


Lung ca. NCI-
H522


2.4 


Brain (Substantia nigra) Pool 


7.7 


Liver 


0.0 


Brain (Thalamus) Pool 


34.2 


Fetal Liver 


3.7 


Brain (whole) 


15.8 


Liver ca. HepG2 


4.7 


Spinal Cord Pool 


15.5 


Kidney Pool 


45.7 


Adrenal Gland 


2.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


1.9 


Renal ca. 786-0 


2.3 


Salivary Gland 


1.3 


Renal ca. A498 


3.6 


Thyroid (female) 


2.5 


Renal ca. ACHN 


21.0 


Pancreatic ca. CAPAN2 


92.7 


Renal ca. UO-31


3.9 


Pancreas Pool 


10.8 



Table KD. Panel 4.1D



Tissue Name 


Rel. Exp.(%) Ag4187,
Run 182086758 


Tissue Name 


Rel. Exp.(%) 
Ag4187, Run 
182086758 


Secondary Th1 act


2.0 


HUVEC IL-1beta


0.0 


Secondary Th2 act 


1.8 


HUVEC IFN gamma


2.4 


Secondary Tr1 act


3.0 


HUVEC TNF alpha + IFN gamma


0.5 


Secondary Th1
rest 


3.1 


HUVEC TNF alpha + IL4


0.2 


Secondary Th2 
rest 


0.0 


HUVEC IL-11


4.3 


Secondary Tr1 rest


0.0 


Lung Microvascular EC none 


0.4 


Primary Th1 act


0.0 


Lung Microvascular EC TNFalpha
+ IL-1beta


1.8




Primary Th2 act 


0.0 


Microvascular Dermal EC none


1.8 


Primary Tr1 act


0.0 


Microvascular Dermal EC
TNFalpha + IL-1beta


3.6 


Primary Th1 rest


7.2 


Bronchial epithelium TNFalpha + 
IL1beta


1.7 


Primary Th2 rest 


4.0 


Small airway epithelium none 


0.0 


Primary Tr1 rest


5.5 


Small airway epithelium 
TNFalpha + IL-1beta




CD45RA CD4
lymphocyte act 


4.8 


Coronary artery SMC rest


0.4 


CD45RO CD4
lymphocyte act 


7.6 


Coronary artery SMC TNFalpha +
IL-1beta


0.0 


CD8 lymphocyte
act 


6.7 


Astrocytes rest 


0.8 


Secondary CD8 
lymphocyte rest 


3.4 


Astrocytes TNFalpha + IL-1beta


0.0 


Secondary CD8 
lymphocyte act 


5.3 


KU-812 (Basophil) rest 


1.0 


CD4 lymphocyte 
none 


6.6 


KU-812 (Basophil)
PMA/ionomycin 


16.4 


2ry 

Th1/Th2/Tr1 anti-
CD95 CH11 


4.5 


CCD1106 (Keratinocytes) none


2.3 


LAK cells rest 


10.4 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


1.9 


LAK cells IL-2 


5.0 


Liver cirrhosis 


0.7 


LAK cells IL- 
2+IL-12 


3.1 


NCI-H292 none


0.0 


LAK cells IL- 
2+IFN gamma 


6.7 


NCI-H292 IL-4 


0.0 


LAK cells IL-2+ 
IL-18 


7.7 


NCI-H292 IL-9 


0.2 


LAK cells 
PMA/ionomycin 


3.8 


NCI-H292 IL-13 


0.5 


NK Cells IL-2 rest 


3.4 


NCI-H292 IFN gamma


0.4 


Two Way MLR 3
day 


5.9 


HPAEC none 


.4 


Two Way MLR 5 
day 


• 1 


HPAEC TNF alpha + IL-1 beta 


3.2 


Two Way MLR 7 
day


.5 i " 


Lung fibroblast none


7.0 


PBMC rest 


■1 


Lung fibroblast TNF alpha + IL-1
beta


.8




PBMC PWM 


4.1 


Lung fibroblast IL-4 


1.7 


PBMC PHA-L 


3.3 


Lung fibroblast IL-9 


2.S 


Ramos (B cell) 
none 


2.8 


Lung fibroblast IL-13


0.3 


Ramos (B cell) 
ionomycin


0.3 


Lung fibroblast IFN gamma


0.5 


B lymphocytes
PWM


0.0


Dermal fibroblast CCD 1070 rest 


3.4 


B lymphocytes 
CD40L and IL-4 


0.3 


Dermal fibroblast CCD1070 TNF 
alpha 


6.0 


EOL-1 dbcAMP


9.9 


Dermal fibroblast CCD1070 IL-1
beta 


0.0 


EOL-1 dbcAMP
PMA/ionomycin


2.7 


Dermal fibroblast IFN gamma 


0.9 


Dendritic cells
none


10.7 


Dermal fibroblast IL-4 


1.9 


Dendritic cells
LPS


4.7


Dermal Fibroblasts rest 


1.1 


Dendritic cells 
anti-CD40

Monocytes rest


8.3 


Neutrophils TNFa+LPS 


4.3 


19.2™ 


Neutrophils rest 


7.4 


Monocytes LPS


2.7


Colon 


0.0 


Macrophages rest 


8.5 


Lung 


3.7 


Macrophages LPS 


3.0 


Thymus 


17.9 


HUVEC none 


18 


Kidney


100.0


HUVEC starved 


.0 







Table KE. General oncology screening panel_v_2.4


Tissue Name


Rel. Exp.(%) Ag4187, 
Run 268689531 


Tissue Name 


Rel. Exp.(%) 
Ag4187, Run 
268689531 


Colon cancer 1 


7.8 


Bladder cancer NAT 2 


1.5 


Colon cancer NAT 
1 


0.0 


Bladder cancer NAT 3 


2.4 


Colon cancer 2 


0.0 


Bladder cancer NAT 4 


1.6 


Colon cancer NAT 

2 


0.0 


Adenocarcinoma of the 
prostate 1 


86.5 


Colon cancer 3 


0.0 


Adenocarcinoma of the 
prostate 2 


2.1 


Colon cancer NAT 
3 


8.8 


Adenocarcinoma of the 
prostate 3 


0.0 


Colon malignant
cancer 4


2.3


Adenocarcinoma of the
prostate 4


6.2




Colon normal 
adjacent tissue 4 


0.0 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


0.0 


Adenocarcinoma of the 
prostate 6 


15.2 


Lung NAT 1 


0.0 


Adenocarcinoma of the 
prostate 7 


14.7 


Lung cancer 2 


100.0 


Adenocarcinoma of the 
prostate 8 


1.9 


Lung NAT 2 


3.9 


Adenocarcinoma of the 
prostate 9 


18.0 


Squamous cell 
carcinoma 3 


0.0 


Prostate cancer NAT 10


0.0 


Lung NAT 3 


0.0 


Kidney cancer 1


3.3 


metastatic 
melanoma 1 


26.8 


Kidney NAT 1


13.4 


Melanoma 2


0.0


Kidney cancer 2 


71.2 


Melanoma 3


6.8


Kidney NAT 2


23.7


metastatic
melanoma 4




Kidney cancer 3


79.0


metastatic
melanoma 5




Kidney NAT 3


7.5


Bladder cancer 1


0.0


Kidney cancer 4


0.0


Bladder cancer 
NAT 1 


0.0 


Kidney NAT 4 


3.0 


Bladder cancer 2


3.5 







CNS_neurodegeneration_v1.0 Summary: Ag4187 Results from one experiment
with the CG100730-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.

General_screening_panel_v1.4 Summary: Ag4187 Highest expression of the
CG100730-01 gene is detected in the CNS cancer SNB-75 cell line (CT=32). In addition, high
expression of this gene is seen in pancreatic, CNS, renal, lung, breast and
ovarian cancer cell lines. Therefore, therapeutic modulation of this gene through the use of
small molecule drugs can be useful in the treatment of these cancers.

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, and cerebral cortex. Therefore, this gene may play a role in central nervous 
system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple 
sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4187 Highest expression of the CG100730-01 gene is
detected in kidney (CT=32.7). Thus, expression of this gene can be used to distinguish 
kidney samples from other samples in this panel. Furthermore, small molecule therapies 
designed with the protein encoded for by this gene could modulate kidney function and be 
important in the treatment of inflammatory or autoimmune diseases that affect the kidney, 
including lupus and glomerulonephritis. 

In addition, low expression of this gene is seen in resting primary Th1 and Tr1
cells, activated CD45RO CD4 lymphocytes, 3 day Two Way MLR, eosinophils,
monocytes, macrophages, dendritic cells, ionomycin-treated basophils and thymus.
Therefore, therapeutic modulation of this gene may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4187 Highest
expression of the CG100730-01 gene is detected in a lung cancer sample (CT=33.7), with
significant expression also seen in metastatic melanoma, lung, prostate and kidney
cancers. In addition, expression of this gene is higher in the cancers than in the normal
adjacent tissue. Therefore, expression of this gene could be used as a marker to detect the
presence of these cancers. Furthermore, therapeutic modulation of the expression or
function of this gene may be effective in the treatment of these cancers.

L. CG100819-01: Polynucleotide Phosphorylase 

Expression of gene CG100819-01 was assessed using the primer-probe set 
Ag4195, described in Table LA. Results of the RTQ-PCR runs are shown in Tables LB, 
LC and LD. 



Table LA. Probe Name Ag4195 

Primers   Sequences                                        Length   Start Position   SEQ ID No 
Forward   5'-atcgttcaattagaccgctctt-3'                     22       430              236 
Probe     TET-5'-tccagctggctacttctatgatacacagg-3'-TAMRA    29       452              237 
Reverse   5'-acatcaggctcatttacaccat-3'                     22       505              238 
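
The entries of Table LA can be cross-checked mechanically. The sketch below is illustrative only: it confirms that each listed sequence has the stated length and derives the span covered by the primer pair. It assumes that each start position is the 5'-most coordinate of the target sequence covered by the oligonucleotide, a convention the table itself does not state; the names `AG4195`, `check_lengths` and `amplicon_span` are hypothetical.

```python
# Primer-probe set Ag4195 as listed in Table LA:
# name -> (sequence, stated length, stated start position).
AG4195 = {
    "forward": ("atcgttcaattagaccgctctt", 22, 430),
    "probe": ("tccagctggctacttctatgatacacagg", 29, 452),
    "reverse": ("acatcaggctcatttacaccat", 22, 505),
}


def check_lengths(oligos):
    """Return the names of oligos whose sequence length differs from the stated length."""
    return [name for name, (seq, length, _start) in oligos.items() if len(seq) != length]


def amplicon_span(oligos):
    """Return (first, last, size) of the region bounded by the primer pair.

    Assumption: each start position is the 5'-most target coordinate
    covered by the oligo, so the reverse primer ends at start + length - 1.
    """
    _seq, _length, fwd_start = oligos["forward"]
    _seq, rev_length, rev_start = oligos["reverse"]
    last = rev_start + rev_length - 1
    return fwd_start, last, last - fwd_start + 1


print(check_lengths(AG4195))   # [] when every stated length matches its sequence
print(amplicon_span(AG4195))   # (430, 526, 97) under the stated assumption
```

Under the same assumption the forward primer covers positions 430 to 451, so the probe (start 452) begins immediately after it.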



Table LB. General_screening_panel_v1.4 



Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
221157738 


Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
221157738 


Adipose 


3.4 


Renal ca. TK-10 


16.8 


Melanoma* 
Hs688(A).T 


5.7 


Bladder 


13.3 


Melanoma* 
Hs688(B).T 


5.6 


Gastric ca. (liver met.) NCI- 
N87 


100.0 


Melanoma* M14 


16.8 


Gastric ca. KATO III 


58.2 


Melanoma* 
LOXIMVI 


25.3 


Colon ca. SW-948 


7.1 


Melanoma* SK- 
MEL-5 


25.9 


Colon ca. SW480 


35.8 


Squamous cell 
carcinoma SCC-4 


10.1 


Colon ca.* (SW480 met) 
SW620 


19.2 


Testis Pool 


5.1 


Colon ca. HT29 


9.0 


Prostate ca.* (bone 
met) PC-3 


20.4 


Colon ca. HCT-116 


27.5 


Prostate Pool \2. 1 


Colon ca. CaCo-2 


30.8 


Placenta J0.6 


Colon cancer tissue 




Uterus Pool '23 


Colon ca. SW1116 


3.0 


Ovarian ca. 
OVCAR-3 


10.6 


Colon ca. Colo-205 


4.7 


Ovarian ca. SK- 
OV-3 


17.6 


Colon ca. SW-48 


7.3 


Ovarian ca. 
OVCAR-4 


5.8 


Colon Pool 


4.9 


Ovarian ca. 
OVCAR-5 


12.8 


Small Intestine Pool 


3.2 


Ovarian ca. 
IGROV-1 


5.8 


Stomach Pool 


2.7 


Ovarian ca. 
OVCAR-8 


3.1 


Bone Marrow Pool 


1.8 


Ovary 


2.5 


Fetal Heart 


3.7 


Breast ca. MCF-7 


14.3 


Heart Pool 


3.4 


Breast ca. MDA- 
MB-231 


5.1 


Lymph Node Pool 


6.3 


Breast ca. BT 549 


26.8 


Fetal Skeletal Muscle 


1.8 


Breast ca. T47D 


21.0 


Skeletal Muscle Pool 


7.7 


Breast ca. MDA-N 


1.3 


Spleen Pool 


5.0 


Breast Pool 


5.6 


Thymus Pool 


1.4 


Trachea 


• 1 


CNS cancer (glio/astro) U87-MG 


4.0 




Lung 


0.9 


CNS cancer (glio/astro) U-118-MG 


21.0 


Fetal Lung 




CNS cancer (neuro;met) SK-N-AS 


22.1 


Lung ca. NCI-N417 


3.5 


CNS cancer (astro) SF-539 


3.1 


Lung ca. LX-1 


20.0 


CNS cancer (astro) SNB-75 


12.4 


Lung ca. NCI-H146 


6.7 


CNS cancer (glio) SNB-19 


5.3 


Lung ca. SHP-77 


39.0 


CNS cancer (glio) SF-295 


20.4 


Lung ca. A549 


23.0 


Brain (Amygdala) Pool 


3.9 


Lung ca. NCI-H526 


4.4 


Brain (cerebellum) 


2.2 


Lung ca. NCI-H23 


21.3 


Brain (fetal) 


3.8 


Lung ca. NCI-H460 


16.8 


Brain (Hippocampus) Pool 


3.7 


Lung ca. HOP-62 


6.5 


Cerebral Cortex Pool 


5.0 


Lung ca. NCI-H522 


12.3 


Brain (Substantia nigra) Pool 


3.4 


Liver 


0.4 


Brain (Thalamus) Pool 


7.2 


Fetal Liver 


6.3 


Brain (whole) 


3.5 


Liver ca. HepG2 


8.2 


Spinal Cord Pool 


4.2 


Kidney Pool !6.2 


Adrenal Gland 


2.8 


Fetal Kidney 


5.4 


Pituitary gland Pool 


1.3 


Renal ca. 786-0 


5.3 


Salivary Gland 


0.6 


Renal ca. A498 


3.1 


Thyroid (female) 


1.8 


Renal ca. ACHN 


5.4 


Pancreatic ca. CAPAN2 


8.5 


Renal ca. UO-31 


>.9 


Pancreas Pool 


5.4 



Table LC. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
174255656 


Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
174255656 


Secondary Th1 act 


54.3 


HUVEC IL-1beta 


20. < 


Secondary Th2 act 


100.0 


HUVEC IFN gamma 


0.0 


Secondary Tr1 act 


28.7 


HUVEC TNF alpha + IFN 
gamma 


26.4 


Secondary Th1 rest 




HUVEC TNF alpha + IL4 


15.6 


Secondary Th2 rest 


6.6 


HUVEC IL-11 


11.4 


Secondary Tr1 rest 


7.5 


Lung Microvascular EC none 


20.7 


Primary Th1 act 


57.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


15.9 


Primary Th2 act 


47.0 


Microvascular Dermal EC none 


19.8 


Primary Tr1 act 


51.4 


Microvascular Dermal EC 
TNFalpha + IL-1 beta 


12.0 




Primary Th1 rest 


5.2 


Bronchial epithelium TNFalpha 
+ IL-1beta 


10.5 


Primary Th2 rest 


3.0 


Small airway epithelium none 


5.7 


Primary Tr1 rest 


5.4 


Small airway epithelium 
TNFalpha + IL-1 beta 


12.4 


CD45RA CD4 
lymphocyte act 


46.7 


Coronary artery SMC rest 


11.7 


CD45RO CD4 
lymphocyte act 


62.0 


Coronary artery SMC TNFalpha 
+ IL-1beta 


10.0 


CD8 lymphocyte act 


37.1 


Astrocytes rest 


4.8 


Secondary CD8 
lymphocyte rest 


41.2 


Astrocytes TNFalpha + IL-1 beta 


6.1 


Secondary CD8 
lymphocyte act 


14.8 


KU-812 (Basophil) rest 


29.1 


CD4 lymphocyte 
none 


4.0 


KU-812 (Basophil) 
PMA/ionomycin 


49.3 


|2ry f " " " " 
jThl/Th2/Trl anti- ;S.l 
jCD95 CHI 1 


CCD1106 (Keratinocytes) none 


24.0 


LAK cells rest 1 13.3 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1 beta 


24.5 


LAK cells IL-2 


25.5 


Liver cirrhosis 


2.3 


LAK cells IL-2+IL- 
12 


28.9 


NCI-H292 none 


13.1 


LAK cells IL-2+IFN 
gamma 


25.3 


NCI-H292 IL-4 


29.5 


LAK cells IL-2+ IL- 
18 


33.4 


NCI-H292 IL-9 


30.1 


LAK cells 
PMA/ionomycin 


31.2 


NCI-H292 IL-13 


31.6 


NK Cells IL-2 rest 


24.7 


NCI-H292 IFN gamma 


47.6 


Two Way MLR 3 
day 


15.5 


HPAEC none 


14.2 


Two Way MLR 5 
day 


24.1 


HPAEC TNF alpha + IL-1 beta 


27.9 


Two Way MLR 7 
day 


13.5 


Lung fibroblast none 


13.1 


PBMC rest 


4.6 


Lung fibroblast TNF alpha + IL- 
1 beta 


22.5 


PBMC PWM 


29.5 


Lung fibroblast IL-4 


16.2 


PBMC PHA-L 


18.4 


Lung fibroblast IL-9 


21.2 


Ramos (B cell) none 


50.7 


Lung fibroblast IL-13 


15.8 


Ramos (B cell) 
ionomycin 


56.3 


Lung fibroblast IFN gamma 


20.2 


B lymphocytes 
PWM 


37.4 


Dermal fibroblast CCD1070 rest 


23.7 


B lymphocytes 
CD40L and IL-4 


13.7 


Dermal fibroblast CCD 1070 
TNF alpha 


26.8 


EOL-1 dbcAMP 


22.7 


Dermal fibroblast CCD1070 IL- 
1 beta 




EOL-1 dbcAMP 
PMA/ionomycin 


7.6 


Dermal Fibroblast IFN gamma 


7.0 


Dendritic cells none 


7.6 


Dermal fibroblast IL-4 


10.6 


Dendritic cells LPS 


27.9 


Dermal Fibroblasts rest 


5.7 


Dendritic cells anti- 
CD40 


10.5 


Neutrophils TNFa+LPS 


0.9 


Monocytes rest 


8.2 


Neutrophils rest 


2.5 


Monocytes LPS 


41.5 


Colon 


3.1 


Macrophages rest 


9.9 


Lung 


6.8 


Macrophages LPS 


30.4 


Thymus 


10.4 


HUVEC none 


12.5 


Kidney 


7.4 


HUVEC starved 


14.0 





Table LD. General oncology screening panel_v_2.4 



Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
268689535 


Tissue Name 


Rel. Exp.(%) 
Ag4195, Run 
268689535 


Colon cancer 1 


23.2 


Bladder cancer NAT 2 


0.7 


Colon cancer NAT 1 


7.3 


Bladder cancer NAT 3 


1.6 


Colon cancer 2 


45.4 


Bladder cancer NAT 4 


4.0 


Colon cancer NAT 2 


19.6 


Adenocarcinoma of the prostate 
1 


26.2 


Colon cancer 3 


95.9 


Adenocarcinoma of the prostate 
2 


2.0 


Colon cancer NAT 3 


18.7 


Adenocarcinoma of the prostate 
3 


Colon malignant cancer 4 


72.7 


Adenocarcinoma of the prostate 
4 


30.6 


Colon normal adjacent tissue 4 


8.0 


Prostate cancer NAT 5 


3.3 


Lung cancer 1 


18.0 


Adenocarcinoma of the prostate 
6 


4.6 


Lung NAT 1 


2.2 


Adenocarcinoma of the prostate 
7 


5.8 


Lung cancer 2 


100.0 


Adenocarcinoma of the prostate 
8 


1.8 


Lung NAT 2 


3.4 


Adenocarcinoma of the prostate 
9 


18.3 


Squamous cell carcinoma 3 


39.2 


Prostate cancer NAT 10 


2.1 


Lung NAT 3 


0.6 


Kidney cancer 1 


17.1 


metastatic melanoma 1 


22.7 


Kidney NAT 1 


7.7 


Melanoma 2 


2.0 


Kidney cancer 2 


38.4 


Melanoma 3 


2.5 


Kidney NAT 2 


15.5 


metastatic melanoma 4 


31.0 


Kidney cancer 3 


10.2 


metastatic melanoma 5 


44.4 


Kidney NAT 3 


7.2 


Bladder cancer 1 


2.5 


Kidney cancer 4 


18.7 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


7.7 


Bladder cancer 2 


3.8 







CNS_neurodegeneration_v1.0 Summary: Ag4195 Results from one experiment 
with the CG100819-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 



General_screening_panel_v1.4 Summary: Ag4195 Highest expression of the 
CG100819-01 gene is detected in the gastric cancer NCI-N87 cell line (CT=25.3). High 
expression of this gene is seen in a cluster of CNS, colon, gastric, renal, lung, breast, 
ovarian, pancreatic, prostate, squamous cell carcinoma and melanoma cancer cell lines. 
Therefore, therapeutic modulation of this gene could be beneficial in the treatment of these 
cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

Interestingly, expression of this gene is higher in fetal lung and liver (CTs=29) as 
compared to the corresponding adult tissues (CTs=32-33). Therefore, expression of this 
gene could be useful in distinguishing the fetal lung and liver from the corresponding adult 
tissues. 
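
The CT gap quoted above can be turned into an approximate fold difference. The following is a minimal sketch, assuming ideal amplification in which the product doubles every cycle (an efficiency of 2.0 per cycle, which the text does not state); `fold_change` is a hypothetical helper name.

```python
def fold_change(ct_a, ct_b, efficiency=2.0):
    """Approximate fold excess of sample A over sample B from their CT values.

    Assumes the amount of product is multiplied by `efficiency` each cycle,
    so a sample that crosses the threshold earlier (lower CT) started with more template.
    """
    return efficiency ** (ct_b - ct_a)


# Fetal lung and liver (CT about 29) against adult tissue (CT about 32 to 33).
print(fold_change(29, 32))  # 8.0
print(fold_change(29, 33))  # 16.0
```

Under this assumption, the reported CTs correspond to roughly 8- to 16-fold higher expression in the fetal tissues than in the adult tissues.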

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4195 Highest expression of the CG100819-01 gene is 
detected in activated secondary Th2 cells (CT=27). This gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell families, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_v1.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4195 Highest 
expression of the CG100819-01 gene is detected in lung cancer (CT=28.7), with 
significant expression also seen in metastatic melanoma, colon, lung, prostate and kidney 
cancers. In addition, expression of this gene is higher in the cancers than in the normal 
adjacent tissue. Therefore, expression of this gene could be used as a marker to detect the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 
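
The comparison between cancers and normal adjacent tissue (NAT) can be illustrated with the colon, lung and kidney values listed for Ag4195, Run 268689535. This is a rough sketch only: it assumes that each numbered cancer sample is matched to the NAT sample bearing the same number, which the table layout suggests but does not state, and the names `PAIRS` and `tumor_to_nat_ratios` are hypothetical.

```python
# Relative expression (%) copied from the Ag4195, Run 268689535 table.
# Each entry is (cancer sample, normal adjacent tissue); pairing by sample
# number is an assumption made for this illustration.
PAIRS = {
    "Colon 1": (23.2, 7.3),
    "Colon 2": (45.4, 19.6),
    "Colon 3": (95.9, 18.7),
    "Colon 4": (72.7, 8.0),
    "Lung 1": (18.0, 2.2),
    "Lung 2": (100.0, 3.4),
    "Kidney 1": (17.1, 7.7),
    "Kidney 2": (38.4, 15.5),
    "Kidney 3": (10.2, 7.2),
    "Kidney 4": (18.7, 7.7),
}


def tumor_to_nat_ratios(pairs):
    """Return the cancer/NAT ratio of relative expression for each pair."""
    return {name: tumor / nat for name, (tumor, nat) in pairs.items()}


ratios = tumor_to_nat_ratios(PAIRS)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f}x")
```

Every ratio computed this way is greater than 1, which is consistent with the statement that expression is higher in the cancers than in the normal adjacent tissue.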


M. CG100872-01: Protein-Arginine Deiminase 

Expression of gene CG100872-01 was assessed using the primer-probe sets 
Ag4197 and Ag4299, described in Tables MA and MB. Results of the RTQ-PCR runs are 
shown in Table MC. 
Table MA. Probe Name Ag4197 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-gtcccagatgactctgaatgtc-3'                  22       539              239 
Probe     TET-5'-caaggccccagctgtatcttaaagaa-3'-TAMRA    26       561              240 
Reverse   5'-ccttggaggtatggaggactag-3'                  22       594              241 



Table MB. Probe Name Ag4299 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-gtcccagatgactctgaatgtc-3'                  22       539              242 
Probe     TET-5'-caaggccccagctgtatcttaaagaa-3'-TAMRA    26       561              243 
Reverse   5'-ttcgactcttccttggaggtat-3'                  22       604              244 



Table MC. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4197, Run 
174255682 


Rel. Exp.(%) 
Ag4299, Run 
183712613 


Tissue Name 


Rel. Exp.(%) 
Ag4197, Run 
174255682 


Rel. Exp.(%) 
Ag4299, Run 
183712613 


Secondary Th1 act 


0.0 


0.0 


HUVEC IL-1 beta 


0.0 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC IFN gamma 


0.0 


0.0 


Secondary Tr1 act 


0.5 


29.1 

' 


HUVEC TNFalpha 
+ IFN gamma 


0.0 


0.0 


'Secondary Th 1 
;rest 


o.o jo.o 


HUVEC TNFalpha L. 
+ 1L4 P-° 


0.0 


Secondary Th2 
rest 


0.0 


0.0 iHUVECIL-H 


0.0 


0.0 


Secondary Tr1 rest 


0.0 


0 q jLung Microvascular 
|EC none 


0.0 


0.0 


Primary Th1 act 


0.0 


Lung Microvascular 
0.0 EC TNFalpha + IL- 
jlbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


qq Microvascular 

(Dermal EC none 


0.0 


0.0 


Primary Tr1 act 


0.0 


jMicrosvasular 
q q Dermal EC 

TNFalpha + IL- 
jlbeta 


0.0 


0.0 


Primary Th1 rest 


0.0 


0.0 


Bronchial epithelium 
TNFalpha + IL-1 beta 


0.0 


0.0 


Primary Th2 rest 


0.0 


q q Small airvvay 

jepithelium none 


0.0 


0.0 


Primary Tr1 rest 


0.0 


0.0 


Small airway 
;pithelium 
HMFalpha + IL- 
beta 


0.0 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


0.0 


Coronary artery 
SMC rest 


3.0 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


Coronary artery 
SMC TNFalpha + 
IL-1beta 


0.0 


0.0 


CD8 lymphocyte 
act 


0.0 


0.0 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 0.0 


Astrocytes 
TNFalpha+ IL- 
{ 1 beta 


0.0 


0.0 i 


Secondary CD8 
lymphocyte act 


00 loo |KU-8 12 (Basophil) 
(rest 


0.0 


0.0 j 


CD4 lymphocyte 


0.0 


loo 

! 


jivu-o i z ^odsopnn ) 
PMA/ionomycin 


0.0 


0.0 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


jo.o 


CCD1 106 

(Keratinocytes) none 


0.0 


0.0 


LAK cells rest 


0.0 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + lL- 
lbeta 


0.0 


0.0 

1 

J 


LAK cells IL-2 


0.0 ,0.0 


Liver cirrhosis 


0.0 


0.0 ; 


LAK cells IL- 
2+IL-12 

LAK cells IL- 
2+1 FN gamma 


o.o jo.o 

0.0 • 0.0 


inc. i-nivz none 
NCI-H292 1L-4 


0.0 

0.0 


nn ~~" ! 

j 

0.0 


LAK cells IL-2+ 
IL-18 


0.0 


0.0 


NCI-H292 IL-9 


0.0 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


0.0 


NCI-H292 IL-13 


0.0 


0.0 


NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292 I FN 
gamma 


0.0 


0.0 


Two Way MLR 3 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 5 


0.0 


0.0 


HPAEC TNF alpha 
+ IL-1 beta 


0.0 


0.0 


Two Way MLR 7 


0.0 


0.0 


Lung fibroblast none 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


3.0 


PBMC PWM 


0.0 


0.0 


Lung fibroblast IL-4 


1.1 


3.0 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast IL-9 


3.0 


3.0 


Ramos (B cell) 
none 


3.0 


3.0 


Lung fibroblast IL- 
13 


2.5 


3.0 


Ramos (B cell) 
ionomycin 


3.0 


3.0 


Lung fibroblast I FN 
gamma 


3.0 ( 


0.0 


B lymphocytes 
PWM 



B lymphocytes 
CD40L and IL-4 



EOL-1 dbcAMP 



lEOL 1 dbcAMP 
jPMA/ionomycin 



[Dendritic cells 
jnone 

{Dendritic cells 
(LPS 

jDendritic cells 
janti-CD40 



jMonocytes rest 



jMonocytes LPS 



{Macrophages rest 



Dermal fibroblast 
CCD 1070 rest 



Dermal fibroblast 
CCD 1070 TNF 
alpha 



Dermal fibroblast 
CCD 1070 IL-1 beta 



0.0 



0.0 



0.0 



.Macrophages LPS jO.O 



jo.o 



Dermal fibroblast 
1FN gamma 



Dermal fibroblast 
IL-4 



Dermal Fibroblasts 



Neutrophils 
TNFa+LPS 



Neutrophils rest 



Colon 



Lung 



HUVEC none 



io.o 



'HUVEC starved Eo 



jThymus 
[Kidney 



0.0 

0.0 
0.0 
0.0 
0.0 



0.0 



100.0 



CNS_neurodegeneration_v1.0 Summary: Ag4197/Ag4299 Expression of the 
CG100872-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

General_screening_panel_v1.4 Summary: Ag4197 Expression of the 
CG100872-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

Panel 4.1D Summary: Ag4197/Ag4299 Low expression of the CG100872-01 
gene is seen in kidney (CTs=32-36). Therefore, expression of this gene can be used to 
distinguish this sample from other samples used in this panel. In addition, therapeutic 
modulation of this gene can be useful in the treatment of autoimmune and inflammatory 
diseases that affect the kidney including lupus and glomerulonephritis. 

General oncology screening panel_v_2.4 Summary: Ag4197 Expression of the 
CG100872-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 
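
The "low/undetectable" calls in these summaries rest on a CT cutoff. The following is a minimal sketch of such a filter, assuming a sample counts as detectable when its CT does not exceed the cutoff; the sample names and CT values are hypothetical and serve only to show the mechanics.

```python
def detectable_samples(ct_by_sample, cutoff=35.0):
    """Return, in sorted order, the samples whose CT is at or below the cutoff.

    The default of 35 matches the "CTs > 35" criterion used in these summaries;
    other summaries in this document use 34.5 instead.
    """
    return sorted(name for name, ct in ct_by_sample.items() if ct <= cutoff)


# Hypothetical CT values, for illustration only.
example = {"kidney": 32.0, "colon": 36.5, "lung": 40.0}
print(detectable_samples(example))        # ['kidney']
print(detectable_samples(example, 34.5))  # ['kidney']
```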



N. CG100980-01: Protein-Arginine Deiminase Type III 

Expression of gene CG100980-01 was assessed using the primer-probe set 
Ag4200, described in Table NA. Results of the RTQ-PCR runs are shown in Tables NB 
and NC. 

Table NA. Probe Name Ag4200 

Primers   Sequences                                   Length   Start Position   SEQ ID No 
Forward   5'-actgcaccttcattgatgactt-3'                22       1881             245 
Probe     TET-5'-actccataccacatgctgcatggg-3'-TAMRA    24       1904             246 
Reverse   5 cactt laaga ag tttn 3 '                   22       1956             247 



Table NB. General_screening_panel_v1.4 



Tissue Name 


Rel. Exp.(%) 
Ag4200, Run 
221178493 


Tissue Name 


Rel. Exp.(%) 
Ag4200, Run 
221178493 

Adipose 


0.0 


Renal ca. TK-10 


0.1 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI- 
N87 


0.8 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.1 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


2.1 


Squamous cell 
carcinoma SCC-4 


2.4 


Colon ca.* (SW480 met) 
SW620 


0.0 


Testis Pool 


0.1 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.1 


Colon ca. HCT-116 


0.1 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 
3 


0.4 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.1 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


3.6 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
1 


0.2 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


8.5 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.9 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.2 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


8.1 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer (glio/astro) U87- 
MG 


6.0 


Lung 


0.0 


CNS cancer (glio/astro) U-118- 
MG 


0.0 


Fetal Lung 


0.0 


CNS cancer (neuro;met) SK-N- 
AS 


0.0 


Lungca. NCI-N4 17 10.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


0.2 


CNS cancer (astro) SNB-75 


0.4 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) SNB-19 


0.3 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295 


2.4 


Lung ca. A549 


^5 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI-H23 


0.1 


Brain (fetal) 


0.0 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) Pool 


0.0 


Lung ca. HOP-62 


0.2 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


).() 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.6 


Thyroid (female) 


0.0 


Renal ca. ACHN 


100.0 


Pancreatic ca. CAPAN2 


0.1 


Renal ca. UO-31 


0.2 


Pancreas Pool 


0.0 


Table NC. General oncology screening panel_v_2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4200, Run 
268695263 


Tissue Name 


Rel. Exp.(%) 
Ag4200, Run 
268695263 


Colon cancer 1 


0.2 


Bladder cancer NAT 2 


0.0 


Colon cancer NAT 1 


0.0 


Bladder cancer NAT 3 


6.2 


Colon cancer 2 


2.1 


Bladder cancer NAT 4 


0.2 


Colon cancer NAT 2 


0.2 


Adenocarcinoma of the prostate 
1 


4.9 


Colon cancer 3 


0.2 


Adenocarcinoma of the prostate 

2 


0.0 


Colon cancer NAT 3 


q 2 .Adenocarcinoma of the prostate 


0.2 
11.7 


Colon malignant 
cancer 4 


q q -Adenocarcinoma of the prostate 
4 


Colon normal 
adjacent tissue 4 


0.0 


Prostate cancer NAT 5 


0.8 


Lung cancer 1 


0.1 


Adenocarcinoma of the prostate 
6 


0.0 


Lung NAT 1 


q q I Adenocarcinoma of the prostate 


0.2 


Lung cancer 2 


q ^ | Adenocarcinoma of the prostate 


0.0 


Lung NAT 2 


0 q [Adenocarcinoma of the prostate 


0.0 


Squamous cell 
carcinoma 3 


7.9 Prostate cancer NAT 1 0 


1.0 


Lung NAT 3 


0.0 ; Kidney cancer 1 


0.2 


metastatic 
melanoma 1 


0.0 


Kidney NAT 1 


0.0 


Melanoma 2 


0.9 


Kidney cancer 2 


0.6 


Melanoma 3 


0.4 


Kidney NAT 2 


0.0 


metastatic 
melanoma 4 


0.2 


Kidney cancer 3 


0.0 


metastatic 
melanoma 5 


0.0 


Kidney NAT 3 


0.0 


Bladder cancer 1 


0.2 


Kidney cancer 4 


).2 


Bladder cancer NAT 
1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 


100.0 







CNS_neurodegeneration_v1.0 Summary: Ag4200 Expression of the 
CG100980-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



General_screening_panel_v1.4 Summary: Ag4200 Highest expression of the 
CG100980-01 gene is detected in the renal cancer ACHN cell line (CT=23.6). Significant 
expression of this gene is seen exclusively in a cluster of CNS, pancreatic, colon, renal, 
lung, breast, ovarian, squamous cell carcinoma and prostate cancer cell lines. Therefore, 
expression of this gene can be used as a diagnostic marker for these cancers, and therapeutic 
modulation through the use of small molecule drugs could be useful in the treatment of 
these cancers. 

Panel 4.1D Summary: Ag4200 Expression of the CG100980-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 5 Islet Summary: Ag4200 Expression of the CG100980-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4200 Highest 
expression of the CG100980-01 gene is detected in bladder cancer (CT=28.7), with low 
but significant expression of this gene in squamous cell carcinoma, colon and prostate 
cancers. In addition, expression of this gene is higher in the cancers than in the normal 
adjacent tissue. Therefore, expression of this gene may be useful as a marker to detect the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of these cancers. 

O. CG56763-01: GPCR 

Expression of gene CG56763-01 was assessed using the primer-probe set Ag3012, 
described in Table OA. Results of the RTQ-PCR runs are shown in Tables OB and OC. 

Table OA. Probe Name Ag3012 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-ctctttgtcctggtggagaac-3'                   21       162              248 
Probe     TET-5'-acctccctccacaggc< tg a ta-3'-TAMRA     26       213              249 
Reverse   5'-gaaagacatgga; -t ;agaaag-3'                22       239              250 



Table OB. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag3012, Run 
167810404 


Tissue Name 


Rel. Exp.(%) Ag3012, 
Run 167810404 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAPAN 
2 


1.6 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0 


Renal ca UO-31 






0.0 


Renal ca.TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


12.6 


Liver ca. (hepatoblast) 
HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


5.5 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


3.0 jLungca. (small cell) LX-1 jl.4 


Brain (thalamus) 


, . JLungca. (small cell) NCI- 
|H69 


0.0 


Cerebral Cortex 


q q iLung ca. (s.cell var.) SHP- 
I 77 


2.4 


Spinal cord 


100.0 


Lung ca. (large cell) NCI- 
H460 


0.0 


glio/astro U87-MG 


» n ILung ca. (non-sm. cell) 
IA549 


0.0 


glio/astro U-118-MG 


Lungca.(non-s.cell)NCI- 

UU ;H23 


0.0 


astrocytoma SW1783 


Q Q Lung ca. (non-s.cell) HOP- 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) NCI- 
H522 


0.0 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) SW 900 


0.0 


astrocytoma SNB-75 



0.0 


Lung ca. (squam.) NCI- 
H596 


0.0 


glioma SNB-19 


3.2 


Mammary gland 


2.9 


glioma U251 


0.0 


Breast ca.* (pl.ef) MCF-7 


3.5 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) MDA- 
MB-231 


0.0 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) T47D 


0.0 


Heart 


2 0 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


12.5 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 




0.0 


Ovarian ca. OVCAR-3 


3.0 


Thymus 


9.5 


Ovarian ca. OVCAR-4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR-5 


3.5 


Lymph node 


0.0 


Ovarian ca. OVCAR-8 


0.0 


Colorectal 


4.3 


Ovarian ca. IGROV-1 


0.0 


Stomach 


0.0 


Ovarian ca.* (ascites) SK- 
OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Testis 


35.4 


Colon ca. CaCo-2 


0.0 


Melanoma Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


6.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) SK- 
MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 


Table OC. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3012, Run 
164404080 


Tissue Name 


Rel. Exp.(%) 
Ag3012, Run 
164404080 


Secondary Th1 act 


0.0 


HUVEC IL-1beta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Tr1 act 


0.0 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL- 11 


0.0 


Secondary Tr1 rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Th1 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC 


0.0 


Primary Tr1 act 


0.0 


Microvascular Dermal EC 
TNFalpha + IL-1beta 


0.0 


Primary Th1 rest 


0.0 


Bronchial epithelium 
TNFalpha + IL-1beta 


0.0 


Primary Th2 rest 


8.0 


Small airway epithelium none 


0.0 


Primary Tr1 rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1beta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronary artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronary artery SMC 
TNFalpha + IL-1beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


7.2 


Secondary CD8 
lymphocyte rest 


7.9 


Astrocytes TNFalpha + IL- 
1beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


10.2 


KU-812 (Basophil) 
PMA/ionomycin 


77 


2ry Th1/Th2/Tr1_anti- 
CD95 CH11 


0.0 


CCD1106 (Keratinocytes) 
none 


0.0 


LAK cells rest 


10.7 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


9.4 


NCI-H292 none 


0.0 




LAK cells IL-2+ IL-18 


14.1 




NCI-H292 IL-4 


0.0 




LAK cells 
PMA/ionomycin 


0.0 




NCI-H292 IL-9 


18.9 


| 


NK Cells IL-2 rest 


0.0 




NCI-H292 IL-13 


0.0 




Two Way MLR 3 day 


4.0 




NCI-H292 IFN gamma 


0.0 


1 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 




Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-1 
beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


o , iLung fibroblast TNF alpha + 
|IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


).0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


19.6 


Dermal fibroblast CCD1070 
rest 


0.0 


EOL-1 dbcAMP 


16.7 


Dermal fibroblast CCD1070 
TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast CCD1070 
IL-1 beta 


0.0 


Dendritic cells none 


3.0 


Dermal fibroblast IFN gamma 


3.0 


Dendritic cells LPS 


15 


Dermal fibroblast IL-4 


3.0 


Dendritic cells anti- 


3.0 


IBD Colitis 2 


3.0 


Monocytes rest 



Monocytes LPS 



Macrophages rest 



Macrophages LPS 



HUVEC starved 



Lung 



Thymus 



Kidney 



CNS_neurodegeneration_v1.0 Summary: Ag3012 Expression of this gene is 
low/undetectable (CTs > 34.5) across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag3012 This gene is expressed at low levels in the 
samples derived from spinal cord (CT = 32.1) and testis (CT=33.6). Thus, the expression 
of this gene could be used to distinguish these samples from the other samples in the 
panel. Furthermore, therapeutic modulation of the expression or function of this gene may 
be effective in the treatment of CNS disorders, fertility and hypogonadism. 

Panel 4D Summary: Ag3012 This gene is expressed at low levels in a sample 
derived from liver cirrhosis (CT = 33.4). In addition, expression of this gene is not 
detected in normal liver in Panel 1.3D, suggesting that its expression is unique to liver 
cirrhosis. This gene encodes a putative GPCR; therefore, antibodies or small molecule 
therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. This gene is also 
expressed at low levels in the lung and colon. 

General oncology screening panel_v_2.4 Summary: Ag3012 Expression of this 
gene is low/undetectable (CTs > 34.5) across all of the samples on this panel (data not 
shown). 



P. CG56777-01: Prostaglandin-F Synthase 1 

Expression of gene CG56777-01 was assessed using the primer-probe set Ag3017, 
described in Table PA. 

Table PA. Probe Name Ag3017 

Primers   Sequences                                    Length   Start Position   SEQ ID No 
Forward   5'-agaaacttggaccggactatgt-3'                 22       349              251 
Probe     TET-5'-tcatgtaccatttgctatgaagcctg-3'-TAMRA   26       386              252 
Reverse   5'-tcctttggcagtaattctttcc-3'                 22       412              253 
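
The internal consistency of a primer-probe table such as Table PA can be checked programmatically. The following is a minimal sketch, not part of the described method: it transcribes the three Ag3017 oligonucleotides and compares each sequence length with the tabulated length. The helper names `gc_fraction` and `length_mismatches` are hypothetical and introduced only for illustration.

```python
# Primer-probe set Ag3017, transcribed from Table PA as (sequence, stated length).
AG3017 = {
    "Forward": ("agaaacttggaccggactatgt", 22),
    "Probe": ("tcatgtaccatttgctatgaagcctg", 26),
    "Reverse": ("tcctttggcagtaattctttcc", 22),
}


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.lower()
    return (seq.count("g") + seq.count("c")) / len(seq)


def length_mismatches(primer_set: dict) -> list:
    """List oligo names whose sequence length differs from the tabulated length."""
    return [name for name, (seq, stated) in primer_set.items() if len(seq) != stated]


if __name__ == "__main__":
    # An empty list means every tabulated length agrees with its sequence.
    print(length_mismatches(AG3017))
    for name, (seq, _) in AG3017.items():
        print(name, round(gc_fraction(seq), 2))
```

The same check may be applied to any other primer-probe table in this section by substituting the sequences and stated lengths.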
CNS_neurodegeneration_v1.0 Summary: Ag3017 Results from one experiment 
with the CG56777-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

Panel 1.3D Summary: Ag3017 Expression of the CG56777-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3017 Expression of the CG56777-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag3017 Expression of the 
CG56777-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 
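
The "low/undetectable" designation used in the summaries above amounts to a threshold test on CT values. The sketch below illustrates that test under stated assumptions: the default cutoff of 35 is taken from the summaries for Ag3017 (other summaries in this section use 34.5), and the function names and the sample CT values are hypothetical, not measured data.

```python
def classify_ct(ct: float, cutoff: float = 35.0) -> str:
    """Label one sample: CT values above the cutoff are treated as low/undetectable."""
    return "low/undetectable" if ct > cutoff else "detected"


def panel_is_undetectable(ct_values, cutoff: float = 35.0) -> bool:
    """True when every sample on the panel has a CT above the cutoff."""
    return all(ct > cutoff for ct in ct_values)


if __name__ == "__main__":
    # Hypothetical CT values for illustration only.
    example_panel = [36.2, 38.0, 40.0]
    print([classify_ct(ct) for ct in example_panel])
    print(panel_is_undetectable(example_panel))
```

Because a higher CT corresponds to less template, the comparison is "greater than" the cutoff rather than "less than".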

Q. CG56941-01: >ptnr:SPTREMBL-ACC:O60523 Ribonuclease H Type II 

Expression of gene CG56941-01 was assessed using the primer-probe set Ag3096, 
described in Table QA. Results of the RTQ-PCR runs are shown in Tables QB, QC, QD 
and QE. 

Table QA. Probe Name Ag3096 

Primers   Sequences                                    Length   Start Position   SEQ ID No 
Forward   5'-gtttcagaagggcaggaaaa-3'                   20       321              254 
Probe     TET-5'-caacatggacaagaatcggagacgaa-3'-TAMRA   26       342              255 
Reverse   5'-ttcatctccatctccatcca-3'                   20       394              256 



Table QB. CNS_neurodegeneration_v1.0 



Tissue Name 


Rel. Exp.(%) 
Ag3096, Run 
208976837 


Tissue Name 


Rel. Exp.(%) Ag3096, 
Run 208976837 


AD 1 Hippo 


18.9 


Control (Path) 3 Temporal 
Ctx 


11.3 


AD 2 Hippo 


17.1 


Control (Path) 4 Temporal 
Ctx 


43.2 


AD 3 Hippo 


7.1 


AD 1 Occipital Ctx 


46.7 


AD 4 Hippo 


5.1 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


6.7 


AD 6 Hippo 


34.6 


AD 4 Occipital Ctx 


29.3 


Control 2 Hippo 


13.8 


AD 5 Occipital Ctx 


20.3 
Control 4 Hippo 


8.0 


AD 6 Occipital Ctx 


27.2 


Control (Path) 3 
Hippo 


6.0 


Control 1 Occipital Ctx 


11.5 


AD 1 Temporal Ctx 


16.2 


Control 2 Occipital Ctx 


4.1 


AD 2 Temporal Ctx 


25.5 


Control 3 Occipital Ctx 


49.0 


AD 3 Temporal Ctx 


3.4 


Control 4 Occipital Ctx 


4.0 


AD 4 Temporal Ctx 


22.5 


Control (Path) 1 Occipital 
Ctx 


61.6 


AD 5 Inf Temporal 
Ctx 


55.5 


Control (Path) 2 Occipital 
Ctx 


18.2 


AD 5 Sup Temporal 
Ctx 


33.7 


Control (Path) 3 Occipital 
Ctx 


5.5 


AD 6 Inf Temporal 
Ctx 


45.1 


Control (Path) 4 Occipital 
Ctx 


79.6 


AD 6 Sup Temporal 
Ctx 


61.1 


Control 1 Parietal Ctx 


9.6 


Control 1 Temporal 
Ctx 


8.9 


Control 2 Parietal Ctx 


58.2 


Control 2 Temporal 
Ctx 


20.4 


Control 3 Parietal Ctx 


15.9 


Control 3 Temporal 
Ctx 


18.8 


Control (Path) 1 Parietal Ctx 


37.9 


Control 4 Temporal 
Ctx 


12.9 


Control (Path) 2 Parietal Ctx 


18.3 


Control (Path) 1 
Temporal Ctx 


46.0 


Control (Path) 3 Parietal Ctx 


2.2 


Control (Path) 2 
Temporal Ctx 


18.6 


Control (Path) 4 Parietal Ctx 


59.0 


Table QC. Panel 1.3D 


Tissue Name 


Rel. Exp.(%) 
Ag3096, Run 
167985249 


Tissue Name 


Rel. Exp.(%) Ag3096, 
Run 167985249 


Liver 

adenocarcinoma 


0.5 


Kidney (fetal) 


0.2 


Pancreas 


0.0 


Renal ca. 786-0 


0.2 


Pancreatic ca. 
CAPAN 2 


0.1 


Renal ca. A498 


0.2 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.1 


Thyroid 


0.0 


Renal ca. ACHN 


0.1 


Salivary gland 


0.1 


Renal ca. UO-31 


0.2 


Pituitary gland 


0.5 


Renal ca. TK-10 


0.1 


Brain (fetal) 


83.5 


Liver 


3.0 
Brain (whole) 


0.7 


Liver (fetal) 


0.0 


Brain (amygdala) 


1.1 


Liver ca. (hepatoblast) 
HepG2 


3.7 


Brain (cerebellum) 


100.0 


Lung 


0.0 


Brain 

(hippocampus) 


1.1 


Lung (fetal) 


0.1 


Brain (substantia 
nigra) 


0.0 


Lung ca. (small cell) LX-1 


0.3 


Brain (thalamus) 


Lung ca. (small cell) NCI-H69 


0.1 


Cerebral Cortex 


1.0 


Lung ca. (s.cell var.) SHP- 
77 


1.4 


Spinal cord 


0.1 


Lung ca. (large cell) NCI-H460 


0.4 


glio/astro U87-MG 


0.5 


Lung ca. (non-sm. cell) 
A549 


1.2 


glio/astro U-118- 
MG 


0.1 


Lung ca. (non-s.cell) NCI- 


0.3 


astrocytoma 
SW1783 


Qn Lungca.(non-s.cell) HOP- 

•62 


0.1 


neuro*; met SK-N- 
AS 


. . Lunsj ca. (non-s.cl) NCI- i. , 
4 '' H522 i Uj 


astrocytoma SF-539 


0.1 


Lung ca. (squam.) SW 900 


0.1 


astrocytoma SNB- 


0.4 


Lung ca. (squam.) NCI-H596 


0.3 


glioma SNB-19 


0.1 


Mammary gland 


0.0 


glioma U251 


0.4 


Breast ca.* (pl.ef) MCF-7 


0.3 


glioma SF-295 


0.3 


Breast ca.* (pl.ef) MDA- 
MB-231 


0.1 


Heart (fetal) 


0.0 


Breast ca.* (pl.ef) T47D 


0.1 




0.1 


Breast ca. BT-549 


0.2 


Skeletal muscle 
(fetal) 


0.0 


Breast ca. MDA-N 


0.2 


Skeletal muscle 


0.1 


Ovary 


0.0 




0.0 


Ovarian ca. OVCAR-3 


4.4 


Thymus 


0.0 


Ovarian ca. OVCAR-4 


0.1 


Spleen 


0.0 


Ovarian ca. OVCAR-5 


0.4 


Lymph node 


0.1 


Ovarian ca. OVCAR-8 


0.5 


Colorectal 


0.0 


Ovarian ca. IGROV-1 


0.5 


Stomach 


0.1 


Ovarian ca.* (ascites) SK- 
OV-3 


0.6 


Small intestine 


0.0 


Uterus 


0.1 
Colon ca. SW480 


0.4 


Placenta 


0.0 


Colon ca.* 
SW620 (SW480 met) 


0.8 


Prostate 


0.0 


Colon ca. HT29 


0.1 


Prostate ca.* (bone met) PC-3 


0.3 


Colon ca. HCT-116 


0.3 


Testis 


1.2 


Colon ca. CaCo-2 


0.5 


Melanoma Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.1 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC- 
2998 


0.3 


Melanoma UACC-62 


0.1 


Gastric ca.* (liver 
met) NCI-N87 


0.6 


Melanoma M14 


0.1 


Bladder 


2.0 


Melanoma LOX IMVI 


0.3 


Trachea 


0.0 


Melanoma* (met) SK- 
MEL-5 


0.1 


Kidney 


0.1 


Adipose 


0.1 


Table QD. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) Ag3096, 
Run 169990855 


Tissue Name 


Rel. Exp.(%) Ag3096, 
Run 169990855 


Secondary Th1 act 


41.2 


HUVEC IL-1beta 


45.4 


Secondary Th2 act 


70.7 


HUVEC IFN gamma 


35.1 


Secondary Tr1 act 


65.5 


HUVEC TNF alpha + IFN 
gamma 


38.7 


Secondary Th1 rest 


14.4 


HUVEC TNF alpha + IL4 


33.4 


Secondary Th2 rest 


46.0 


HUVEC IL-11 


18.4 


Secondary Tr1 rest 


34.2 


Lung Microvascular EC 
none 


46.0 


Primary Th1 act 


54.7 


Lung Microvascular EC 
TNFalpha + IL-1beta 


39.5 


Primary Th2 act 


57.0 


Microvascular Dermal EC 
none 


100.0 


Primary Tr1 act 


44.1 


Microvascular Dermal EC 
TNFalpha + IL-1beta 


40.9 


Primary Th1 rest 


12.0 


Bronchial epithelium 
TNFalpha + IL-1beta 


17.3 


Primary Th2 rest 


18.2 


Small airway epithelium 
none 


21.6 


Primary Tr1 rest 


17.8 


Small airway epithelium 
TNFalpha + IL-1beta 


34.4 
CD45RA CD4 
lymphocyte act 


67.4 


Coronary artery SMC rest 


37.1 


CD45RO CD4 
lymphocyte act 


60.3 


Coronary artery SMC 
TNFalpha + IL-1beta 


27.5 


CD8 lymphocyte act 


55.1 


Astrocytes rest 


20.2 


Secondary CD8 
lymphocyte rest 


Astrocytes TNFalpha + IL-1beta 


26.1 


Secondary CD8 
lymphocyte act 


42.0 


KU-812 (Basophil) rest 


18.2 


CD4 lymphocyte 
none 


26.4 


KU-812 (Basophil) 
PMA/ionomycin 




2ry Th1/Th2/Tr1 anti-CD95 CH11 


17.9 


CCD1106 (Keratinocytes) 
none 


62.4 


LAK cells rest 


31.4 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1beta 


25.2 


LAK cells IL-2 


41.5 


Liver cirrhosis 


11.8 


LAK cells IL-2+IL- 
12 


53.6 


NCI-H292 none b.3 


LAK cells IL-2+IFN 
gamma 


59.5 


NCI-H292 IL-4 


29.9 


LAK cells IL-2+IL-18 


36.6 


NCI-H292 IL-9 


45.4 


LAK cells 
PMA/ionomycin 


36.3 


NCI-H292 IL-13 


48.6 


NK Cells IL-2 rest 


42.9 


NCI-H292 IFN gamma 


48.0 


Two Way MLR 3 
day 


69.3 


HPAEC none 


28.5 


Two Way MLR 5 


79.0 


HPAEC TNF alpha + IL-1 beta 


46.0 


Two Way MLR 7 
day 


21.5 


Lung fibroblast none 


26.4 


PBMC rest 


j 3 j jLung fibroblast TNF alpha 
j+ IL-1 beta 


30.6 


PBMC PWM 


46.0 


Lung fibroblast IL-4 


34.4 


PBMC PHA-L 


18.3 


Lung fibroblast IL-9 


62.9 


Ramos (B cell) none 


59.0 


Lung fibroblast IL-13 


62.0 


Ramos (B cell) 
ionomycin 


49.3 


Lung fibroblast IFN gamma 


28.5 


B lymphocytes 
PWM 


S4 q jDermal fibroblast 
jCCDl 070 rest 


58.6 


B lymphocytes 
CD40L and IL-4 


37 ^ jDermal fibroblast 

|CCD1070 TNF alpha 


57.4 
EOL-1 dbcAMP 


22.8 


Dermal fibroblast 
CCD1070 IL-1 beta 


33.7 


EOL-1 dbcAMP 
PMA/ionomycin 


20.2 


Dermal fibroblast IFN 
gamma 


12.9 


Dendritic cells none 


81.8 


Dermal fibroblast IL-4 


35.6 


Dendritic cells LPS 


15.5 


Dermal Fibroblasts rest 


22.8 


Dendritic cells anti-CD40 


14.9 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


25.9 



Neutrophils rest 


3.4 


Monocytes LPS 




Colon 


3.5 


Macrophages rest 


36.9 


Lung 


11.2 


Macrophages LPS 


14.8 


Thymus 


16.8 


HUVEC none 


47.6 


Kidney 


14.6 


HUVEC starved 


28.3 




Table QE. General oncology screening panel_v_2.4 


Tissue Name 


Rel. Exp.(%) 
Ag3096, Run 
267920136 


Tissue Name 


Rel. Exp.(%) Ag3096, 
Run 267920136 


Colon cancer 1 


11.8 


Bladder cancer NAT 2 


0.0 


Colon cancer NAT 1 !4.8 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


7.2 


Bladder cancer NAT 4 


3.3 


Colon cancer NAT 2 


7.8 


Adenocarcinoma of the 
prostate 1 


31.2 


Colon cancer 3 


22.7 


Adenocarcinoma of the 
prostate 2 


0.0 


Colon cancer NAT 3 


12.2 


Adenocarcinoma of the 
prostate 3 


3.2 


Colon malignant 
cancer 4 


6.4 


Adenocarcinoma of the 
prostate 4 


12.0 


Colon normal 
adjacent tissue 4 


5.0 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


10.8 


Adenocarcinoma of the 
prostate 6 


2.0 


Lung NAT 1 


0.0 


Adenocarcinoma of the 
prostate 7 


9.0 


Lung cancer 2 


100.0 


Adenocarcinoma of the 
prostate 8 


0.0 


Lung NAT 2 


2.6 


Adenocarcinoma of the 
prostate 9 


11.4 


Squamous cell 
carcinoma 3 


17.2 


Prostate cancer NAT 10 


0.0 


Lung NAT 3 


3.0 


Kidney cancer 1 


9.3 
metastatic 
melanoma 1 


4.6 


Kidney NAT 1 


10.5 


Melanoma 2 


0.0 


Kidney cancer 2 


40.6 


Melanoma 3 


2.6 


Kidney NAT 2 


7.7 


metastatic 
melanoma 4 


Kidney cancer 3 


5.4 


metastatic 
melanoma 5 


Kidney NAT 3 


0.0 


Bladder cancer 1 


10.0 


Kidney cancer 4 


10.4 


Bladder cancer NAT 
1 


0.0 


Kidney NAT 4 


5.4 


Bladder cancer 2 


0.0 







expression of the CG56941-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

Panel 1.3D Summary: Ag3096 Highest expression of the CG56941-01 gene is 
detected in brain (cerebellum) (CT=27.7). In addition, high to moderate expression of this 
gene is also detected in fetal brain and other regions of the central nervous system, including 
amygdala, hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and 
spinal cord. Therefore, therapeutic modulation of this gene product may be useful in the 
treatment of central nervous system disorders such as Alzheimer's disease, Parkinson's 
disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Low levels of expression of this gene are also seen in an ovarian cancer, two lung 
cancer, a liver cancer, a colon cancer and a CNS cancer cell line. Therefore, therapeutic 
modulation of this gene could be useful in the treatment of these cancers. 

Panel 4.1D Summary: Ag3096 Highest expression of the CG56941-01 gene is 
detected in microvascular dermal endothelial cells (CT=34.4). In addition, low levels of 
expression of this gene are also seen in dendritic cells and two-way MLR. Therefore, 
therapeutic modulation of this gene could be useful in the treatment of autoimmune and 
inflammatory diseases that involve these cells, such as lupus erythematosus, asthma, 
emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, osteoarthritis, and 
General oncology screening panel_v_2.4 Summary: Ag3096 Highest 
expression of the CG56941-01 gene is detected in lung cancer (CT=34.3). Expression of 
this gene is higher in the cancer sample than in the corresponding adjacent control sample 
(CT=39.6). Thus, expression of this gene may be useful as a diagnostic marker for detection 
of lung cancer, and therapeutic modulation of this gene may be useful in the treatment of 
this cancer. 
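
The difference between two CT values can be converted into an approximate fold difference in template abundance. The sketch below assumes ideal amplification (a doubling per cycle, expressed by the hypothetical parameter `efficiency=2.0`) and normalizes the sample with the lowest CT to 100%. The function names are illustrative; the normalization is an assumption about how the "Rel. Exp.(%)" columns could be derived, not a statement of the procedure actually used.

```python
def fold_difference(ct_a: float, ct_b: float, efficiency: float = 2.0) -> float:
    """Approximate how many times more template sample A holds than sample B."""
    return efficiency ** (ct_b - ct_a)


def relative_expression_percent(ct: float, ct_min: float, efficiency: float = 2.0) -> float:
    """Express a sample relative to the lowest-CT sample, which is set to 100%."""
    return 100.0 * efficiency ** (ct_min - ct)


if __name__ == "__main__":
    # CT values quoted in the summary above: lung cancer 34.3, adjacent control 39.6.
    print(round(fold_difference(34.3, 39.6), 1))
    print(round(relative_expression_percent(39.6, 34.3), 1))
```

Under these assumptions, the 5.3-cycle difference between the lung cancer sample and its adjacent control corresponds to a fold difference of roughly 39, or about 2.5% relative expression for the control, which is close to the 2.6 listed for Lung NAT 2 in Table QE.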

R. CG57109-01 and CG57109-02 and CG57109-03 and CG57109-04 and CG57109-05 
and CG57109-06: Doublecortin/CAMKinase 

Expression of gene CG57109-01, variants CG57109-02, CG57109-03, CG57109-04 
and CG57109-06, and full-length physical clone CG57109-05 was assessed using the 
primer-probe sets Ag1137, Ag1150, Ag1860, Ag3112 and Ag4281, described in Tables 
RA, RB, RC, RD and RE. Results of the RTQ-PCR runs are shown in Tables RF, RG, RH, 
RI, RJ, RK, RL, RM and RN. Please note that the variants CG57109-03, CG57109-04 and 
CG57109-06 correspond to the probe and primer sets Ag1150 and Ag3112 only. CG57109-05 
represents a full-length physical clone of the CG57109-01 gene, validating the 
prediction of the gene sequence. 



Table RA. Probe Name Ag1137 

Primers   Sequences                                    Length   Start Position   SEQ ID No 
Forward   5'-gacatggtggacagtgagatct-3'                 22       1338             257 
Probe     TET-5'-cctctctcaccccaacatcgtgaaat-3'-TAMRA   26       1373             258 
Reverse   5'-tctgtttcgtagacttcatgca-3'                 22       1399             259 



Table RB. Probe Name Ag1150 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-gaaattggctgattttggactt-3'                  22       1634             260 
Probe     TET-5'-cctatatttactgtgtgtgggacccca-3'-TAMRA   27       1674             261 
Reverse   5'-agaatttcgggagctacgtaag-3'                  22       1702             262 



Table RC. Probe Name Ag1860 

Primers   Sequences                                    Length   Start Position   SEQ ID No 
Forward   5'-gacatggtggacagtgagatct-3'                 22       1338             263 
Probe     TET-5'-cctctctcaccccaacatcgtgaaat-3'-TAMRA   26       1373             264 
Reverse   5'-tctgtttcgtagacttcatgca-3'                 22       1399             265 
Table RD. Probe Name Ag3112 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-gaaattggctgattttggactt-3'                  22       1634             266 
Probe     TET-5'-cctatatttactgtgtgtgggacccca-3'-TAMRA   27       1674             267 
Reverse   5'-agaatttcgggagctacgtaag-3'                  22       1702             268 



Table RE. Probe Name Ag4281 

Primers   Sequences                                    Length   Start Position   SEQ ID No 
Forward   5'-gacatggtggacagtgagatct-3'                 22       1338             269 
Probe     TET-5'-atccagagcctctctcaccccaacat-3'-TAMRA   26       1365             270 
Reverse   5'-tcgtagacttcatgcaatttca-3'                 22       1393             271 



Table RF. AI_comprehensive panel_v1.0 



Tissue Name 


Rel. Exp.(%) Ag1860, 
Run 225404259 


Tissue Name 


Rel. Exp.(%) Ag1860, 
Run 225404259 


110967 COPD-F 


37 


112427 Match Control 
Psoriasis-F 


7.9 


110980 COPD-F 


0.0 


112418 Psoriasis-M 


13.1 


110968 COPD-M 


20.0 


112723 Match Control 
Psoriasis-M 


17.9 


110977 COPD-M 


3.0 


112419 Psoriasis-M 


26.8 


110989 Emphysema- 
F 


26.2 


112424 Match Control 
Psoriasis-M 


19.5 


110992 Emphysema-F 


4.2 


112420 Psoriasis-M 


36.9 


110993 Emphysema-F 


11.0 


112425 Match Control 
Psoriasis-M 


23.3 


110994 Emphysema-F 


19.8 


104689 (MF) OA Bone- 
Backus 


13.0 


110995 Emphysema- 
F 


8.5 


104690 (MF) Adj 
"Normal" Bone-Backus 


3.9 


110996 Emphysema- 
F 


0.0 


104691 (MF) OA 
Synovium-Backus 


29.5 


110997 Asthma-M 


17.2 


104692 (BA) OA 
Cartilage-Backus 


0.0 


111001 Asthma-F 


38.4 


104694 (BA) OA Bone- 
Backus 


17.3 


111002 Asthma-F 


27.9 


104695 (BA) Adj 
"Normal" Bone-Backus 


0.0 
111003 Atopic 
Asthma-F 


50.7 


104696 (BA)OA 
Synovium-Backus 


93.3 


111004 Atopic 
Asthma-F 


68.3 


104700 (SS) OA Bone- 
Backus 


11.7 


111005 Atopic 
Asthma-F 


56.6 


104701 (SS)Adj 
"Normal" Bone-Backus 


15.3 


111006 Atopic 
Asthma-F 


17.9 


104702 (SS)OA 
Synovium-Backus 


100.0 


111417 Allergy-M 


31.0 


117093 OA Cartilage 
Rep7 


19.1 


112347 Allergy-M 


8.3 


112672 OA Bone5 


6.8 


112349 Normal 
Lung-F 


2.5 


112673 OA Synovium5 


4.8 


112357 Normal 
Lung-F 


4.4 


112674 OA Synovial Fluid 
cells5 


13.9 


112354 Normal 
Lung-M 


0.0 


117100 OA Cartilage 
Rep 14 


4.5 


112374 Crohns-F 


27.5 


112756 OA Bone9 


63.7 


112389 Match 
Control Crohns-F 


9.0 


112757 OA Synovium9 


17.7 


112375 Crohns-F 


36.6 


112758 OA Synovial Fluid 
Cells9 


11.7 


112732 Match 
Control Crohns-F 


0.0 


117125 RA Cartilage 
Rep2 


27.5 


112725 Crohns-M 


4.4 


113492 Bone2 RA 


20.0 


112387 Match 
Control Crohns-M 


17.1 


113493 Synovium2 RA 


6.8 


112378 Crohns-M 


4.2 


113494 Syn Fluid Cells 
RA 


8.8 


112390 Match 
Control Crohns-M 


49.3 


113499 Cartilage4 RA 


4.7 


112726 Crohns-M 


30.1 


113500 Bone4 RA 


8.8 


112731 Match 
Control Crohns-M 


0.0 


113501 Synovium4 RA 


3.5 


112380 Ulcer Col-F 


37.9 


113502 Syn Fluid Cells4 
RA 


4.0 


112734 Match 
Control Ulcer Col-F 


17.9 


113495 Cartilage3 RA 


10.4 


112384 Ulcer Col-F 


29.7 


113496 Bone3 RA 


4.2 


112737 Match 
Control Ulcer Col-F 


9.0 


113497 Synovium3 RA 


0.0 


112386 Ulcer Col-F 


5.0 


113498 Syn Fluid Cells3 
RA 


7.9 
112738 Match 
Control Ulcer Col-F 


9.8 


117106 Normal Cartilage 
Rep20 


0.0 


112381 Ulcer Col-M 


0.0 


113663 Bone3 Normal 


0.0 


112735 Match 
Control Ulcer Col-M 


28.7 


113664 Synovium3 
Normal 


0.0 


112382 Ulcer Col-M 


10.0 


113665 Syn Fluid Cells3 
Normal 


0.0 


112394 Match 
Control Ulcer Col-M 


0.0 


117107 Normal Cartilage 
Rep22 


0.0 


112383 Ulcer Col-M 


35.4 


113667 Bone4 Normal 


13.3 


112736 Match 
Control Ulcer Col-M 


6.3 


113668 Synovium4 
Normal 


0.0 


112423 Psoriasis-F 


21.5 


113669 Syn Fluid Cells4 
Normal 


9.4 



Table RG. CNS_neurodegeneration_v1.0 



Tissue Name 


Rel. Exp.(%) Ag1860, 
Run 207807786 


Rel. Exp.(%) Ag3112, 
Run 208976863 


Rel. Exp.(%) Ag4281, 
Run 224075294 


Tissue Name 


Rel. Exp.(%) Ag1860, 
Run 207807786 


Rel. Exp.(%) Ag3112, 
Run 208976863 


Rel. Exp.(%) Ag4281, 
Run 224075294 

AD 1 Hippo 


7.6 


2.7 


9.7 


Control 
(Path) 3 
Temporal 
Ctx 


0.5 


2.9 


3.1 


AD 2 Hippo 


29.1 


21.8 


24.5 


Control 
(Path) 4 
Temporal 
Ctx 


28.7 


23.0 


27.9 


AD 3 Hippo 


1.5 


3.2 


7.1 


AD 1 

Occipital 
Ctx 


17.6 


9.8 


27.4 


AD 4 Hippo 


4.2 


1.1 


5.6 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


0.0 


AD 5 Hippo 


100.0 


100.0 


95.3 


AD 3 

Occipital 

Ctx 


4.1 


3.0 


1.3 


AD 6 Hippo 


19.9 


12.2 


26.1 


AD 4 

Occipital 

Ctx 


7.9 


4.8 


8.5 


Control 2 
Hippo 


27.2 


15.9 


39.0 


AD 5 

Occipital 

Ctx 


13.8 


44.1 


50.0 
Control 4 
Hippo 


3.1 


2.1 


7.2 


AD 6 

Occipital 
Ctx 


33.4 


8.3 


14.8 


Control (Path) 
3 Hippo 


2.1 


0.0 


5.0 


Control 1 
Occipital 
Ctx 


0.0 


0.0 


2.0 


AD 1 

Temporal Ctx 


2.9 


4.1 


9.3 


Control 2 
Occipital 
Ctx 


62.0 


81.2 


100.0 


AD 2 

Temporal Ctx 


21.6 


27.0 


23.5 


Control 3 
Occipital 
Ctx 


5.0 


2.2 


15.5 


AD 3 

Temporal Ctx 


2.7 


0.0 


6.3 


Control 4 
Occipital 
Ctx 


0.5 


0.0 


2.4 


AD 4 

Temporal Ctx 


14.6 


15.6 


12.8 


Control 
(Path) 1 
Occipital 
Ctx 


34.6 


42.6 


43.8 


AD 5 Inf 
Temporal Ctx 


38.2 


48.3 


54.7 


Control 
(Path) 2 
Occipital 
Ctx 


6.2 


7.1 


6.7 


AD 5 Sup 
Temporal Ctx 


26.8 


26.2 


27.0 


Control 
(Path) 3 
Occipital 
Ctx 


0.0 


0.0 


0.6 


AD 6 Inf 
Temporal Ctx 


12.6 


11.7 


25.2 


Control 
(Path) 4 
Occipital 
Ctx 


15.8 


4.7 


25.7 


AD 6 Sup 
Temporal Ctx 


17.7 


10.9 


17.4 


Control 1 

Parietal 

Ctx 


3.7 


1.0 


1.2 


Control 1 
Temporal Ctx 


4.0 


1.1 


4.6 


Control 2 

Parietal 

Ctx 


18.8 


12.9 


19.9 


Control 2 
Temporal Ctx 


22.5 


12.6 


23.0 


Control 3 

Parietal 

Ctx 


8.1 


2.7 


7.9 


Control 3 
Temporal Ctx 


7.0 


4.8 


4.8 


Control 
(Path) 1 
Parietal 
Ctx 


33.4 


44.1 


42.9 


Control 4 
Temporal Ctx 


4.8 


1.9 


6.5 


Control 
(Path) 2 
Parietal 
Ctx 


17.4 


11.0 


13.6 








Control (Path) 
1 Temporal 
Ctx 


37.6 


34.9 


47.0 


Control 
(Path) 3 
Parietal 
Ctx 


1.7 


0.0 


0.7 


Control (Path) 
2 Temporal 
Ctx 


28.1 


13.6 


27.9 


Control 
(Path) 4 
Parietal 
Ctx 


19.9 


6.9 


24.8 



Table RH. General_screening_panel_v1.4 



Tissue Name 


Rel. Exp.(%) 
Ag4281, Run 
222183233 


Tissue Name 


Rel. Exp.(%) Ag4281, 
Run 222183233 


Adipose 


0.5 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


5.4 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* M 14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.2 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 met) 
SW620 


0.0 


Testis Pool 


34.2 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.9 


Colon ca. CaCo-2 


2.6 


Placenta 


2.0 


Colon cancer tissue 


5.6 


Uterus Pool 


0.8 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.7 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 
3 


0.6 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


4.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.5 


Ovarian ca. IGROV- 
1 


0.0 


Stomach Pool 


0.4 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


1.1 
Ovary 


0.6 


Fetal Heart 


4.5 


Breast ca. MCF-7 


0.0 


Heart Pool 


2.7 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


5.9 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


22.1 


Breast ca. T47D 




Skeletal Muscle Pool 


9.3 


Breast ca. MDA-N 


0.0 


Spleen Pool 


2.4 


Breast Pool 


3.1 


Thymus Pool 


3.1 


Trachea 


1.7 


CNS cancer (glio/astro) 
U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U- 
118-MG 


0.0 


Fetal Lung 


14.4 


CNS cancer (neuro;met) 
SK-N-AS 


0.8 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) SNB-75 


0.0 


Lung ca. NCI-H146 


0.9 


CNS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF-295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


25.5 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


5.4 


Lung ca. NCI-H23 


2.1 


Brain (fetal) 


100.0 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) Pool 


59.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


14.1 


Lung ca. NCI-H522 


0.7 


Brain (Substantia nigra) 
Pool 


15.1 


Liver 


0.3 


Brain (Thalamus) Pool 


28.9 


Fetal Liver 


1.0 


Brain (whole) 


33.9 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


25.2 


Kidney Pool 


4.2 


Adrenal Gland 


2.3 


Fetal Kidney 


4.6 


Pituitary gland Pool 


36.6 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.4 


Renal ca. A498 


0.0 


Thyroid (female) 


0.4 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


2.1 



Table RI. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag1860, Run 
165981809 


Rel. Exp.(%) 
Ag3112, Run 
167985258 


Tissue Name 


Rel. Exp.(%) 
Ag1860, Run 
165981809 


Rel. Exp.(%) 
Ag3112, Run 
167985258 


Liver 
adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


0.8 


1.4 


Pancreas 


0.0 


0.0 


Renal ca. 786-0 


0.0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


0.0 


Renal ca. A498 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


Renal ca. RXF 393 


0.0 


0.0 


Thyroid 


0.0 


0.0 


Renal ca. ACHN 


0.0 


0.0 


Salivary gland 


0.0 


0.0 


Renal ca. UO-31 


0.0 


0.0 


Pituitary gland 


12.6 


6.7 


Renal ca. TK-10 


0.0 


0.0 


0.0 


Brain (fetal) 


26.6 


99.3 


Liver 


0.0 


Brain (whole) 


7.0 


8.3 


Liver (fetal) 


0.0 


0.0 


Brain (amygdala) 


35.6 


19.3 


Liver ca. (hepatoblast) 
HepG2 


0.0 


0.0 


Brain 

(cerebellum) 




1.3 


Lung 


0.0 


0.0 


Brain 

(hippocampus) 




19.5 


Lung (fetal) 


1.0 


2.4 


Brain (substantia 
nigra) 




6.7 


Lung ca. (small 
cell) LX-1 


_, ....... 


— — 


Brain (thalamus) 


5.8 


0.6 


Lung ca. (small 
cell)NCI-H69 


0.0 


0.0 


Cerebral Cortex 


13.6 


9.3 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


0.0 


Spinal cord 




100.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87- 
MG 




0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


1.3 


glio/astro U-118-MG 




0.0 


Lung ca. (non- 
s.cell) NCI-H23 


2.1 


0.0 


astrocytoma 
SW1783 




0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK- 
N-AS 


0.0 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


0.0 


astrocytoma SF- 
539 


0.0 


0.6 


Lung ca. (squam.) 
SW 900 


0.0 


0.0 


astrocytoma 
SNB-75 


0.0 


3.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


0.0 


glioma SNB-19 


0.0 


3.0 


Mammary gland 


3.0 


0.0 


glioma U251 


0.0 


3.0 


Breast ca.* (pl.ef) 
MCF-7 


3.0 


0.0 


glioma SF-295 


3.0 


3.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


3.0 


0.0 






Heart (fetal) 


0.0 


0.0 


Breast ca.* (pl.ef) 
T47D 


0.0 




Heart 


0.7 


4.5 


Breast ca. BT-549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


4.6 


19.2 


Breast ca. MDA-N 


0.0 


0.0 


Skeletal muscle 


4.9 


8.1 


Ovary 


0.0 


0.0 


Bone marrow 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Thymus 


1.1 


2.2 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Lymph node 


0.7 


1.3 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


2.8 


0.3 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


0.0 


1.1 


Ovarian ca.* 
(ascites) SK-OV-3 


0.0 


1.7 


Small intestine 


1.1 


2.0 


Uterus 


0.6 


0.0 


Colon ca. SW480 


0.0 


0.0 


Placenta 


2.8 


0.0 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Testis 


42.6 


36.1 


Colon ca. CaCo-2 


0.7 


0.0 


Melanoma 
Hs688(A).T 


0.0 


0.0 


Colon ca. 
tissue(OD03866) 


2.0 


4.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


0.0 


Colon ca. HCC- 
2998 


0.0 


0.0 


Melanoma UACC- 
62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.0 


0.0 


Melanoma M14 


0.0 


0.0 


Bladder 


4.0 


1.3 


Melanoma LOX 
IMVI 


0.0 


0.0 


Trachea 


3.0 


1.4 


Melanoma* (met) 
SK-MEL-5 


0.0 


0.0 


Kidney 


3.0 


0.0 


Adipose 


1.3 


2.6 



Table RJ. Panel 2.2 



Tissue Name 


Rel. Exp.(%) Ag1860, 
Run 174229165 


Tissue Name 


Rel. Exp.(%) Ag1860, 
Run 174229165 

0.0 


Normal Colon 


20/ 


Kidney Margin (OD04348) 


Colon cancer 
(OD06064) 


0.0 


Kidney malignant cancer 
(OD06204B) 


28.9 


Colon Margin 
(OD06064) 


12.2 


Kidney normal adjacent 
tissue (OD06204E) 


0.0 


Colon cancer 
(OD06159) 


nn Kidne\ f diver (OD04450- 

jOl) 


0.0 


Colon Margin 
(OD06159) 


12.6 


Kidney Margin (OD04450- 
03) 


0.0 


Colon cancer 
(OD06297-04) 


37.6 


Kidney Cancer 8120613 


20.0 


Colon Margin 
(OD06297-05) 


15.0 


Kidney Margin 8120614 


0.0 


CC Gr.2 ascend colon 
(OD03921) 


45.4 


Kidney Cancer 9010320 


17.7 


CC Margin (OD03921) 


35.1 


Kidney Margin 9010321 


21.0 


Colon cancer metastasis 
(OD06104) 


0.0 


Kidney Cancer 8120607 


16.5 


Lung Margin 
(OD06104) 


19.1 


Kidney Margin 8120608 


0.0 


Colon mets to lung 
(OD04451-01) 


51.8 


Normal Uterus 


27.4 


Lung Margin 
(OD04451-02) 


0.0 


Uterine Cancer 064011 




Normal Prostate 


0.0 


Normal Thyroid 


0.0 


Prostate Cancer 
(OD04410) 


0.0 


Thyroid Cancer 064010 


28.9 


Prostate Margin 
(OD04410) 


0.0 


Thyroid Cancer A302152 




Normal Ovary 


0.0 


Thyroid Margin A302153 


0.0 


Ovarian cancer 
(OD06283-03) 


0.0 


Normal Breast 


28.3 


Ovarian Margin 
(OD06283-07) 


0.0 


Breast Cancer (OD04566) 


56.3 


Ovarian Cancer 064008 


30.4 


Breast Cancer 1024 


55.5 


Ovarian cancer 
(OD06145) 


0.0 


Breast Cancer (OD04590- 
01) 


94.0 


Ovarian Margin 
(OD06145) 


35.1 


Breast Cancer Mets 
(OD04590-03) 


36.9 


Ovarian cancer 
(OD06455-03) 


g q Breast Cancer Metastasis 
|(OD04655-05) 


40.3 
Ovarian Margin 
(OD06455-07) 


0.0 


Breast Cancer 064006 


0.0 


Normal Lung 


13.3 


Breast Cancer 9100266 


0.0 


Invasive poor diff. lung 
adeno (ODO4945-01) 




Breast Margin 9100265 


0.0 


Lung Margin 
(ODO4945-03) 


— 


Breast Cancer A209073 


37.4 


Lung Malignant Cancer 
(OD03126) 




Breast Margin A2090734 


23.2 


Lung Margin 
(OD03126) 




Breast cancer (OD06083) 


19.3 


Lung Cancer 
(OD05014A) 


0.0 


Breast cancer node 
metastasis (OD06083) 


60.3 


Lung Margin 
(OD05014B) 




Normal Liver 


15.5 


Lung cancer 
(OD06081) 





Liver Cancer 1026 


16.5 


Lung Margin 
(OD06081) 




Liver Cancer 1025 


0.0 


Lung Cancer 
(OD04237-01) 


Liver Cancer 6004-T 


0.0 


Lung Margin 
(OD04237-02) 


100.0 


Liver Tissue 6004-N 


0.0 


Ocular Melanoma 
Metastasis 


0.0 


Liver Cancer 6005-T 


0.0 


Ocular Melanoma 
Margin (Liver) 


0.0 


Liver Tissue 6005-N 


0.0 


Melanoma Metastasis 


0.0 


Liver Cancer 064003 


0.0 


Melanoma Margin 
(Lung) 


0.0 


Normal Bladder 


0.0 


Normal Kidney 


0.0 


Bladder Cancer 1023 


27.2 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


3.0 


Bladder Cancer A302173 


0.0 


Kidney Margin 
(OD04338) 


0.0 


Normal Stomach 


7.3 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


0.0 


Gastric Cancer 9060397 


zo.o 


Kidney Margin 
(OD04339) 


0.0 


Stomach Margin 9060396 


59.0 


Kidney Ca, Clear cell 
type (OD04340) 


0.0 


Gastric Cancer 9060395 


4.4 


Kidney Margin 
(OD04340)


•0 5 


Stomach Margin 9060394


9.7
Kidney Ca, Nuclear
grade 3 (OD04348) 


0.0 


Gastric Cancer 064005 


48.0 


Table RK. Panel 3D 




Tissue Name 


Rel. Exp.(%)
Ag3112, Run
182114339


Tissue Name 


Rel. Exp.(%) 
Ag3112, Run 
182114339 


Daoy- 

Medulloblastoma 
TE671- 

Medulloblastoma 


0.0 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) 


0.0 


0.0


ES-2- Ovarian clear cell 
carcinoma


0.0 


D283 Med- 
Medulloblastoma 


0.0 


Ramos- Stimulated with 
PMA/ionomycin 6h 


0.0 


PFSK-1- Primitive

Neuroectodermal


0.0 


Ramos- Stimulated with 
PMA/ionomycin 14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic 
myelogenous leukemia 
(megakaryoblast)


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma
T98G- Glioblastoma


0.0 


Daudi- Burkitt's lymphoma 0.0




U266- B-cell plasmacytoma 0.0


SK-N-SH-
(metastasis)




CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkin's B-cell 
lymphoma 


0.0 


Cerebellum 




JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


2.4 


Jurkat- T cell leukemia 


7.3 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


0.0 


TF-1- Erythroleukemia 


0.0 


DMS-114- Small cell
lung cancer 


0.0 


HUT78- T-cell lymphoma


0.0 


DMS-79- Small cell 
lung cancer 


00.0 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell 
lung cancer 


,0 


KU-812- Myelogenous
leukemia


0.0 


NCI-H526- Small cell 
lung cancer 


).0 


769-P- Clear cell renal
carcinoma


10 


NCI-N417- Small cell 
lung cancer 


.0 ( 
c 


Caki-2- Clear cell renal
carcinoma


10 


NCI-H82- Small cell 
lung cancer ( 


.0 f 
c 


SW 839- Clear cell renal
carcinoma


).0 


NCI-H157- Squamous


-0 ( 


G401- Wilms' tumor


.0
cell lung cancer
(metastasis) 








NCI-H1155- Large cell
lung cancer 


3.6 


Hs766T- Pancreatic 
carcinoma (LN metastasis) 


0.0 


NCI-H1299- Large cell
lung cancer 


0.0 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


0.0 


NCI-H727- Lung
carcinoid 


0.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis)


0.0


NCI-UMC-11- Lung
carcinoid


0.0


BxPC-3- Pancreatic
adenocarcinoma


0.0 


LX-1- Small cell lung 
cancer 


0.0


HPAC- Pancreatic
adenocarcinoma


0.0 


Colo-205- Colon 
cancer 


0.0


MIA PaCa-2- Pancreatic
carcinoma


0.0 


KM12- Colon cancer


0.0


CFPAC-1- Pancreatic ductal
adenocarcinoma


0.0 


KM20L2- Colon 
cancer 


0.0


PANC-1- Pancreatic
epithelioid ductal carcinoma


0.0 


NCI-H716- Colon 
cancer 


0.0 


T24- Bladder carcinoma
(transitional cell)


0.0


SW-48- Colon
adenocarcinoma


5637- Bladder carcinoma 


0.0 


SW1116- Colon
adenocarcinoma 


HT-1197- Bladder carcinoma


0.0 


LS 174T- Colon 
adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinoma
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1080- Fibrosarcoma


0.0 


NCI-SNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric 
carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva)


0.0 


NCI-SNU-16- Gastric 
carcinoma 


0.0


SJRH30- Rhabdomyosarcoma 
(met to bone marrow) 


0.0 


NCI-SNU-1- Gastric
carcinoma 


).0 


A431- Epidermoid carcinoma


0.0 


RF-1- Gastric
adenocarcinoma 


).0 


WM266-4- Melanoma 


3.0 


RF-48- Gastric 
adenocarcinoma 


).0 


DU 145- Prostate carcinoma 
(brain metastasis)


10 


MKN-45- Gastric


).0 


MDA-MB-468- Breast 


).0
carcinoma




adenocarcinoma 




NCI-N87- Gastric 
carcinoma 


0.0 


SCC-4- Squamous cell 
carcinoma of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell 
carcinoma of tongue 


0.0 


RL95-2- Uterine 
carcinoma 


0.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.0 


HelaS3- Cervical
adenocarcinoma


0.0


CAL 27- Squamous cell
carcinoma of tongue




Table RL. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4281, Run 
181080824 


Tissue Name 


Rel. Exp.(%)
Ag4281, Run 
181080824 


Secondary Th1 act


0.0 


HUVEC IL-1beta


17.4 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Tr1 act


0.0


HUVEC TNF alpha + IFN
gamma


0.0


Secondary Th1 rest


0.0


HUVEC TNF alpha + IL4


0.0


Secondary Th2 rest


0.0


HUVEC IL-11


0.0


Secondary Tr1 rest


0.0


Lung Microvascular EC none


0.0


Primary Th1 act


0.0


Lung Microvascular EC
TNFalpha + IL-1beta


8.5


Primary Th2 act 


0.0 


Microvascular Dermal EC 
none 


0.0 


Primary Tr1 act


0.0 


Microvascular Dermal EC
TNFalpha + IL-1beta


1.6 


Primary Th1 rest


0.0


Bronchial epithelium
TNFalpha + IL1beta


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0


Primary Tr1 rest


0.0


Small airway epithelium
TNFalpha + IL-1beta


0.0 


CD45RA CD4 
lymphocyte act 


2.8 


Coronary artery SMC rest


5.7 


CD45RO CD4 
lymphocyte act 


0.0 


Coronary artery SMC
TNFalpha + IL-1beta


25.7 


CD8 lymphocyte act


).0 


Astrocytes rest 


0.0 


Secondary CD8
lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL-
1beta


1.1 


Secondary CD8
lymphocyte act 


0.0 


KU-812 (Basophil) rest


3.0 


CD4 lymphocyte none 


1.3 


KU-812 (Basophil)
PMA/ionomycin


3.0
2ry Th1/Th2/Tr1_anti-
CD95 CH11


2.9 


CCD1106 (Keratinocytes)
none 


0.0 


LAK cells rest 


0.0 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


1.4 


LAK cells IL-2+IL-12


0.0 


NCI-H292 none


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4


0.0 


LAK cells IL-2+IL-18


0.0


NCI-H292 IL-9


1.2 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-13


4.9 


NK Cells IL-2 rest 


1.8 


NCI-H292 IFN gamma


0.0 


Two Way MLR 3 day 


1.1 


HPAEC none 


2.9 


Two Way MLR 5 day 


1.3 


HPAEC TNF alpha + IL-1
beta 


100.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


2.9 


PBMC rest 


0.0 


Lung fibroblast TNF alpha +
IL-1 beta


3.4 


PBMC PWM 


0.0


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


oTo 


Lung fibroblast IL-9


0.0


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13


T.9 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD 1070
rest 


5.4 


B lymphocytes CD40L 
and IL-4 


00 


Dermal fibroblast CCD 1070 
TNF alpha 




EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1070
IL-1 beta 


49.0 


EOL-1 dbcAMP 
PMA/ionomycin


0.0 


Dermal fibroblast IFN 
gamma 


6.7 


Dendritic cells none 


i^L_ — 


Dermal fibroblast IL-4 


0.6 


Dendritic cells LPS 


tl 


Dermal Fibroblasts rest 


3.5 


Dendritic cells anti- 
CD40 


8.7 


Neutrophils TNFa+LPS 


0.9 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


3.9 


Colon 


0.9 


Macrophages rest 


0.0 


Lung 


27.4 


Macrophages LPS 


4.7 


Thymus 


17.0 


HUVEC none 


0.0 


Kidney


81.8 


HUVEC starved 


0.0
Table RM. Panel 4D



Tissue Name


Rel. Exp.(%)
Ag1860, Run
165828919


Rel. Exp.(%)
Ag3112, Run
164526081


Tissue Name


Rel. Exp.(%)
Ag1860, Run
165828919


Rel. Exp.(%)
Ag3112, Run
164526081


Secondary Th1 act


0.0


0.0


HUVEC IL-1beta


.6.5 




Secondary Th2 act


0.0 


0.0 


HUVEC IFN


0.0


0.0 


Secondary Tr1 act


0.8


0.0


HUVEC TNF alpha
+ IFN gamma 


10.6 


0.0 


Secondary Th1
rest 


0.0 


1.7 


HUVEC TNF alpha 
+ IL4 


0.6 


0.0 


Secondary Th2 
rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.0 


Secondary Tr1 rest


0.6 


0.0 


Lung Microvascular 
EC none 


0.0 


0.0 


Primary Th1 act


0.0 


0.0 


Lung Microvascular 
EC TNFalpha + IL-
1beta


9.7 


9.7 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


0.0


Primary Tr1 act


0.7 


0.0 


Microvascular
Dermal EC
TNFalpha + IL-
1beta


1.4 


5.9 


Primary Th1 rest


0.7 


0.0 


Bronchial 
epithelium 
TNFalpha + IL1 beta 


0.6 


0.0 


Primary Th2 rest 


0.6 


0.0 


Small airway 
epithelium none 


0.0 


0.0 


Primary Tr1 rest


3.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL-
1beta


0.0 


3.0 


CD45RA CD4 
lymphocyte act 


5.7 


2.9 


Coronary artery
SMC rest 


?.7 


0.4 


CD45RO CD4 
lymphocyte act 


).0 


).0


Coronary artery
SMC TNFalpha +
IL-1beta


>3.7 


2.8 


CD8 lymphocyte

act


•4 ( 


.0 j 


Astrocytes rest


•0 ( 


.0 


Secondary CD8
lymphocyte rest 


•0 C 


i 

.0 1 


Astrocytes

TNFalpha + IL-1
beta


1.7 C 


.0
Secondary CD8
lymphocyte act


0.0 


1.9 


KU-812 (Basophil)
rest 


0.0 


0.0 


CD4 lymphocyte
none


0.0 


0.0 


KU-812 (Basophil)
PMA/ionomycin


0.0 


0.0 


2ry

Th1/Th2/Tr1_anti-
CD95 CH11


0.0


2.0 


CCD1106

(Keratinocytes) 

none 


0.0 


0.0 


LAK cells rest 


0.9 


0.0 


CCD 1106
(Keratinocytes)
TNFalpha + IL-
1beta


0.0 




0.0 


LAK cells IL-2 


2.7 


0.0 


Liver cirrhosis 


20.0 


6.4 


LAK cells IL-
2+IL-12


0.0 


0.0 


Lupus kidney 


0.0 


0.0 


LAK cells IL- 
2+IFN gamma 


0.4 


0.0 


NCI-H292 none 


0.0 


0.0 


LAK cells IL-2+
IL-18 


0.0 


0.0 


NCI-H292 IL-4


0.0 


0.0 


LAK cells 
PMA/ionomycin 


1.8 


0.0 


NCI-H292 IL-9


0.0 


2.6 


NK Cells IL-2 rest 


1.7 


0.0 


NCI-H292 IL-13 


0.0 


1.7 
0.0 


Two Way MLR 3 
day 


1.6 


0.0 


NCI-H292 IFN
gamma 


0.0 


Two Way MLR 5 
day 


0.0 


0.0 


HPAEC none 


0.0 


0.0 


Two Way MLR 7 
day 


0.0 


0.0 


HPAEC TNF alpha 
+ IL-1 beta 


100.0 


69.3 


PBMC rest 


0.8 


0.0 


Lung fibroblast 
none 


0.0 


0.0 


PBMC PWM 


1.5 


1.2 


Lung fibroblast 
TNF alpha +IL-1 
beta 


16.4 


3.5 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast IL-4 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast IL- 
13 


0.7 


0.0 


B lymphocytes 
PWM 


0.0 


2.5 


Lung fibroblast IFN 
gamma 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


1.7 


Dermal fibroblast 
CCD 1070 rest 


2.3 


4.1 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


28.7 


42.6
EOL-1 dbcAMP
PMA/ionomycin


0.0 


0.0 


Dermal fibroblast
CCD 1070 IL-1 beta


49.7


100.0 


Dendritic cells 
none 


4.6 


0.0 


Dermal fibroblast
IFN gamma


0.0 


0.9 


Dendritic cells 
LPS 


4.2 


2.9 


Dermal fibroblast 
IL-4


0.3 


0.0 


Dendritic cells 
anti-CD40 


15.8 


11.5 


IBD Colitis 2 


1.9 


0.0 


Monocytes rest 


0.0 


0.0 


IBD Crohn's 


2.7 


5.1 


Monocytes LPS 


2.3


0.0


Colon 


15.6 


3.2 


Macrophages rest 


5.7 


3.4 


Lung 


23.0 


27.7 


Macrophages LPS 


6.8 


3.4 


Thymus 


0.0 


1.7 


HUVEC none 


0.0 


0.0 


Kidney 


26.8 


27.5 


HUVEC starved 


0.0 


0.0 









Tissue Name


Rel. Exp.(%) Ag1860,
Run 171634856


Tissue Name


Rel. Exp.(%) Ag1860,
Run 171634856


BA4 Control 11.7 ; BA17PSP 




B \4 rontrol2 
BA4 

Alzheimer's2 


L 5 : 2 _ |BA17PSP2 

4 - 8 jSub Nigra Control 


7.9 
0.0 


BA4 Parkinson's 


23.3 


Sub Nigra Control2 


1.4 


BA4 

Parkinson's2 


18.6 


Sub Nigra 
Alzheimer's2 


0.0 


BA4 

Huntington's 


4.9 


Sub Nigra Parkinson's2 


2.3 


BA4 

Huntington's2 


2.5 


Sub Nigra Huntington's 


10.5 


BA4 PSP 


5.0 


Sub Nigra 
Huntington's2 


23.3 


BA4 PSP2 


2.5 


Sub Nigra PSP2 


1.2 


BA4 Depression 


5.9 


Sub Nigra Depression 


0.0 


BA4 

Depression2 


3.3 


Sub Nigra Depression2 


5.1 


BA7 Control 


19.5 


Glob Palladus Control 


12.1 


BA7 Control 


23.5 


Glob Palladus Control2 


16.6 


BA7 

Alzheimer's2 


10.5 


Glob Palladus
Alzheimer's


5.8 


BA7 Parkinson's 


1.0 


Glob Palladus
Alzheimer's2


1.0 


BA7 


5 - 5 |Glob Palladus 


00.0
Parkinson's2




Parkinson's 




BA7 

Huntington's 


11.8 


Glob Palladus 
Parkinson's2 


30.6 


Huntington's2 


9.5 


Glob Palladus PSP 


1.7 


BA7 PSP 


2.3 


Glob Palladus PSP2 


21.9 


BA7 PSP2 


5.5


Glob Palladus
Depression


1.5 


BA7 Depression 


6 0 jTemp Pole Control 


1 1.1 


BA9 Control 


14.8 'Temp Pole Control 


47.3 


BA9 Control 


33 - 4 jTemp Pole Alzheimer's 


2.1 




g 7 jTemp Pole 
; Alzheimer's2 


4.6 


BA9 

Alzheimer's2 


1 1 • 1 JTemp Pole Parkinson's 


19.1 


BA9 Parkinson's 


] j g ! Temp Pole 
jParkinson's2 


20.9 


BA9 

Parkinson's2 


24 ] Temp Pole 
.Huntington's 


25.5 


BA9 ! I 

Huntington's kVt> jTemp Pole PSP 


00 


BA9 

Huntington's2 


8-0 jTemp Pole PSP2 


7.7 


BA9 PSP 


0.0 


Temp Pole 
Depression2 


1.9 


BA9 PSP2 


3.8 


Cing Gyr Control 


4.7 


BA9 Depression 


3.5 


Cing Gyr Control 


20.4 


BA9 

Depression2


5.6 


Cing Gyr Alzheimer's 


6.0 


BA17 Control


21.1 


Cing Gyr Alzheimer's2 


4.5 


BA17 Control2


46.0 


Cing Gyr Parkinson's 


1.7 


BA17 

Alzheimer's2


4.0 


Cing Gyr Parkinson's2 


10.4 


BA17 
Parkinson's 


12.5 


Cing Gyr Huntington's 


20.2 


BA17 

Parkinson's2 


35.6 


Cing Gyr 
Huntington's2 




BA17 

Huntington's 


8.6 


Cing Gyr PSP 


0.0


BA17 

Huntington's2 


0.2 


Cing Gyr PSP2 


2.3 


BA17 
Depression 


1.2 


Cing Gyr Depression 


,0
BA17

Depression2



18.4 



Cing Gyr Depression2 



1.4 



AI_comprehensive panel_v1.0 Summary: Ag1860 Highest expression of the
CG57109-01 transcript in this panel is seen in synovium from an OA patient (CT=33.7).
Overall, this gene is expressed in OA tissue but not in normal joint tissue and is expressed
in pulmonary tissue from patients with atopic asthma but not in normal lung tissue. Please
see Panel 4D for discussion of utility of this gene in inflammation.

CNS_neurodegeneration_v1.0 Summary: Ag1860/Ag3112/Ag4281 Three
experiments with two different probe and primer sets produce results that are in very good
agreement. This panel does not show differential expression of the CG57109-01 gene in
Alzheimer's disease. However, this expression profile confirms the presence of this gene
in the brain. Please see Panel 1.4 for discussion of utility of this gene in the central
nervous system. One experiment with Ag6051, which is specific to CG57109-06 only, is
in general agreement with the results above. Highest expression in this panel was in the
temporal cortex of an Alzheimer's patient (CT=34.6).

General_screening_panel_v1.4 Summary: Ag4281 Highest expression of the
CG57109-01 gene appears to be in the fetal brain (CT=29.5). Overall, expression of this
gene appears to be highly brain-specific in this panel, with moderate levels of expression
in the amygdala, hippocampus, thalamus and spinal cord and low but significant levels in
the cerebral cortex and the substantia nigra. This gene encodes a novel doublecortin/CAM
kinase like protein. Other members of this family have been implicated in the
calcium-signaling pathway that controls neuronal migration in the developing brain. In addition,
CAM kinase has been shown to play a crucial role in hippocampal LTP from studies in
transgenic and knock-out mice, and may also play a role in memory formation in the
mature nervous system as well as the developing brain. CAM kinases have also been
shown to phosphorylate tau, an integral component of the neurofibrillary tangles seen in
Alzheimer's, in a manner which shifts tau electrophoretic mobility to that seen in the AD
brain. Furthermore, tau from AD brains shows aberrant phosphorylation. Thus, based on
the expression of this DCAM kinase homolog in the brain, therapeutic modulation of the
expression or function of this gene product may be useful in the treatment of learning and
memory deficits that are a result of aging or neurodegenerative disease and also in the
treatment of neurologic disorders themselves, including Alzheimer's disease.

Moderate to low levels of expression are also seen in a variety of samples from
normal tissues, including testis, fetal and adult heart and skeletal muscle and fetal lung.

In addition, this gene is expressed at much higher levels in fetal lung (CT=32.3) 
when compared to expression in the adult counterpart (CT=40). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 

Panel 1.3D Summary: Ag1860/Ag3112 Two experiments with the same probe
and primer set produce results that are in excellent agreement, with highest expression of
the CG57109-01 gene in the spinal cord (CTs=31.7). Expression of this gene is restricted
to the nervous system and the testis. Thus, expression of this gene could be used to
differentiate between neural and non-neural tissue. Please see Panel 1.4 for further
discussion of utility of this gene in the CNS.

Panel 2.2 Summary: Ag1860 Expression of the CG57109-01 gene is
low/undetectable in all samples on this panel (CTs>35). (Data not shown.)
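
The relationship between the CT values quoted in these summaries and the "Rel. Exp.(%)" columns of the tables can be illustrated with a minimal sketch. It assumes, as is conventional for real-time PCR but not stated in this section, that each amplification cycle doubles the template, so that every sample is scaled to the sample with the lowest CT on the panel. The function name, the 40-cycle cutoff for undetected samples, and the sample names and CT values are hypothetical and are not taken from the runs reported here.

```python
def relative_expression(ct_values, undetected_ct=40.0):
    """Scale CT values to percent of the highest-expressing sample.

    Assumption: ideal amplification efficiency (template doubles each
    cycle), so a sample crossing threshold one cycle later carries half
    as much template. Samples at or above `undetected_ct` are reported
    as 0.0 (hypothetical cutoff).
    """
    detected = {name: ct for name, ct in ct_values.items() if ct < undetected_ct}
    if not detected:
        return {name: 0.0 for name in ct_values}
    lowest_ct = min(detected.values())  # highest-expressing sample
    return {
        name: round(100.0 * 2.0 ** (lowest_ct - ct), 1) if name in detected else 0.0
        for name, ct in ct_values.items()
    }


# Hypothetical CT values for illustration only.
example = {"sample_a": 30.0, "sample_b": 31.0, "sample_c": 33.0, "sample_d": 40.0}
print(relative_expression(example))
```

Under these assumptions the sample with the lowest CT is reported as 100.0, and a panel on which every CT exceeds 35 would still yield percentages even though the absolute signal is low, which is why such panels are described as low/undetectable rather than tabulated.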

Panel 3D Summary: Ag3112 Expression of the CG57109-01 gene is restricted to
a sample derived from a lung cancer cell line (CT=32.6). Thus, expression of this gene
could be used to differentiate between this sample and other samples on this panel and as a
marker to detect the presence of lung cancer. Furthermore, therapeutic modulation of the
expression or function of this gene may be effective in the treatment of lung cancer.

Panel 4.1D Summary: Ag1860 This CG57109-01 transcript is highly expressed
in activated dermal fibroblasts, endothelial cells, and astrocytes after treatment with IL-1
or TNFalpha. Highest expression is seen in treated HPAECs (CT=31.3). Please see Panel
4D for discussion of utility of this gene in inflammation.

Panel 4D Summary: Ag1860 This transcript is highly expressed in activated
dermal fibroblasts, endothelial cells, and astrocytes after treatment with IL-1 or TNFalpha,
with highest expression in TNF alpha and IL-1 beta treated HPAECs (CT=30.9). This
protein has homology to protein kinase and may be involved in leukocyte extravasation
from the peripheral blood into tissues (Borbiev T, Am J Physiol Lung Cell Mol Physiol
2001 May;280(5):L983-90). Therefore, antagonistic therapeutics designed against the
protein encoded by this transcript may reduce or inhibit inflammation due to asthma,
allergy, emphysema, osteoarthritis, colitis, psoriasis, or delayed type hypersensitivity.
Agonistic therapies may also direct leukocyte traffic into tumors or sites of infection.

Ag3112 Highest expression of the transcript is seen in IL-1 beta treated dermal
fibroblasts (CT=30.4). Expression is in agreement with the profile seen with Ag1860,
except no expression is seen in astrocytes.

Panel CNS_1 Summary: Ag1860 This panel confirms expression of the
CG57109-01 gene in the brain. Please see Panel 1.4 for discussion of utility of this gene in
the central nervous system.

General oncology screening panel_v_2.4 Summary: Ag1860 Expression of the
CG57109-01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not
shown.) Ag6051 Expression of the CG57109-06 gene is low/undetectable in all samples
on this panel (CTs>35). (Data not shown.)

S. CG57366-01: KIAA1223

Expression of gene CG57366-01 was assessed using the primer-probe set Ag3219, 
described in Table SA. Results of the RTQ-PCR runs are shown in Tables SB, SC, SD and 



Table SA. Probe Name Ag3219

Primers   Sequences                                    Length   Start Position   SEQ ID No
Forward   5'-agggatacccttcctcagaga-3'                  21       3380             272
Probe     TET-5'-agccttaaacaagctctgaagcttca-3'-TAMRA   26       3409             273
Reverse   5'-gctagggtcagaaccttcaatc-3'                 22       3435             274



Tissue Name 


Rel. Exp.(%) 
Ag3219, Run 
209862297 


Rel. Exp.(%) 
Ag3219, Run 
224079667 


Tissue Name 


Rel. Exp.(%) 
Ag3219, Run 
209862297 


Rel. Exp.(%) 
Ag3219, Run 
224079667 


AD 1 Hippo 


18.9 


14.7 


Control (Path) 
3 Temporal Ctx 


9.5 


9.2 


AD 2 Hippo 


41.2 


52.5 


Control (Path) 
4 Temporal Ctx 


47.3 


41.2 


AD 3 Hippo 


1.9 


14.0 


AD 1 Occipital 
Ctx 


20.4 


28.5 


AD 4 Hippo 


6.3 


9.9 


AD 2 Occipital 
Ctx (Missing) 


0.0 


0.0 


AD 5 Hippo 


85.3 


90.1 


AD 3 Occipital 
Ctx 


5.0 


10.7 


AD 6 Hippo 


54.3 


64.2 


AD 4 Occipital 
Ctx 


28.9 


19.5
Control 2
Hippo


30.1 


39.8 


AD 5 Occipital 
Ctx 


32.8 


21.5 


Control 4 
Hippo 


15.7 


18.2 


AD 6 Occipital 
Ctx 


24.1 


40.9 


Control 
(Path) 3 
Hippo 


11.8




Control 1 
Occipital Ctx 


5.8 


6.7 

j 

44.4 


AD 1 

Temporal 

Ctx 


0.4 


36.3 


Control 2 1 
Occipital Ctx p 6 - 9 


AD 2 

Temporal 

Ctx 


31.6 


42.3 


Control 3 | 
Occipital Ctx \ Z - b 


|21.3 


AD 3 

Temporal 

Ctx 


2.3 


8.6 


Control 4 j 
Occipital Ctx | 8 - 2 


11.8 

J 


AD 4 

Temporal 

Ctx 


19.9 


30.1 


Control (Path) j 
I Occipital Ctx j 


j 

79.6 

I 


AD 5 Inf j 
Temporal 199.3 
Ctx 


100.0 .Control (Path) 

12 Occipital Ctx ! y 


21.0 | 


AD 5 Sup 
Temporal 
Ctx 


55.9 


7 , 7 jControl (Path) 
3 Occipital Ctx 


h.6 


7.0 


AD 6 Inf 
Temporal 
Ctx 


52.5 


66.0 


Control (Path) 
4 Occipital Ctx 


21.6 


26.6 


AD 6 Sup 
Temporal 
Ctx 


58.2 


65.5 


Control 1 
Parietal Ctx 


13.2 


13.5 


Control 1 
Temporal 
Ctx 


9.5 


10.7 


Control 2 
Parietal Ctx 


0.0 


55.1 


Control 2 
Temporal 
Ctx 


32.8 


37.1


Control 3 
Parietal Ctx 


21.9 


8.0 


Control 3 
Temporal 
Ctx 


>3.7 : 


.4.8 ( 


Control (Path)
Parietal Ctx 


100.0 




Control 3 
Temporal < 
Ctx 


.1 


1.2 \ 


Control (Path) 
Parietal Ctx 


27.2 : 


4.3 


Control 

(Path) 1 * 
Temporal 


3.5 6 


3.3 < 


Control (Path)
Parietal Ctx 


5.4 7 


.5
Ctx












Control 
(Path) 2 
Temporal 
Ctx 


50.3 


52.5 


Control (Path) 
4 Parietal Ctx 


53.6 


48.0 



Tissue Name 


Rel. Exp.(%) Ag3219,
Run 168013879 


Tissue Name 


Rel. Exp.(%) 
Ag3219, Run 
168013879 


Liver adenocarcinoma 


14.9 


Kidney (fetal) 


31.0 


Pancreas 


1.8 


Renal ca. 786-0 


19.3 


Pancreatic ca. CAPAN 
2 


17.4 




Renal ca. A498 
Renal ca. RXF 393 


10.4 


Adrenal gland 


1.7 


8.5 


Thyroid 


5.3 


Renal ca. ACHN 




Salivary gland 


b.9 


Renal ca. UO-31


14.4 


Pituitary gland


2.4 


Renal ca. TK-10 


16.8 


Brain (fetal) 
Brain (whole)


,58.2 

112.9 " 1 


Liver 

Liver (fetal) 


3.0 
3.7 


Brain (amygdala)


10.6


Liver ca. (hepatoblast)
HepG2


6.2


Brain (cerebellum)


9.7


Lung


4.5


Brain (hippocampus)


8.4


Lung (fetal)


27.7


Brain (substantia nigra)


8.5


Lung ca. (small cell)
LX-1


5.3


Brain (thalamus)


53


Lung ca. (small cell)
NCI-H69


17.1


Cerebral Cortex


7 i


Lung ca. (s.cell var.)
SHP-77


42.9


Spinal cord


5.5


Lung ca. (large
cell)NCI-H460


4.0


glio/astro U87-MG


99


Lung ca. (non-sm. cell)
A549


22.7


glio/astro U-118-MG


8.2


Lung ca. (non-s.cell)
NCI-H23


17.0


astrocytoma SW1783


7.0


Lung ca. (non-s.cell)
HOP-62


7.2 


neuro*; met SK-N-AS 1 


8.1 


Lung ca. (non-s.cl) 
NCI-H522 


4.4 


astrocytoma SF-539 1 


8.4 


Lung ca. (squam.) SW 
900 6 


5.1 


astrocytoma SNB-75 4 


4.1 


Lung ca. (squam.) NCI- 5 


7.4
H596




glioma SNB-19


14.5 


Mammary gland 


7.5 


glioma U251 


55.9 


Breast ca.* (pl.ef) 
MCF-7 


46.3 


glioma SF-295 


21.2 


Breast ca.* (pl.ef) 
MDA-MB-231 


16.3 


Heart (fetal) 


2.2 


Breast ca.* (pl.ef) T47D 


31.9 


Heart 


3.9 


Breast ca. BT-549 


9.5 


Skeletal muscle (fetal) 


1.6 


Breast ca. MDA-N 


10.4 


Skeletal muscle 


3.1 


Ovary


3.8 


Bone marrow 


1.5 


Ovarian ca. OVCAR-3 


17.6 


Thymus 


6.4 


Ovarian ca. OVCAR-4 


5.4 


Spleen 


1.1 


Ovarian ca. OVCAR-5 


80.1 


Lymph node 


2.8 


Ovarian ca. OVCAR-8 


9.3 


Colorectal 


4.0 


Ovarian ca. IGROV-1


4.9 


Stomach 


50 


Ovarian ca.* (ascites) 
SK-OV-3 


100.0 


Small intestine 


3.7 


Uterus 


7.2 


Colon ca. SW480 


2.8 


Placenta 


1.4 


Colon ca.* 
SW620(SW480 met) 


9.5 


Prostate 


4.8 


Colon ca. HT29 


9.5 


Prostate ca.* (bone 
met)PC-3 


29.3 


Colon ca. HCT-116 


8.3 


Testis 


8.2 


Colon ca. CaCo-2 


5.6 


Melanoma Hs688(A).T 


8.9 


Colon ca. 
tissue(OD03866) 


5.4 


Melanoma* (met) 
Hs688(B).T 


13.4 


Colon ca. HCC-2998 


8.7 


Melanoma UACC-62 


3.0 


Gastric ca.* (liver met) 
NCI-N87 


16.5 


Melanoma M14 


2.8 


Bladder 


21.3 


Melanoma LOX IMVI 


11.0 


Trachea 


2.3 


Melanoma* (met) SK- 
MEL-5 


7.5 


Kidney 


6.3 


Adipose 


17.3 


Table SD. Panel 2.2 


Tissue Name 


Rel. Exp.(%) Ag3219,
Run 174416266 


Tissue Name 


Rel. Exp.(%) 
Ag3219, Run 
174416266 


Normal Colon 


22.8 


Kidney Margin 
(OD04348) 


51.4 


Colon cancer 


25.2 


Kidney malignant cancer 


36.6
(OD06064)




(OD06204B)




Colon Margin 
(OD06064) 


2.0 


Kidney normal adjacent
tissue (OD06204E)


5.3 


Colon cancer 
(OD06159) 


23 


Kidney Cancer 
(OD04450-01)


54.3 


Colon Margin 
(OD06159) 


14.5 


Kidney Margin 
(OD04450-03) 


5.7 


Colon cancer 
(OD06297-04) 


1.1 


Kidney Cancer 8120613 


0.0 


Colon Margin 
(OD06297-05) 


23.0


Kidney Margin 8120614


0.9 


CC Gr.2 ascend colon 
(OD03921) 


3.6 


Kidney Cancer 9010320 


3.0 


CC Margin (OD03921) 


6.9 


Kidney Margin 9010321 


3.0 


Colon cancer metastasis 
(OD06104) 


3.5 


Kidney Cancer 8120607


2.6 


Lung Margin
(OD06104)


Kidney Margin 8120608 


0.1 


Colon mets to lung
(OD04451-01)


Normal Uterus 


44.1 


Lung Margin
(OD04451-02)


Uterine Cancer 064011 


13.2 


Normal Prostate 0.8


Normal Thyroid 


3.0 


Prostate Cancer 
(OD04410) 


11.3 


Thyroid Cancer 064010 


10.2 


Prostate Margin 
(OD04410) 


12.2 


Thyroid Cancer 
A302152 


14.0 


Normal Ovary 


2.4 


Thyroid Margin 
A302153 


5.1 


Ovarian cancer 
(OD06283-03)


2.7 


Normal Breast 


25.2 


Ovarian Margin 
(OD06283-07)


14.0 


Breast Cancer 
(OD04566) 


19.6 


Ovarian Cancer 064008


9.9 


Breast Cancer 1024 




Ovarian cancer 
(OD06145)


7.1 


Breast Cancer 
(OD04590-01) 


17.1 


Ovarian Margin 
(OD06145)


23.0 


Breast Cancer Mets 
(OD04590-03) 


31.2 


Ovarian cancer 
(OD06455-03) 


17.3 


Breast Cancer 
Metastasis (OD04655- 
05) 


100.0 


Ovarian Margin
(OD06455-07)


12.2 


Breast Cancer 064006 


27.9
Normal Lung


21.2 


Breast Cancer 9100266


91 


Invasive poor diff. lung 
adeno (ODO4945-01 


20.7 


Breast Margin 9100265 


13.2 


Lung Margin 
(ODO4945-03) 


24.1 


Breast Cancer A209073 


14.1 


Lung Malignant Cancer 
(OD03126) 


8.7 


Breast Margin 
A2090734 


22.4 


Lung Margin 
(OD03126) 


16.2 


Breast cancer 
(OD06083)


100.0 


Lung Cancer 
(OD05014A) 


17.0 


metastasis (OD06083)


60.7 


Lung Margin 
(OD05014B) 


42.3 


Normal Liver 


12.8 


Lung cancer 
(OD06081) 


10.4 


Liver Cancer 1026 


0.4 


Lung Margin 
(OD06081) 


13.2 


Liver Cancer 1025




Lung Cancer 
(OD04237-01) 


10.4 


Liver Cancer 6004-T 


1.1 


Lung Margin 
(OD04237-02) 


30.6 


Liver Tissue 6004-N 


0.5


Ocular Melanoma 
Metastasis 


9.8 


Liver Cancer 6005-T 




Ocular Melanoma 
Margin (Liver) 


7.3 






Melanoma Metastasis 


32.5 


Liver Cancer 064003 


7.1 


Melanoma Margin 
(Lung) 


21.6 


Normal Bladder 


18.6 


Normal Kidney 


4.7 


Bladder Cancer 1023 


3.5 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


28.3 


Bladder Cancer 
A302173 


10.8 


Kidney Margin 
(OD04338) 


11.8


Normal Stomach 


27.4 


Kidney Ca Nuclear 
grade 1/2 (OD04339)


34.2 


Gastric Cancer 9060397 


2.4 


Kidney Margin 
(OD04339) 




Stomach Margin 
9060396 


3.1 


Kidney Ca, Clear cell 
type (OD04340) 


11.0 


Gastric Cancer 9060395 


5.2 


Kidney Margin 
(OD04340) 


3.6 


Stomach Margin 
9060394 


4.3 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


1.5 


Gastric Cancer 064005


5.8
Tissue Name


Rel. Exp.(%) Ag3219,
Run 164531987


Tissue Name


Rel. Exp.(%)
Ag3219, Run
164531987


Secondary Th1 act


0.1


HUVEC IL-1beta


27.5 


Secondary Th2 act 


0.1 


HUVEC IFN gamma


35.8


Secondary Tr1 act


0.3


HUVEC TNF alpha +
IFN gamma


39.5


Secondary Th1 rest


0.3


HUVEC TNF alpha +
IL4


54.7 


Secondary Th2 rest 


0.4 


HUVEC IL-11


30.4


Secondary Tr1 rest


0.3 


Lung Microvascular EC 
none 


63.7 


Primary Th1 act


0.5


Lung Microvascular EC
TNFalpha + IL-1 beta


43.2 


Primary Th2 act 


0.6 


Microvascular Dermal 
EC none 


49.3 


Primary Tr1 act


q ,


Microvascular Dermal
EC TNFalpha + IL-1beta


19.5


Primary Th1 rest


■ s


Bronchial epithelium
TNFalpha + IL1beta


24.3


Primary Th2 rest


1.2


Small airway epithelium
none


6.7


Primary Tr1 rest


0.5


Small airway epithelium
TNFalpha + IL-1 beta


50.0


CD45RA CD4
lymphocyte act 


6.6 


Coronary artery SMC
rest 


39.0 


CD45RO CD4 
lymphocyte act 


0.2 


Coronary artery SMC
TNFalpha + IL-1 beta 


18.6 


CD8 lymphocyte act 


0.1 


Astrocytes rest 


10.2 


Secondary CD8 
lymphocyte rest 


0.4


Astrocytes TNFalpha +
IL-1 beta


7.9 


Secondary CD8 
lymphocyte act 


0.1 


KU-812 (Basophil) rest


13.3 


CD4 lymphocyte none 


0.1 


KU-812 (Basophil)
PMA/ionomycin 


42.0 


2ry Th1/Th2/Tr1_anti-
CD95 CH11


0.2 


CCD 1106

(Keratinocytes) none


16.2 


LAK cells rest 


2.3 


CCD 1106
(Keratinocytes)
TNFalpha + IL-1 beta


3.8 


LAK cells IL-2 


0.4 


Liver cirrhosis


2.1 


LAK cells IL-2+IL-12 


0.2 


Lupus kidney 


1.3 


LAK cells IL-2+IFN 


0.9 


NCI-H292 none 


16.6
gamma








LAK cells IL-2+IL-18 


0.5 


NCI-H292 IL-4 


2? 8 


LAK cells
PMA/ionomycin 


1.2 


NCI-H292 IL-9 


19.8 


NK Cells IL-2 rest 


0.1 


NCI-H292 IL-13 


14.5 


Two Way MLR 3 day 


1.0 


NCI-H292 IFN gamma


14.2 


Two Way MLR 5 day 


0.7 


HPAEC none


43.8 


Two Way MLR 7 day 


0.3 


HPAEC TNF alpha + IL- 
1 beta 


24.0 


PBMC rest 


1.8 


Lung fibroblast none 


8.9 


PBMC PWM 


2.2 




Lung fibroblast TNF 
alpha + IL-1 beta 




PBMC PHA-L 


0.8 


Lung fibroblast IL-4 


16.5 


Ramos (B cell) none 


2.9 


Lung fibroblast IL-9 


12.6 


Ramos (B cell) 
ionomycin 


15.3 


Lung fibroblast IL-13 


11.9 


B lymphocytes PWM 


4.4 




Lung fibroblast IFN
gamma 


10.4


B lymphocytes CD40L 
and IL-4


0.7 


Dermal fibroblast
CCD 1070 rest


28.1 


EOL-1 dbcAMP 


3.7 




Dermal fibroblast 
CCD 1070 TNF alpha 


22.2


EOL-1 dbcAMP 
PMA/ionomycin 


2.0 




Dermal fibroblast 
CCD 1070 IL-1 beta


16.4 


Dendritic cells none 


4.6 


Dermal fibroblast IFN
gamma 


8.7 


Dendritic cells LPS 


3.2 


Dermal fibroblast IL-4 


— 


Dendritic cells anti- 
CD40 


4.4 


IBD Colitis 2 


1.3 


Monocytes rest 


12.8 


IBD Crohn's 


1.7 


Monocytes LPS 


2.4 


Colon 


9.7 


Macrophages rest 


9.2 


Lung 


27.2 


Macrophages LPS 


1.1 


Thymus 


14.2 


HUVEC none 


57.4 


Kidney 


22.8 


HUVEC starved 


100.0 







CNS_neurodegeneration_v1.0 Summary: Ag3219 Two experiments with the
same probe and primer set produce results that are in excellent agreement. This gene is
downregulated in the temporal cortex of Alzheimer's disease patients when compared with
non-demented controls (p = 0.001 when analyzed by ANCOVA, with the estimate of total cDNA
loaded per well used as a covariate). Therefore, up-regulation of this gene or its protein
product, or treatment with specific agonists for this receptor, may be of use in reversing the
dementia/memory loss and neuronal death associated with this disease.

Panel 1.3D Summary: Ag3219 Highest expression of the CG57366-01 gene is
seen in an ovarian cancer cell line (CT=29.9). Expression appears to be significant in
all the cell lines on this panel, and the gene appears to be expressed in most of the
samples on this panel, suggesting a role for this gene product in cell survival
and proliferation. Thus, expression of this gene could be used as a marker of ovarian
cancer. Furthermore, therapeutic modulation of the expression or function of this gene
product may be useful in the treatment of cancer.

Among tissues with metabolic function, this gene is expressed at moderate to low
levels in pituitary, adipose, thyroid, skeletal muscle, heart, and adult and fetal liver. This
widespread expression among these tissues suggests that this gene product may play a role
in normal neuroendocrine and metabolic function and that dysregulated expression of this
gene may contribute to neuroendocrine disorders or metabolic diseases, such as obesity
and diabetes.

This gene is also expressed at moderate to low levels in the CNS, including the
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex.
Therefore, therapeutic modulation of the expression or function of this gene may be useful
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease,
schizophrenia, multiple sclerosis, stroke and epilepsy.

Panel 2.2 Summary: Ag3219 Highest expression of the CG57366-01 gene is seen
in breast cancer (CT=29.3). In addition, significant expression is seen in a cluster of breast
cancer samples. Thus, expression of this gene could be used to differentiate between this
sample and other samples on this panel and as a marker to detect the presence of breast
cancer. Furthermore, therapeutic modulation of the expression or function of this gene
may be effective in the treatment of breast cancer.

Panel 4D Summary: Ag3219 Highest expression of the CG57366-01 gene is seen
in untreated HUVEC (CT=27.7). Expression on this panel is seen mainly in endothelial
cells and fibroblasts from lung and skin, basophils and astrocytes, and normal lung,
thymus and kidney. Thus, this gene product may be involved in pathological and
inflammatory lung and skin conditions, including asthma, emphysema, allergy and
psoriasis.

T. CG57368-01: Adenylate Cyclase Type IV

Expression of gene CG57368-01 was assessed using the primer-probe set Ag3221, 
described in Table TA. Results of the RTQ-PCR runs are shown in Tables TB, TC, TD 
and TE. 

Table TA. Probe Name Ag3221

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-aactgatgggtgctatctcctt-3' | 22 | 2390 | 275
Probe | TET-5'-cttcttcttcaccctccttgtcctgg-3'-TAMRA | 26 | 2418 | 276
Reverse | 5'-aggcggcagtagtactcattct-3' | 22 | 2450 | 277


Table TB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag3221, Run 209862298 | Tissue Name | Rel. Exp.(%) Ag3221, Run 209862298
AD 1 Hippo | 4.2 | Control (Path) 3 Temporal Ctx | 1 1 12
AD 2 Hippo | 41.5 | Control (Path) 4 Temporal Ctx | 42.9
AD 3 Hippo | 8.0 | AD 1 Occipital Ctx | 1.7
AD 4 Hippo | 13.4 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 56.6 | AD 3 Occipital Ctx | 5.3
AD 6 Hippo | 49.0 | AD 4 Occipital Ctx | 18.9
Control 2 Hippo | 13.5 | AD 5 Occipital Ctx | 33.0
Control 4 Hippo | 21.9 | AD 6 Occipital Ctx | 32.1
Control (Path) 3 Hippo | 14.7 | Control 1 Occipital Ctx | 57.8
AD 1 Temporal Ctx | 12.6 | Control 2 Occipital Ctx | 40.3
AD 2 Temporal Ctx | 20.3 | Control 3 Occipital Ctx | 23.2
AD 3 Temporal Ctx | 5.1 | Control 4 Occipital Ctx | 10.2
AD 4 Temporal Ctx | 32.8 | Control (Path) 1 Occipital Ctx | 87.7
AD 5 Inf Temporal Ctx | 62.9 | Control (Path) 2 Occipital Ctx | 40.1
AD 5 Sup Temporal Ctx | 47.3 | Control (Path) 3 Occipital Ctx | 25.5
AD 6 Inf Temporal Ctx | 56.3 | Control (Path) 4 Occipital Ctx | 71.7
AD 6 Sup Temporal Ctx | 65.5 | Control 1 Parietal Ctx | 24.0
Control 1 Temporal Ctx | 33.9 | Control 2 Parietal Ctx | 67.8
Control 2 Temporal Ctx | 27.2 | Control 3 Parietal Ctx | 18.0
Control 3 Temporal Ctx | 11.9 | Control (Path) 1 Parietal Ctx | 43.8
Control 4 Temporal Ctx | 17.8 | Control (Path) 2 Parietal Ctx | 25.9
Control (Path) 1 Temporal Ctx | 43.5 | Control (Path) 3 Parietal Ctx | 25.0
Control (Path) 2 Temporal Ctx | 62.9 | Control (Path) 4 Parietal Ctx | 100.0



Table TC. Panel 1.3D

Tissue Name | Rel. Exp.(%) Ag3221, Run 168014001 | Tissue Name | Rel. Exp.(%) Ag3221, Run 168014001
Liver adenocarcinoma | 0.4 | Kidney (fetal) | 73.2
Pancreas | 9.3 | Renal ca. 786-0 | 0.0
Pancreatic ca. CAPAN 2 | 0.4 | Renal ca. A498 | 0.7
Adrenal gland | 14.9 | Renal ca. RXF 393 | 0.0
Thyroid | 12.2 | Renal ca. ACHN | 0.0
Salivary gland | 6.5 | Renal ca. UO-31 | 0.0
Pituitary gland | 10.8 | Renal ca. TK-10 | 0.0
Brain (fetal) | 5.4 | Liver | 15.0
Brain (whole) | 11.2 | Liver (fetal) | 3.0
Brain (amygdala) | 10.2 | Liver ca. (hepatoblast) HepG2 | 0.0
Brain (cerebellum) | 12.1 | Lung | 54.7
Brain (hippocampus) | 7.3 | Lung (fetal) | 94.0
Brain (substantia nigra) | 11.7 | Lung ca. (small cell) LX-1 | 2.1
Brain (thalamus) | 11.3 | Lung ca. (small cell) NCI-H69 | 1.5
Cerebral Cortex | 10.3 | Lung ca. (s.cell var.) SHP-77 | 0.0
Spinal cord | 17.1 | Lung ca. (large cell) NCI-H460 | 0.0
glio/astro U87-MG | 0.0 | Lung ca. (non-sm. cell) A549 | 0.0
glio/astro U-118-MG | 0.0 | Lung ca. (non-s.cell) NCI-H23 | 1.3
astrocytoma SW1783 | 0.0 | Lung ca. (non-s.cell) HOP-62 | 0.0
neuro*; met SK-N-AS | 0.0 | Lung ca. (non-s.cl) NCI-H522 | 0.0
astrocytoma SF-539 | 0.0 | Lung ca. (squam.) SW 900 |
astrocytoma SNB-75 | 0.7 | Lung ca. (squam.) NCI-H596 |
glioma SNB-19 | 0.0 | Mammary gland | 100.0
glioma U251 | 0.0 | Breast ca.* (pl.ef) MCF-7 | 0.0
glioma SF-295 | 0.0 | Breast ca.* (pl.ef) MDA-MB-231 | 0.0
Heart (fetal) | 95.3 | Breast ca.* (pl.ef) T47D | 2.0
Heart | 50.3 | Breast ca. BT-549 | 0.0
Skeletal muscle (fetal) | 57.8 | Breast ca. MDA-N | 0.0
Skeletal muscle | 28.5 | Ovary | 18.2
Bone marrow | 11.6 | Ovarian ca. OVCAR-3 | 0.0
Thymus | 19.3 | Ovarian ca. OVCAR-4 | 0.0
Spleen | 27.2 | Ovarian ca. OVCAR-5 | 43.5
Lymph node | 41.8 | Ovarian ca. OVCAR-8 | 0.6
Colorectal | 5.8 | Ovarian ca. IGROV-1 | 0.0
Stomach | 10.0 | Ovarian ca.* (ascites) SK-OV-3 | 0.0
Small intestine | 21.2 | Uterus | 62.9
Colon ca. SW480 | 0.0 | Placenta | 22.2
Colon ca.* SW620 (SW480 met) | 0.0 | Prostate | 10.4
Colon ca. HT29 | 0.4 | Prostate ca.* (bone met) PC-3 | 0.0
Colon ca. HCT-116 | 0.0 | Testis | 5.0
Colon ca. CaCo-2 | 0.0 | Melanoma Hs688(A).T | 0.0
Colon ca. tissue (OD03866) | 6.7 | Melanoma* (met) Hs688(B).T | 5.5
Colon ca. HCC-2998 | 7.2 | Melanoma UACC-62 | 0.0
Gastric ca.* (liver met) NCI-N87 | 2.8 | Melanoma M14 | 0.0
Bladder | 8.5 | Melanoma LOX IMVI | 0.0
Trachea | 6.1 | Melanoma* (met) SK-MEL-5 | 0.0
Kidney | 27.f | Adipose | 42.3



Table TD. Panel 2.2 


Tissue Name 


Rel. Exp.(%) 
Ag3221, Run
173817111 


Tissue Name 


Rel. Exp.(%) Ag3221, 
Run 173817111 


Normal Colon 

Colon cancer 
(OD06064) 


8.0 


Kidney Margin 
(OD04348) 


j 100.0 


17.4 


Kidney malignant I 
cancer(OD06204B) | 


Colon Margin 
(OD06064) 


10.2 


Kidney normal adjacent 
tissue (OD06204E) r b 


Colon cancer 
(OD06159) 


2.5 


Kidney Cancer 
(OD04450-01) 


;o.o 

|l8.4 


Colon Margin 
;(OD06I59)' 


4.2 


Kidney Margin 
(OD04450-03) 


iColon cancer 
;(OD06297-04) 




Kidney Cancer 8 1 206 1 3 is.2 

1 


jColon Margin 
i(OD06297-05) 

jCC Gr.2 ascend colon 
(OD0392 1 ) 




Kidney Margin I 
8120614 J"*" 9 


6.1 


Kidney Cancel 9010320 


5.3 


jCC Margin 
|(OD03921) 


1.6 


Kidney Margin 
9010321 


4.4 


jColon cancer 
[metastasis (OD06 1 04) 


4.3 


Kidney Cancer 8 1 20607 


8.0 


jLung Margin 
[(OD06104) 


1.8 


Kidney Margin 
8120608 


17.1 


jColon mets to lung 
|(OD04451-01) 


8.6 


Normal Uterus 


47.3 


|Lung Margin 
|(OD04451-02) 


16.7 


Uterine Cancer 064011 


23.8 


[Normal Prostate 


2.3 


Normal Thyroid 


6.0 


Prostate Cancer 
(OD04410) 


3.1 


Thyroid Cancer 064010 


2.7 


Prostate Margin 
(OD04410) 


7.0 


Thyroid Cancer 
A302152 


15.9 


Normal Ovary 


>8.3 


Thyroid Margin 
\302153 


4.5 


Ovarian cancer 
(OD06283-03) ( 


>.o 


Mormal Breast 


26.6 


Ovarian Margin i 


41 (Breast Cancer 


3.8 
(OD06283-07)




(OD04566) 




Ovarian Cancer 
064008 


4.3 


Breast Cancer 1024 


20.3 


Ovarian cancer 
(OD06I45) 


2.3 


Breast Cancer 
(OD04590-0I) 


4.7 


Ovarian Margin 
(OD06145) 


164 


Breast Cancer Mets 
(OD04590-03) 


14.5 


Ovarian cancer 
(OD06455-03) 


0.9 


jBreast Cancer 

1 Metastasis (OD04655- 

■05) 


11.0 


Ovarian Margin 
(OD06455-07) 


13.0 


Breast Cancer 064006 


18.8 


Normal Lung 


26.1 


Breast Cancer 9100266 


5.3 


Invasive poor diff. Iuna 
adeno(ODO4945-0I 


3.2 


Breast Margin 9100265 


4.4 


Lung Margin 
(ODO4945-03) 


36.6 


Breast Cancer A2090 73 




Lung Malignant 
Cancer (OD03 126) 


8.2 


Breast Margin \ 
A2090734 j 18 " 2 


Lung Margin 
(OD03126) 
Lung Cancer 
(OD05014A) 


03 0 ! Breast cancer 
{(OD06083) 

5 5 j Breast cancer node 

(metastasis (OD06083) 


'2.2 ) 
9.1 


Lung Margin 
(OD05014B) 


29.1 


Normal Liver 


6.1 


Lung cancer 
(OD06081) 


4.0 


Liver Cancer 1 026 


4.7 


Lung Margin 
(OD06081) 


31.0 


Liver Cancer 1025 


13.0 


Lung Cancer 
(OD04237-01) 


1.5 


Liver Cancer 6004-T 


5.0 


Lung Margin 
(OD04237-02) 


35.1 


Liver Tissue 6004-N 


1.6 


Ocular Melanoma 
Metastasis 


3.0 


Liver Cancer 6005-T 


3.7 


Ocular Melanoma 
Margin (Liver) 


\.2 


Liver Tissue 6005-N 


7.8 


Melanoma Metastasis : 


-.7 r 


Liver Cancer 064003 


.5 


Melanoma Margin 
(Lung) 1 


.6.8 ] 


Mormal Bladder 


2.9 


Normal Kidney 


1.5 r 


3ladder Cancer 1 023 f 


.5 


Kidney Ca, Nuclear <. 
grade 2 (OD04338) 


5.1 ~1 


3Iadder Cancer 
\302173 


.0 
Kidney Margin
(OD04338) 


7.6 


Normal Stomach 


32.1 


Kidney Ca Nuclear 
grade 1/2(OD04339) 


5.6 


Gastric Cancer 9060397 


1.0 


Kidney Margin 
(OD04339) 


14.8 


Stomach Margin 
9060396 


8.5 


Kidney Ca, Clear cell 
type (OD04340) 


1 8.4 Gastric Cancer 9060395 


10.9 


Kidney Margin 
(OD04340) 


o o .Stomach Margin 
•9060394 


16.0 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


1 .5 jGastric Cancer 064005 


11.3 


Table TE. Panel 4D 


Tissue Name


Rel. Exp.(%) Ag3221,
Run 164531988


Tissue Name


Rel. Exp.(%) Ag3221,
Run 164531988


Secondary Till act 


0.1 HUVEC IL-1 beta ;?..6 


Secondary Th2 act 
[Secondary Trl act 


0.2 'HUVEC IFN gamma (25.5 
0 4 HUVEC TNF alpha + '["'" 
IFN gamma f' 1 


(Secondary Thl rest h.O HUVEC TNF alpha + ' 
1 L4 


31.9 


Secondary Th2 rest |0.7 .HUVEC IL-1 1 


21.3 


Secondary Trl rest 


1.3 


Lung Microvascular EC 
none 


48.0 


Primary Thl act 


0.2 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


100.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


47.3 


Primary Trl act 


0.6 


Microsvasular Dermal 
ECTNFalpha + IL-lbeta 


44.1 


Primary Thl rest 


1.6 


Bronchial epithelium 
TNFalpha + ILlbeta 


6.8 


Primary Th2 rest 


2.6 


Small airway epithelium 
none 


1.7 


Primary Trl rest 


1.7 


Small airway epithelium 
TNFalpha+ IL-1 beta 


6.9 


CD45RA CD4 
lymphocyte act 


3.3 


Coronery artery SMC 
rest 


1.7 


CD45RO CD4 
lymphocyte act 


0.2 


Coronery artery SMC 
TNFalpha + IL-lbeta 


2.1 


CDS lymphocyte act 


0.4 


Astrocytes rest 


0.0 


Secondary CD8 


0.5 


Astrocytes TNFalpha + 


0.0 
lymphocyte rest




IL-1 beta 




Secondary CD8 
lymphocyte act 


0.2 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


1.1 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95CH11 


3.5 


CCD1 106 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.7 


CCD 1106 
(Keratinocytes) 
TNFalpha+ IL-1 beta 


0.7 


LAK cells IL-2 


2-4 


Liver cirrhosis 


10.6 


LAK cells IL-2+IL-12 


0.0 


iLupus kidney 


6.5 


LAK cells IL-2+IFN 
gamma 


0.6 


""" "~ T ~ — 

jNCI-H292 none 


0.5 


LAK cells IL-2+ IL-18 


0.5 


INCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.2 


"1 ~— 

{NCI-H292 IL-9 


0.5 


NK Cells IL-2 rest 


0.8 


]NCI-H292 IL-13 


0.0 

o.o 


Two Way MLR 3 day 


0.7 


;NCI-H292IFN gamma " 


Two Way MLR 5 day 


0.0 


'HPAEC none 


45.1 


Two Way MLR 7 day 


0.2 


HPAE< LNF ilpha-f 1 
■ 1-1 beta HI- 2 


PBMC rest 


1.1 


|l 


ung fibroblast none 


1.5 


PBMC PWM 


0.5 


iLung fibroblast TNF 
(alpha + I L- 1 beta 


0.2 


PBMC PIIA-L 


0.5 


jLung fibroblast IL-4 


3.2 


Ramos (B cell) none 


0.0 


jLung fibroblast IL-9 


10 


Ramos (B cell) 
ionomycin 


0.0 


J Lung fibroblast IL-13 


13 


B lymphocytes PWM 


14 


jLung fibroblast I FN 
{gamma 


).2 


B lymphocytes CD40L 
and IL-4 


19 


Dermal fibroblast 

[CCD 1 070 rest 4 


.0 


EOL-1 dbcAMP 


.3 


(Dermal fibroblast 
JCCDI070 TNF alpha 8 


.0 


EOL-1 dbcAMP 
PMA/ionomycin A 


.6 


Dermal fibroblast 
|CCD 1070 IL-1 beta 5 


.3 


Dendritic cells none 1 


.3 


Dermal fibroblast I FN 
{gamma ' 


.1 


Dendritic cells LPS I 


.0 


(Dermal fibroblast IL-4 9 


7 


Dendritic cells anti- 
CD40 3 


.7 


JlBD Colitis 2 0 


7 
Monocytes rest


5.7 


IBD Crohn's 


3.0 


Monocytes LPS 


4.5 


Colon 


9.4 


Macrophages rest 


12.4 


Lung 


17.0 


Macrophages LPS 


1.4 


Thymus 


14.0 


HUVEC none 


17.9 


Kidney 


7.4 


HUVEC starved 


16.6 







CNS_neurodegeneration_v1.0 Summary: Ag3221 This panel confirms the
expression of this gene at low levels in the brains of an independent group of individuals.
However, no differential expression of this gene was detected between Alzheimer's
diseased postmortem brains and those of non-demented controls in this experiment. Please
see Panel 1.4 for a discussion of the potential utility of this gene in treatment of central
nervous system disorders.

Panel 1.3D Summary: Ag3221 Highest expression of the CG57368-01 gene is 
detected in mammary gland (CT=30). High expression of this gene is seen mainly in the 
normal tissue samples suggesting an important role in cellular function. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 
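
The Rel. Exp.(%) values in the tables above can be related to CT values with a short
calculation. The following is a minimal sketch, assuming that each value is scaled to the
sample with the lowest CT on the panel (set to 100%) and that the PCR product doubles
every cycle. Neither assumption is stated in this section, the function name is
illustrative, and every CT value other than the mammary gland CT of 30 is hypothetical.

```python
def relative_expression(ct_values):
    """Scale CT values to percent of the highest-expressing sample.

    Assumes (not stated in the text) perfect doubling per cycle, so a
    sample whose CT is one cycle higher carries half as much template.
    """
    ct_min = min(ct_values.values())
    return {name: 100.0 * 2.0 ** (ct_min - ct) for name, ct in ct_values.items()}


# Only the mammary gland CT of 30 comes from the Panel 1.3D summary;
# the other two entries are made-up values for illustration.
panel = {
    "Mammary gland": 30.0,
    "Hypothetical sample A": 31.0,
    "Hypothetical sample B": 35.0,
}

for name, percent in relative_expression(panel).items():
    print(f"{name}: {percent:.1f}")
```

Under these assumptions a one-cycle increase in CT halves the reported percentage, which is
why samples with CT values near 35 appear close to zero on a panel whose top sample has a
CT near 30.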

Panel 2.2 Summary: Ag3221 Highest expression of the CG57368-01 gene is
detected in control kidney (OD04348) sample (CT=32.7). Expression of this gene is
generally higher in the normal control margin samples as compared to the cancer tissues.
Interestingly, expression of this gene is higher in kidney cancer nuclear grade 2
(OD04338) (CT=33.5) as compared to the corresponding control sample (CT=36.4). Therefore,
expression of this gene can be used as a diagnostic marker for nuclear grade 2 kidney
cancer and therapeutic modulation of this gene could be beneficial in the treatment of
kidney cancer.
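
The size of the difference between the two CT values quoted for sample OD04338 can be
estimated with simple arithmetic. This is a rough sketch, assuming perfect doubling of
product per cycle and equal input material in both wells; the text does not state either
condition, and the function name is illustrative.

```python
def fold_difference(ct_reference, ct_sample):
    """Approximate fold difference in template implied by two CT values.

    Assumes perfect doubling per cycle and equal input; a lower CT
    means more starting template.
    """
    return 2.0 ** (ct_reference - ct_sample)


# CT values quoted in the Panel 2.2 summary for sample OD04338.
cancer_ct = 33.5
margin_ct = 36.4

print(round(fold_difference(margin_ct, cancer_ct), 1))
```

Under these assumptions the 2.9-cycle gap corresponds to roughly a seven-fold difference.
Both CT values sit close to the CT of 35 that other summaries in this section treat as
low or undetectable expression, so the estimate should be read with caution.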
Panel 4D Summary: Ag3221 Highest expression of the CG57368-01 gene is
detected in TNFalpha + IL-1beta treated lung microvascular EC (CT=28). In addition,
moderate to low expression of the gene is also seen in resting primary and secondary
Th1, Th2, Tr1, B lymphocytes, LAK cells, dendritic cells, monocytes, macrophages,
endothelial cells, eosinophils, small airway epithelium, dermal fibroblasts, lupus kidney
and normal tissues represented by colon, lung, thymus and kidney. Therefore, therapeutic
modulation of this gene through the use of small molecule drugs may be useful in the
treatment of autoimmune and inflammatory diseases such as asthma, allergies,
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and
osteoarthritis.

Panel CNS_1 Summary: Ag3221 Results from one experiment with the 
CG57368-01 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 



U. CG59955-01: GPCR

Expression of gene CG59955-01 was assessed using the primer-probe set Ag3637, 
described in Table UA. 

Table UA. Probe Name Ag3637

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-acactaggaagccccatgtact-3' | 22 | 189 | 278
Probe | TET-5'-tgtcctttgcagattcttgcttttca-3'-TAMRA | 26 | 226 | 279
Reverse | 5'-aattaatctaggggctgtggaa-3' | 22 | 254 | 280


CNS_neurodegeneration_v1.0 Summary: Ag3637 Expression of the CG59955-01
gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.)

General_screening_panel_v1.4 Summary: Ag3637 Expression of the CG59955-01
gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.)

Panel 4.1D Summary: Ag3637 Expression of the CG59955-01 gene is
low/undetectable in all samples on this panel (CTs>35). (Data not shown.)

General oncology screening panel_v_2.4 Summary: Ag3637 Expression of the
CG59955-01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not
shown.)
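
The CTs>35 criterion used in the summaries above can be written as a simple check. This
is an illustrative sketch only: the cutoff of 35 is taken from the text, while the
function name and the example CT values are hypothetical.

```python
LOW_CT_CUTOFF = 35.0  # cutoff quoted in the summaries (CTs>35)


def panel_is_undetectable(ct_values, cutoff=LOW_CT_CUTOFF):
    """Return True when every sample on a panel has a CT above the cutoff.

    An empty panel returns False, since there is nothing to evaluate.
    """
    return len(ct_values) > 0 and all(ct > cutoff for ct in ct_values)


# Hypothetical CT values for illustration only.
print(panel_is_undetectable([36.2, 38.0, 40.0]))
print(panel_is_undetectable([36.2, 33.1, 40.0]))
```

A panel is flagged only when all of its samples exceed the cutoff; a single sample at or
below a CT of 35 is enough to treat the panel as showing detectable expression.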
V. CG89211-01: GPCR

Expression of gene CG89211-01 was assessed using the primer-probe set Ag3691,
described in Table VA. Results of the RTQ-PCR runs are shown in Table VB.

Table VA. Probe Name Ag3691

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tgcattttaattcgtccagttc-3' | 22 | 464 | 281
Probe | TET-5'-cactcccgataatctatctcatctaccg-3'-TAMRA | 28 | 492 |
Reverse | 5'-atgagci 1 a ia ijgtaaa-3' | 22 | 520 |


Table VB. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag3691, Run 267739062 | Tissue Name | Rel. Exp.(%) Ag3691, Run 267739062
Colon cancer 1 | 0.0 | Bladder cancer NAT 2 | 0.0
Colon NAT 1 | 0.0 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 0.0 | Bladder cancer NAT 4 | 0.0
Colon cancer NAT | 0.0 | Adenocarcinoma of the prostate 1 | 100.0
Colon cancer 3 | 0.0 | Adenocarcinoma of the prostate 2 | 0.0
Colon cancer NAT 3 | 0.0 | Adenocarcinoma of the prostate 3 | 0.0
Colon malignant cancer 4 | 0.0 | Adenocarcinoma of the prostate 4 | 0.0
Colon normal adjacent tissue 4 | 0.0 | Prostate cancer NAT 5 | 0.0
Lung cancer 1 | 0.0 | Adenocarcinoma of the prostate 6 | 0.0
Lung NAT 1 | 0.0 | Adenocarcinoma of the prostate 7 | 67.4
Lung cancer 2 | 0.0 | Adenocarcinoma of the prostate 8 | 0.0
Lung NAT 2 | 0.0 | Adenocarcinoma of the prostate 9 | 0.0
Squamous cell carcinoma 3 | 0.0 | Prostate cancer NAT 10 | 0.0
Lung NAT 3 | 0.0 | Kidney cancer 1 | 0.0
metastatic melanoma 1 | 0.0 | Kidney NAT 1 | 0.0
Melanoma 2 | 0.0 | Kidney cancer 2 | 0.0
Melanoma 3 | 0.0 | Kidney NAT 2 | 0.0
metastatic melanoma 4 | | Kidney cancer 3 |
metastatic melanoma 5 | | Kidney NAT 3 |
Bladder cancer NAT 1 | | Kidney cancer 4 |
 | | Kidney NAT 4 |


CNS_neurodegeneration_v1.0 Summary: Ag3691 Expression of the CG89211-01
gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.)

General_screening_panel_v1.4 Summary: Ag3691 Expression of the CG89211-01
gene is low/undetectable in all samples on this panel (CTs>35). (Data not shown.)

Panel 4.1D Summary: Ag3691 Expression of the CG89211-01 gene is
low/undetectable in all samples on this panel (CTs>35). (Data not shown.)

General oncology screening panel_v_2.4 Summary: Ag3691 Expression of the
CG89211-01 gene is low/undetectable in all samples on this panel (CTs>35). (Data not
shown.)



W. CG90530-02: Ubiquitin-Conjugating Enzyme

Expression of full length physical clone CG90530-02 was assessed using the 
primer-probe sets Ag3410 and Ag4036, described in Tables WA and WB. Results of the 
RTQ-PCR runs are shown in Tables WC, WD, WE and WF. 



Table WA. Probe Name Ag3410

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tgagaggtacccatttgaacct-3' | 22 | 372 | 284
Probe | TET-5'-cctcagatccgatttctcactccaat-3'-TAMRA | 26 | 346 | 285
Reverse | 5'-aaatccttccagcagaatcaat-3' | 22 | 311 | 286


Table WB. Probe Name Ag4036

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tgagaggtacccatttgaacct-3' | 22 | 372 | 287
Probe | TET-5'-cctcagatccgatttctcactccaat-3'-TAMRA | 26 | 346 | 288
Reverse | 5'-aaatccttccagcagaatcaat-3' | 22 | 311 | 289


Table WC. CNS_neurodegeneration_v1.0




WO 03/010327 



PCT/US02/14199 



Tissue Name 


Rel. Exp.(%) Ag3410, 
Run 210350165 


Tissue Name 


Rpl Fxn f%\ 

Ag3410, Run 
210350165 


AD 1 Hippo 


4.7 


Control (Path) 3 Temporal 
Ctx 


1.0 


AD 2 Hippo 


17.4 


Control (Path) 4 Temporal 


28.7 


AD 3 Hippo 


2.1 


~ .-— - 

AD 1 Occipital Ctx 




AD 4 Hippo 


2.2 


AD 2 Occipital Ctx 


0.0 


AD 5 Hippo 


80.7 


AD 3 Occipital Ctx 


___ .. 


AD 6 Hippo 


32.3 


AD 4 Occipital Ctx 


14.1 


Control 2 Hippo 


30.4 


AD 5 Occipital Ctx 


48.3 


Control 4 Hippo 


0.8 


AD 6 Occipital Ctx 


16T2 


Control (Path) 3 
Hippo 


0.9 


Control 1 Occipital Ctx 


0.5 


AD 1 Temporal 

Ctx 


4.5 


Control 2 Occipital Ctx 


76.8 


^AD 2 Temporal 

jCtx 


29.3 


Control 3 Occipital Ctx 


7.0 


iAD 3 Temporal 
Ctx 


1.7 


Control 4 Occipital Ctx 


1.9 


AD 4 Temporal 
Ctx 


8.9 


Control (Path) 1 Occipital 
Ctx 


84.1 


AD 5 Inf 
Temporal Ctx 


100.0 


Control (Path) 2 Occipital 
Ctx 


4.5 


AD 5 Sup 
Temporal Ctx 


20.7 


Control (Path) 3 Occipital 
Ctx 


0.0 


AD 6 Inf 
Temporal Ctx 


29.3 


Control (Path) 4 Occipital 
Ctx 


7.7 


AD 6 Sup 
Temporal Ctx 


30.8 


Control 1 Parietal Ctx 


2.0 


Control 1 
Temporal Ctx 


0.5 


Control 2 Parietal Ctx 


23.2 


Control 2 
Temporal Ctx 


31.9 


Control 3 Parietal Ctx 


5.8 


Control 3 
Temporal Ctx 


3.1 


Control (Path) 1 Parietal 
Ctx 


73.2 


Control 3 
Temporal Ctx 


.4 1 


Control (Path) 2 Parietal 
Ctx 


4.5 


Control (Path) 1 
Temporal Ctx 


56.3 


Control (Path) 3 Parietal 
Ctx 


.7 


Control (Path) 2
Temporal Ctx


.3.7


Control (Path) 4 Parietal
Ctx


6.6

Table WD. General_screening_panel_v1.4



Tissue Name 


Rel. Exp.(%) 
Ag34I0, Rui 
216839094 


Rel. 

Exp.(%) 
Ag4036, 
Run 

220283343 


Tissue Name 


IRel. 
|Exp.(%) 
|Ag3410, 
;Run 

|2 1 6839094 


Rel. 

Exp.(%) 
Ag4036, 

2202833 
43 


Adipose 


,0.1 


0.3 




Renal ca. TK-10 


|I4.0~ 


21.9 


Melanoma- Hs6SS(AiT 


j2.0 


|2.5 




Bladder 


2.2 


|1.7 


Melanoma* Hs688(B).T 


2.2 


1.9 




Gastric ca. (liver j 
met.) NCI-N87 ! 


13.2 


^Melanoma* M 1 4 


45.1 


34.9 


iGastric ca. KATO 
[ill 


•85.3 




90.1 


Melanoma* LOXIMVI 


11.0 


17.4 




Colon ca. SW-948 


!l3.6 


17.2 


Melanoma* SK-MEL-5 


55.5 


44.8 


Colon ca. SW480 


1 100.0 


|1000 


Squamous cell 
carcinoma SCC-4 
Testis Pool 


14.2 

2.r™" 


22.1 
8.0 


(Colon ca.* (SW480 
! met) SW620 
Colon ca. HT29 


57.0 
2 1 .9 


•49 1 

V 


Prostate ca.* (bone met) 
PC-3 


3, 


5.7 


Colon ca. HCT-1 16 


j82.9 


86.5 


Prostate Pool 


0.3 


0.0 


Colon ca. CaCo-2 


|I7.8 


~>\ 0 i 


Placenta 


0.7 


0.8 


jColon cancer tissue 


[4.2 


3.6 j 


Uterus Pool 


0.2 


0.0 


jColonca. SW1116 




4.3 


Ovarian ca. OVCAR-3 


30.8 |21.5 


jColon ca. Colo-205 


12.9 


\Z5 


Ovarian ca. SK-OV-3 


33.7 


31.4 


~]Colon ca. SW-48 


14.9 


26.2 


< * inan c ; CA L AR--1 


27.2 


30.6 


{Colon Pool 


0.9 


0.8 


Ovarian ca. OVCAR-5 


28.3 i24.7 


JSmall Intestine Pool 


0.3 


0.2 


Ovarian ca. IGROV-1 


9.4 


9.5 


| Stomach Pool 


0.4 


0.3 


Ovarian ca. OVCAR-8 


8.2 


6.2 


]Bone Marrow Pool 


0.1 


0.1 


Ovary 


0.6 


9.6 


[Fetal Heart 


5.3 


5.0 


Breast ca. MCF-7 


32.1 


25.2 


{Heart Pool 


0.5 


0.5 


Breast ca. MDA-MB- 
231 


47.0 


51.8 


jLymph Node Pool 


2.1 


1.5 


Breast ca. BT 549 


55.1 


50.7 


(Fetal Skeletal 
(Muscle 


1.1 


1.2 


Breast ca. T47D 


51.8 


3.9 


Skeletal Muscle 
jPool 


2.2 


.5 


Breast ca. MDA-N 


14.5 


5.9 


|Spleen Pool 


0.7 


).6 


Breast Pool 


).8 ( 


.6 


(Thymus Pool 


2.9 


3.8 


Trachea 


).5 ( 


.5 


CNS cancer
(glio/astro) U87-MG


27.9


7.8






Lung 


0.1 


0.0 


CNS cancer 
(glio/astro) U- 11 8- 
MG 


49.0 


48.6 


Fetal Lung 


3.1 


2.2 


CNS cancer 
(neuro;met) SK-N- 
AS 


41.8 


47.0 


Lung ca. NCI-N417 


27.9 


25.0 


CNS cancer (astro) 
SF-539 


28.9 


6.8 


Lung ca. LX-1 


40.9 


38.7 


CNS cancer (astro) 
SNB-75 


34.4 


29.7 


Lung ca. NCI-H146 


4.4 


6.5 


CNS cancer (glio) 
SNB-19 


9.1 


7.6 


Lung ca. SHP-77 


59.5 


1 1.4 


CNS cancer (glio) 
SF-295 


4.6 


5.4 


iLung ca. A549 


17.6 


26.1 


Brain (Amygdala) 
Pool 


1.6 




Lung ca. NCI-H526 


11.6 


12.0 


Brain (cerebellum) 


1.5 


0.9 


Lung ca. NCI-H23 


14.6 


22.1 


Brain (fetal) 


3.6 


3.6 H 


Lung ca. NCI-H460 


2.5 


2.4 


(Hippocampus) Pool 


1.3 


1.2 


Lungca. HOP-62 


3.3 


4.1 


Cerebral Cortex 
Pool 


2.4 


1 4 


Lung ca. NCI-H522 


17.4 


23.5 


Brain (Substantia 
nigra) Pool 


1.8 


1.0 


Liver 




0.1 


0.1 


Brain (Thalamus) 
Pool 


2.4 


2.0 


Fetal Liver 


16.0 


12.2 


Brain (whole) 


2.8 


2.4 


Liver ca HepG2 






Spinal Cord Pool 






Kidney Pool 


0.5 


0.3 


Adrenal Gland 


0.2 


0.4 


Fetal Kidney 


5.6 


4.0 


Pituitary gland Pool 


0.1 


0.3 


Renal ca. 786-0 


16.5 


21.0 


Salivary Gland 


0 1 


0.1 


Renal ca. A498 


3.1 


2.8 


Thyroid (female) 


0.4 


0.2 


Renal ca. ACHN 


10.2 


10.8 


Pancreatic ca. 
CAPAN2 


52.5 


39.2 


Renal ca. UO-3 1 


4.7 


7.6 


Pancreas Pool 


1.1 


1.0 



Table WE. Panel 4D



Tissue Name 


Rel. Exp.(%) 
Ag3410, Run 
165296375 


Tissue Name 


Rel. Exp.(%) 
Ag3410, Run 
165296375 


Secondary Th 1 act 


29.5 


HUVEC IL-lbeta 


3.4 


Secondary Th2 act 


19.5 


HUVEC I FN gamma 


6.2 
Secondary Tr1 act


29.9 


HUVEC TNF alpha + IFN 
gamma 


5.1 


Secondary Thl rest 


0.8 


HUVEC TNF alpha + IL4 


4.6 


Secondary Th2 rest 




HUVEC IL-1 1 


4.0 


Secondary Trl rest 


1.1 


Lung Microvascular EC none 


3.8 


[Primary Thl act 


10.6 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


3.8 


Primary Th2 act 


1 1.7 


Microvascular Dermal EC none 


6.0 


Primary Trl act 


25.7 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


3.7 


[Primary Thl rest ' 


23.7 


Bronchial epithelium TNFalpha 
+ ILIbeta 


1.6 


Primary Th2 rest 


4.5 


Small airway epithelium none 


0.3 


iPrimary Trl rest 


20.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


5.1 


CD45RA CD4 
lymphocyte act 


13.1 


Coronery artery SMC rest 


2, 


CD45ROCD4 J 
lymphocyte act ] " 


Coronery artcrv SMC ! 
TNFalpha + IL-lbeta | J " 


;CD8 lymphocyte act 13.9 


Astrocytes rest 1 1 .3 


Secondary CD8 
lymphocyte rest 


20.2 


Astrocytes TNFalpha + IL- 
lbeta 


0.6 


Secondary CD8 
[lymphocyte act 


13.7 


KU-812 (Basophil) rest 


10.9 


jCD4 lymphocyte 
(none 


0.1 


KU-8 12 (Basophil) 
PMA/ionomycin 


21.8 


|2ry 

Thl/Th2/Trl anti- 
{CD95CH11 


3.0 


CCD1 106 (Keratinocytes) none 


12.4 


|LAK cells rest 


2.0 


CCD1106(Keratinocytes) 
TNFalpha + IL-lbeta 


2.6 


jLAK cells IL-2 


9.5 


Jver cirrhosis 


0.0 


jLAK cells IL-2+IL- 
12 


19.9 


^upus kidney 


0.1 


LAK cells IL-2+IFN 
gamma 


23.0 


NCI-H292 none 


2.2 


LAK cells IL-2+ IL- 
18 


17.3 


NCI-H292 IL-4 


30.8 


LAK cells 
PMA/ionomycin 


3.8 


VCI-H292 IL-9 


15.7 


NK Cells IL-2 rest 


0.6 


MCI-H292 IL-13 


)9 


Two Way MLR 3 
day 


5.0 


MCI-H292 IFN gamma 


2.4 
Two Way MLR 5
day 


9.0 


HPAEC none 


X.5 


Two Way MLR 7 
day 


7.1 


HPAEC TNF alpha + IL-1 beta 


2.5 


PBMC rest 


0.2 


Lung fibroblast none 


1.7 


PBMC PWM 


47.0 


Lung fibroblast TNF alpha + \ n ] 
IL-1 beta 


PBMC PHA-L 


18.2 


Lung fibroblast 1L-4 2.3 


Ramos (B cell) none i23.7 


Lung fibroblast 1L-9 3.4 


Ramos (B cell) 
ionomycin 


39.5 


Lung fibroblast 1L- 13 19 

, 


B lymphocytes 
PWM 


100.0 


Lung fibroblast IFN gamma j 1 .2 


B lymphocytes 
CD40L and 1L-4 


SDermal fibroblast CCD 1 070 . „ 
9 3 ;2'-U 
yj jrest 


EOL-1 dbcAMP 


SDermal fibroblast CCD 1 070 i . 
4 8 jTNF alpha ; ' 


EOL-1 dbcAMP 
PMA/ionomycin 


iDermal fibroblast CCD 1 070 1L- g 
4 - 7 jl beta ' i 


Dendritic cells none |0.8 


(Dermal fibroblast IFN gamma 
Dermal fibroblast 1L-4 


7.0 


Dendritic cells LPS 


X).l 


5.8 


: Dendritic cells anti- 

CD40 


0.3 |l BD Colitis 2 


0.4 


Monocytes rest ;0.1 JlBD Crohn's 


0.1 


Monocytes LPS 


0.0 Colon 


0.9 


Macrophages rest 


2.7 jLung 


0.6 


Macrophages LPS 


0.1 jThymus 


0.4 


HUVEC none 


8.0 kidney 


8.1 


HUVEC starved 


28.1 J 





Tissue 
Name 


Rel.Exp.(%) 
Ag3410, Run 
267171484 


Rel. Exp.(%) 
Ag4036, Run 
268362917 


Tissue Name 


Rel. Exp.(%) 
Ag3410, Run 
267171484 


Rel. Exp.(%) 
Ag4036, Run 
268362917 


Colon 
cancer 1 


34.4 


11.3 


Bladder cancer NAT 2 


0.1 


0.0 


Colon 

cancer NAT 
1 


12.3 


10.8 


Bladder cancer NAT 3 


0.1 


0.0 


Colon 
cancer 2 


42.9 


7.1 


Bladder cancer NAT 4 


0.6 


0.0 


Colon 

cancer NAT 


2.9 


7.0 


Adenocarcinoma of 
the prostate 1 


3.2 


0.0 


Colon
cancer 3


70.2 


47.3 


Adenocarcinoma of 
the prostate 2 


0.7 


0.0 


Colon 

cancer NAT 
3 


14.6 


18.0 


Adenocarcinoma of 
the prostate 3 


0.9 


0.0 


Colon 
cancer 4 






Adenocarcinoma of 
the prostate 4 


11.7


6.6 


normal 
adjacent 
tissue 4 


6.2 


1.7 


Prostate cancer NAT 5 


1.3 


0.6 


Lung cancer 
1 






Adenocarcinoma of 
the prostate 6 


0.5 


0.0 


Lung NAT 
1 


0.1 


0.0 


Adenocarcinoma of 
the prostate 7 


0.8 


0.0 


Lung cancer 
2 


100.0 


100.0 


Adenocarcinoma of 
the prostate 8 


0.1 


0.0 


Lung NAT 
2 


0.1 


0.0 



Adenocarcinoma of 
the prostate 9 


5.3 


1.7 


Squamous 
cell 

carcinoma 3 


26.2 


17.6 


Prostate cancer NAT 
10 




0.1 



0.0 


Lung NAT 
3 


0.2 


0.0 


Kidney cancer 1 


4.6 


3.3 


metastatic 
melanoma 1 


0.9 


0.0 


Kidney NAT 1


0.6 


0.5 


Melanoma 2 


0.2 


0.0 


Kidney cancer 2 


21.0 


15.0 


Melanoma 3 


0.9 


0.0 


Kidney NAT 2 


2.9 


2.0 


metastatic 
melanoma 4 


7.6 


3.4 


Kidney cancer 3


2.8 


1.5 


metastatic 
melanoma 5 


7.7 


3.1 


Kidney NAT 3


0.7


0.7 


Bladder 
cancer 1 


0.3 


0.0 


Kidney cancer 4 


2.1 


2.0 


Bladder 
cancer NAT 
1 


0.0 


0.0 


Kidney NAT 4 


0.6 


1.0 


Bladder 
cancer 2 


0.4 


0.2 









CNS_neurodegeneration_v1.0 Summary: Ag3410 This panel does not show
differential expression of the CG90530-02-01 gene in Alzheimer's disease. However, this
expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for
discussion of the utility of this gene in the central nervous system.

General_screening_panel_v1.4 Summary: Ag3410/Ag4036 Two experiments
with the same probe and primer set produce results that are in excellent agreement.
Highest expression of the CG90530-02 gene is seen in a colon cancer cell line (CTs=25-26).
In addition, expression is significantly higher in all the cancer cell lines on this panel
when compared to expression in the normal tissues. Thus, expression of this gene could be
used as a marker of cancer. In addition, this gene is expressed at much higher levels in
fetal lung, liver, kidney and heart tissue (CTs=28-30) when compared to expression in the
adult counterparts (CTs=33-36). Thus, expression of this gene may be used to differentiate
between the fetal and adult sources of these tissues. This gene encodes a putative member
of the ubiquitin conjugating enzyme family. Ubiquitin-dependent protein degradation
plays a role in many cellular processes and has been shown to be upregulated in some
cancers (Eliseeva E, Cell Growth Differ 2001 Aug;12(8):427-33). Furthermore, the higher
levels of expression of this gene in cancer cell lines and fetal tissues suggest that
therapeutic modulation of the expression or function of this gene may be useful in the
treatment of cancer.
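
The CT ranges quoted in the summary can be converted into an approximate fold difference in transcript level. The following is a minimal sketch that assumes ideal amplification, with product doubling in each cycle; the text does not state the amplification efficiency, and the function name `fold_difference` and its `efficiency` parameter are illustrative assumptions rather than part of the described method.

```python
def fold_difference(ct_low_expression, ct_high_expression, efficiency=2.0):
    """Approximate fold difference in template between two samples.

    A lower CT means more starting template. `efficiency` is the assumed
    per-cycle amplification factor (2.0 = perfect doubling, an assumption).
    """
    delta_ct = ct_low_expression - ct_high_expression
    return efficiency ** delta_ct


# Adult tissue at CT=33 versus fetal tissue at CT=28, taken from the
# low ends of the CT ranges quoted in the summary.
print(fold_difference(33, 28))
```

Under this assumption, a gap of five cycles corresponds to roughly a 32-fold difference, which is the scale of the fetal versus adult contrast described above.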

Among tissues with metabolic function, this gene is expressed at moderate to low
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal
muscle, heart, and liver. This widespread expression among these tissues suggests that this
gene product may play a role in normal neuroendocrine and metabolic function and that
dysregulated expression of this gene may contribute to neuroendocrine disorders or
metabolic diseases, such as obesity and diabetes.

This gene is also expressed at moderate levels in the CNS, including the
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex.
Therefore, therapeutic modulation of the expression or function of this gene may be useful
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease,
schizophrenia, multiple sclerosis, stroke and epilepsy.

Panel 4D Summary: Ag3410 Highest expression of the CG90530-02 gene is seen in
PWM stimulated B lymphocytes (CT=23.7). High levels of expression are also seen in
activated T cells, LAK cells, dermal fibroblasts and the pulmonary mucoepidermoid cell
line NCI-H292. Thus, therapeutic regulation of the transcript or the protein encoded by the
transcript could be important in immune modulation and in the treatment of T and B
cell-mediated diseases such as asthma, arthritis, psoriasis, IBD, and lupus.

General oncology screening panel_v_2.4 Summary: Ag3410/Ag4036 Two 
experiments with the same probe and primer set produce results that are in excellent 
agreement. Highest expression of the CG90530-02 gene is seen in lung cancer (CTs=28- 
31). In addition, this gene appears to be overexpressed in lung, kidney and colon cancers 
when compared to expression in normal adjacent tissue. Thus, expression of this gene 
could be used as a marker of lung and colon cancer. Ubiquitinylation is a cyclical process 
operating in all cells to target specific proteins (e.g., p53) for degradation. Abnormal
accumulations of ubiquitinylated proteins have been identified in colorectal carcinoma. 
This gene encodes a putative ubiquitin conjugating enzyme. Therefore, therapeutic 
modulation of the expression or function of this gene could be effective in the treatment of 
lung, kidney and colon cancer. 

X. CG93076-01: GPCR 

Expression of full length physical clone CG93076-01 was assessed using the 
primer-probe set Ag2129, described in Table XA. Results of the RTQ-PCR runs are
shown in Tables XB and XC. 



Table XA. Probe Name Ag2129

Primers   Sequences                                     Length   Start Position   SEQ ID No
Forward   5'-gcaacagcatggtgatctg-3'                     19       133              290
Probe     TET-5'-ctttcgaatgcacaggaaccccttct-3'-TAMRA    26       161              291
Reverse   5'-cgccaggttgaggatatagat-3'                   21       189              292



Table XB. Panel 1.3D



Tissue Name 


Rel. Exp.(%) Ag2129,
Run 148719315 


Tissue Name 


Rel. Exp.(%) Ag2129, 
Run 148719315 


Liver 

adenocarcinoma 


11.7 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. 
CAPAN 2 


0.0 


Renal ca. A498 


22.8 


Adrenal gland


6.8


Renal ca. RXF 393 


0.0 


Thyroid 


0.0 


Renal ca. ACHN 


0.0


Salivary gland 


0.0 


Renal ca. UO-31


0.0 
Pituitary gland


1 I.X 


Renal ca. TK-10


11.4 


Brain (fetal) 


51.8 


Liver 


0.0 


Brain (whole) 


14.9


Liver (fetal) 


24.0 


Brain (amygdala) 


14.2 


Liver ca. (hepatoblast) 
HepG2 


9.5 


Brain (cerebellum)


55.1


Lung


0.0 


Brain
(hippocampus)


11.1


Lung (fetal)


24.3 


Brain (substantia 
nigra) 


0 q jLung ca. (small cell) 
|LX-1 


0.0 


Brain (thalamus) 


12.2


Lung ca. (small cell)
NCI-H69


0.0 


Cerebral Cortex


nn iLungca. (s.cell var.) 

jSHP-77 


0.0 


Spinal cord


0 o 'Lung ca. (large 
|cell)NCI-H460 


0.0 


!glio/astroU87-MG 

glio/astroU-1 18- 
MG 


0 o l Lu "g ca - (non-sm. 

!cell)A549 

nn [Lung ca. (non-s.cell) 
INCI-H23 


24.8 
100.0 


astrocytoma jLunn ca. (non-s.cell) 
:SW1783 |hoP-62 


0.0 


neuro*; met SK-N- 
AS 


e 5 jLung ca. (non-s.cl) 
|NCI-H522 


9.2 


astrocytoma SF-539 


0 0 jLung ca. (squam.) SW 
|900 


0.0 


astrocytoma SNB- 
75 


n n jLung ca. (squam.) 

]NC1-H596 


0.0 


glioma SNB-19 


0.0


Mammary gland


4.3 


glioma U251 


j , r Breast ca.* (pl.ef) 
|MCF-7 


0.0 


glioma SF-295 


87 (Breast ca.* (pl.ef) 
|MDA-MB-2 5 1 


0.0 


Heart (fetal) 


jq jBreastca* (pl.ef) 
JT47D 


0.0 


Heart 


3.0


Breast ca. BT-549


0.0 


Skeletal muscle 
(fetal) 


78.5 


Breast ca. MDA-N 


9.8 


Skeletal muscle 


).0 


Ovary 


0.0 


Bone marrow 


2.5 


Ovarian ca. OVCAR-3 


10.4 


Thymus 


4.9 


Ovarian ca. OVCAR-4 


0.0


Spleen


).0 


Ovarian ca. OVCAR-5 


12.2 
Lymph node


28.7 


Ovarian ca. OVCAR-8 


16.5 


Colorectal 


76.3 


Ovarian ca. IGROV-1 


22.1 


Stomach 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


81.8 


Colon ca. SW480 


19.2 


Placenta 


0.0 


Colon ca.*
SW620 (SW480
met)


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116


0.0 


Testis 


55.1 


Colon ca. CaCo-2 


0.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC- 
2998 


0.0 


Melanoma UACC-62 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


14.5


Melanoma M14


0.0 


Bladder 


0.0 


Melanoma LOX IMVI 


0.0 


Trachea 


q Q jMelanoma* (met) SK- 
•MEL-5 


16.5 


Kidney 


0.0


Adipose 


20.3 



Table XC. Panel 4D



Tissue Name 


Rel. Exp.(%) 
Ag2129, Run 
165823129 


Tissue Name 


Rel. Exp.(%) 
Ag2129, Run 
165823129 


Secondary Th1 act


3.0


HUVEC IL-1beta


0.0 


Secondary Th2 act 


12.6 


HUVEC IFN gamma


3.8 


Secondary Tr1 act


7.5


HUVEC TNF alpha + IFN
gamma


2.6 


Secondary Th1 rest


12.3 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


8.7 


HUVEC IL-11


0.0


Secondary Tr1 rest


15.7 


Lung Microvascular EC none 


0.0 


Primary Th1 act


0.0


Lung Microvascular EC
TNFalpha + IL-1beta


0.0 


Primary Th2 act 


2.3 


Microvascular Dermal EC none 


0.0 


Primary Tr1 act


5.2


Microvascular Dermal EC
TNFalpha + IL-1beta


0.0 


Primary Th1 rest


84.7


Bronchial epithelium TNFalpha
+ IL-1beta


4.8


Primary Th2 rest


34.6 


Small airway epithelium none 


0.0 


Primary Tr1 rest


17.1


Small airway epithelium
TNFalpha+IL-1beta


5.7 


CD45RA CD4 
lymphocyte act 


0.0 


Coronary artery SMC rest


4.3 


CD45RO CD4
lymphocyte act


6.5


Coronary artery SMC TNFalpha
+ IL-1beta


0.0 
0.0 
0.0 


CD8 lymphocyte act 
Secondary CD8 
lymphocyte rest 


2.4 
9.0 


Astrocytes rest 

Astrocytes TNFalpha + IL-1beta


Secondary CD8
lymphocyte act 


1.3 


KU-812 (Basophil) rest


- 

0.0 


CD4 lymphocyte
none


39.8 


KU-812 (Basophil)
PMA/ionomycin 




2ry

Th1/Th2/Tr1 anti-
CD95 CH11


45.7 


CCD1106 (Keratinocytes) none


0.0 


LAK cells rest


9.8 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


0.0 


LAK cells IL-2


31.9


Liver cirrhosis


100.0


LAK cells IL-2+IL-
12


17.6


Lupus kidney


0.0


LAK cells IL-2+IFN


20.0


NCI-H292 none


7.3


LAK cells IL-2+ IL-
18


13.3


NCI-H292 IL-4


10.6 


LAK cells 
PMA/ionomycin 


10.1 


NCI-H292 IL-9


4.1 


NK Cells IL-2 rest 


8.5 


NCI-H292 IL-13


6.3 


Two Way MLR 3 

day 


9.2 


NCI-H292 IFN gamma


1.2 


Two Way MLR 5 
day 


2.5 


HPAEC none 


0.0 


Two Way MLR 7 
day 


0.0 


HPAEC TNF alpha + IL-1 beta 


0.0 


PBMC rest 


S 1 


Lung fibroblast none 


2.5 


PBMC PWM 


3.8 


Lung fibroblast TNF alpha + IL-
1 beta


9.0 


PBMC PHA-L 


1.7 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13


0.0 


B lymphocytes 


5.8 


Lung fibroblast IFN gamma 


0.0 
B lymphocytes
CD40L and IL-4 



EOL-1 dbcAMP 
PMA/ionomycin 



Dendritic cells none 
Dendritic cells LPS 



Dendritic cells anti- 
CD40 



Monocytes rest 



Monocytes LPS 



Dermal fibroblast CCD 1070 rest 



Dermal fibroblast CCD 1070
TNF alpha



Dermal fibroblast CCD 1070 IL-1
beta



Dermal fibroblast IFN gamma
Dermal fibroblast IL-4

IBD Colitis 2



Macrophages rest 



Macrophages LPS 



Thymus 
Kidney 



HUVEC starved 



Panel 1.3D Summary: Ag2129 Expression of the CG93076-01 gene is restricted
to a sample derived from a lung cancer cell line (CT=34.3). Thus, expression of this gene
could be used to differentiate between this sample and other samples on this panel and as a
marker to detect the presence of lung cancer. Furthermore, therapeutic modulation of the
expression or function of this gene may be effective in the treatment of lung cancer.

Panel 4D Summary: Ag2129 Expression of the CG93076-01 gene is highest in 
liver cirrhosis (CT=31.8). Low but significant levels of expression are seen in primary 
resting T cells, colon, kidney, lung, LAK cells, and untreated CD4s. Expression of this 
gene is not detected in normal liver in Panel 1.3D, suggesting that its expression is unique 
to liver cirrhosis. This gene encodes a putative GPCR; therefore, antibodies or small 
molecule therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. In 
addition, antibodies to this putative GPCR could also be used for the diagnosis of liver 
cirrhosis. 



Y. CG94235-01: Thymidylate Kinase

Expression of gene CG94235-01 was assessed using the primer-probe sets Ag1980
and Ag3909, described in Tables YA and YB. Results of the RTQ-PCR runs are shown in 
Tables YC, YD, YE, YF, YG, YH, YI and YJ. 

Table YA. Probe Name Ag1980

Primers   Sequences                                     Length   Start Position   SEQ ID No
Forward   5'-gggtacgatggctgaagtaaa-3'                   21       1437             293
Probe     TET-5'-ccagttttctgccacacaacatgctt-3'-TAMRA    26       1394             294
Reverse   5'-ttatgcagtgttcccaaatttc-3'                  22       1364             295


Table YB. Probe Name Ag3909

Primers   Sequences                                      Length   Start Position   SEQ ID No
Forward   5'-caggtgccacgtctaactagat-3'                   22       1307             296
Probe     TET-5'-tgttgtttgaaacatctacatccacca-3'-TAMRA    27       1333             297
Reverse   5'-gaaatttgggaacactgcataa-3'                   22       1364             298



Table YC. AI_comprehensive panel_v1.0



Tissue Name 


Rel. Exp.(%)
Ag1980, Run
211061884


Rel. Exp.(%)
Ag1980, Run
212317511

11.6


Tissue Name 

112427 Match 
Control Psoriasis-F 


Rel. Exp.(%)
Ag1980, Run
211061884


Rel. Exp.(%)
Ag1980, Run
212317511


110967 COPD-
F


10.1 


100.0 


100.0 


110980 COPD-
F


12.5 

r 9 


16.8 

' 


112418 Psoriasis-M


19.1


18.7 


110968 COPD-
M




112723 Match
Control Psoriasis-M


7.2 


110977 COPD- 
M 


50.3 


32.8 


112419 Psoriasis-M


35.6 


17.0 


110989 

Emphysema-F 


20.7 


24.5 


112424 Match 
Control Psoriasis-M 


7.6 


10.1 


110992 

Emphysema-F 


12.2 


6.0 


112420 Psoriasis-M 


54.0 


53.2 


110993 

Emphysema-F 


12.9 


8.7 


112425 Match 
Control Psoriasis-M 


62.4 


72.7 


110994 

Emphysema-F 


7.0 


5.8 


104689 (MF) OA 
Bone-Backus 


66.0 


56.3 


110995 

Emphysema-F


17.0 


17.6 


104690 (MF) Adj 
"Normal" Bone- 
Backus 


34.6 


24.0 


110996 

Emphysema-F 


3.3 


7.0 


104691 (MF) OA 
Synovium-Backus 


36.6 


40.9 


110997 
Asthma-M 


8.0 


3.1 


104692 (BA) OA
Cartilage-Backus 


12.1 


6.8 


111001 
Asthma-F 


21.8 


15.8 


104694 (BA) OA
Bone-Backus 


51.8 


36.3 


111002 


24.0 


13.1 


104695 (BA) Adj


26.4 


19.5 
Asthma-F






"Normal" Bone- 
Backus 






111003 Atopic 
Asthma-F 


20.3 


22.5 


104696 (BA) OA 
Synovium-Backus 


63.7 


40.9 


111004 Atopic 
Asthma-F 


26.1 


28.5 


104700 (SS) OA 
Bone-Backus 


20.9 


62.0 


111005 Atopic
Asthma-F 


24.5 


10.3 


104701 (SS)Adj 
"Normal" Bone- 
Backus 


28.7 


25.7 


111006 Atopic
Asthma-F 


11.0


8.1 


104702 (SS)OA 
Synovium-Backus 


80.1 


59.5 


111417 
Allergy-M 


8.7 


7.7 


117093 OA Cartilage
Rep7 


17.8 


20.9 


112347 
Allergy-M 


0.0 


0.0 


112672 OA Bone5 


36.6 


33.9 


112349 Normal 
Lung-F 


0.0 


0.0 


112673 OA 
Synovium5 


21.3 


22.4 


112357 Normal
Lung-F 


67.4 


64.6 


112674 OA Synovial 
Fluid cells5


20.3 


20.0 


112354 Normal
Lung-M 


9.8 


15.6 


117100 OA Cartilage 
Rep 14 




6.9 


112374
Crohns-F


11.1 


15.0 


112756 OA Bone9


95.3 


66.0 


1 1 ov iviar.cn 
Control 
Crohn s-F 


10.4 


18.4 


112757 OA 
Synovium9 


17.4 


19.2 


112375 
Crohns-F


14.5 


9.9 


112758 OA Synovial
Fluid Cells9 


14.1 


17.8 


112732 Match

Control 

Crohns-F 


29.3 


28.5 


117125 RA Cartilage 
Rep2 


19.9 


22.5 


112725 
Crohns-M 


5.1 


3.6 


113492 Bone2 RA 


76.3 


66.0 


112387 Match

Control 

Crohns-M 


14.4 


15.7 


113493 Synovium2 
RA 


20.9 


19.3 


112378 
Crohns-M 


0.0 


0.0 


113494 Syn Fluid
Cells RA 


48.0 


43.2 


112390 Match 

Control 

Crohns-M 


33.7 


48.6 


113499 Cartilage4
RA 


40.3 


49.7 


112726 
Crohns-M 


25.0 


19.9 


113500 Bone4 RA


63.3 


50.3 


112731 Match 
Control 


19.1 


16.8 


113501 Synovium4 
RA 


44.8 

! 


36.6 
Crohns-M












112380 Ulcer 
Col-F 


14.8 


15.9 


113502 Syn Fluid
Cells4 RA 


33.7 


21.5 


112734 Match 
Control Ulcer 
Col-F 


74.2 


59.9 


113495 Cartilage3 
RA 


48.3 


29.3 


112384 Ulcer
Col-F 


26.2 


28.9 


113496 Bone3 RA


51.8 


45.1 


1 1 ll'Xl \Aatnh 

i i z / j i iviaicn 
Control Ulcer 
Col-F 

112386 Ulcer 
Col-F 


8.8 
11.8 


4.1 


113497 Synovium3
RA 


21.9 


25.3 


10.9 


113498 Syn Fluid
Cells3 RA 


44.1 


47.6 


112738 Match
Control Ulcer 
Col-F 


27.0 


15.0 


117106 Normal 
Cartilage Rep20 


2.0 


5.3 


112381 Ulcer 
Col-M 


0.0 


0.0 


113663 Bone3
Normal 


0.0 


0.2 


i i l i j j iviatcn 
Control Ulcer 
Col-M 


6.3 


6.5 


113664 Synovium3
Normal 


0.0 


0.0 


112382 Ulcer 
Col-M 


13.5


10.8


113665 Syn Fluid
Cells3 Normal 


0.0 


0.0 


1 1 z j v^t iviaicn 
Control Ulcer 
Col-M 


1.6 


7.7 


117107 Normal 
Cartilage Rep22 


10.6 


11.8 


112383 Ulcer
Col-M 


18.7 


26.6 


113667 Bone4
Normal


10.1 


8.4 


112736 Match 
Control Ulcer 
Col-M 


12.9 


6.7 


113668 Synovium4
Normal


10.6 


8.1 


112423 
Psoriasis-F 


20.7 


18.6 


113669 Syn Fluid 
Cells4 Normal 


15.8 


21.3 



Tissue Name 


Rel. Exp.(%) 
Ag3909, Run 
212248424 


Tissue Name 


Rel.Exp.(%) Ag3909, 
Run 212248424 


AD 1 Hippo 


11.7 


Control (Path) 3 Temporal Ctx 


5.6 


AD 2 Hippo 


41.5 


Control (Path) 4 Temporal Ctx 


33.7 


AD 3 Hippo 


7.6 


AD 1 Occipital Ctx 


12.; 


AD 4 Hippo 


7.2 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


75.3 


AD 3 Occipital Ctx 


i.i 


AD 6 Hippo 


36.1 


AD 4 Occipital Ctx 


23.7 
Control 2 Hippo


19.1 


AD 5 Occipital Ctx 


29.9


Control 4 Hippo 


12.9 


AD 6 Occipital Ctx 


35.4 


Control (Path) 3 
Hippo 


12.2 


Control 1 Occipital Ctx 


9.1 


AD 1 Temporal 
Ctx 


9.5 


Control 2 Occipital Ctx 


69.7 


AD 2 Temporal 

Ctx 


36.3 


Control 3 Occipital Ctx


18.9


AD 3 Temporal 

Ctx


3.7 


Control 4 Occipital Ctx 


3.9 


AD 4 Temporal 
Ctx 


31.9


Control (Path) 1 Occipital Ctx 


71.2 


AD 5 Inf 
Temporal Ctx 


100.0


Control (Path) 2 Occipital Ctx 


31.9 


AD 5 

Sup Temporal Ctx


56.6 


Control (Path) 3 Occipital Ctx 


4.3 


AD 6 Inf 
Temporal Ctx 


22.4 


Control (Path) 4 Occipital Ctx 


23.0 


AD 6 Sup 
Temporal Ctx 


32.3 


Control 1 Parietal Ctx 


8.5 


Control 1 
Temporal Ctx 


5.3 


Control 2 Parietal Ctx 


38.2 


Control 2 
Temporal Ctx 


42.9 


Control 3 Parietal Ctx 


29.1 


Control 3 
Temporal Ctx 


3.0 


Control (Path) 1 Parietal Ctx 


73.2 


Control 4 

Temporal Ctx


4.4 


Control (Path) 2 Parietal Ctx 


44.4 


Control (Path) 1
Temporal Ctx 


8.2 


Control (Path) 3 Parietal Ctx 


7.3 


Control (Path) 2 
Temporal Ctx 


1.8


Control (Path) 4 Parietal Ctx


49.0 



Tissue Name 


Rel. Exp.(%)
Ag3909, Run
217235826


Rel. Exp.(%)
Ag3909, Run
219173644


Tissue Name


Rel. Exp.(%)
Ag3909, Run
217235826


Rel. Exp.(%)
Ag3909, Run
219173644


Adipose 


0.6 


0.6 


Renal ca. TK-10 


0.0 


0.0 


Melanoma* Hs688(A).T 


0.0 


0.0 


Bladder 


9.7 


11.7 


Melanoma* Hs688(B).T 


0.0 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


100.0 


100.0 
Melanoma* M14


0.3 


0.4 


Gastric ca. KATO III


0.5 


0.6 


Melanoma* LOXIMVI 


0.1 


0.2 


Colon ca. SW-948 


0.5 


0.5 


Melanoma* SK-MEL-5 


0.2 


0.2 


Colon ca. SW480 


0.1 


0.1 


Squamous cell carcinoma 
SCC-4 


0.5 


0.5 


Colon ca* (SW480 
met) SW620 


0.0 


0.0 


Testis Pool 


0.6 


0.5 


Colon ca. HT29 


0.1 


Q j 


Prostate ca.* (bone met) 
PC-3 


0.1 


0.1




Colon ca. HCT-116


0.3 



0.3 


Prostate Pool 


0.3 


0.3 


Colon ca. CaCo-2 


0.1 


0.0 


Placenta 


0.4 


0.5 


Colon cancer tissue 


0.6 


0.5 


Uterus Pool 


0.2 


0.2 


Colon ca. SW1116 


0.2 


0.1 


Ovarian ca. OVCAR-3 


0.2 


0.2 


Colon ca. Colo-205 


0.0 


0.0 


Ovarian ca. SK-OV-3 


0.5 


0.4 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. OVCAR-4 


0.1 


0.1 


Colon Pool 


0.4 


0.5 


Ovarian ca. OVCAR-5 


0.6 


0.5 


Small Intestine Pool 


0.4 


0.4 


Ovarian ca. IGROV-1 


0.0 


0.0 


Stomach Pool 


0.1 


0.3 


Ovarian ca. OVCAR-8 


0.0 


0.0 




Bone Marrow Pool 


0.2 


0.2 


Ovary 


0.8 


0.6 




Fetal Heart 


0.2 


0.2 


Breast ca. MCF-7 


0.0 


0.0 




Heart Pool 


0.5 


0.5 


Breast ca. MDA-MB-231


0.2 


0.2 


Lymph Node Pool


0.4 




Breast ca. BT 549 


2.3 


3.1 


Fetal Skeletal Muscle


0.2 


0.2 


Breast ca. T47D 


1.1 


1.0 


Skeletal Muscle Pool


1.7 


1.9 


Breast ca. MDA-N 


0.2 


0.2 


Spleen Pool


2.3 


2.6 


Breast Pool 


0.3 


0.4 


Thymus Pool


0.5 


0.5 






1.0 


CNS cancer (glio/astro)
U87-MG


0.0 


0.0 






0.1 


CNS cancer (glio/astro)
U-118-MG


1.0 


1.3 






0.9 


CNS cancer
(neuro;met) SK-N-AS


1.2 


1.5 


Lung ca. NCI-N417


0.0 


0.0 


CNS cancer (astro) SF-
539


0.3 


0.4 


Lung ca. LX-1 


0.0 


0.0 


CNS cancer (astro)
SNB-75


0.1 


0.1 


Lung ca. NCI-H146 


0.9 


, , CNS cancer (glio) 
SNB-19 


0.0 


0.0 


Lung ca. SHP-77 


0.5 


0.5 


CNS cancer (glio) SF-
295


0.2 


0.2 


Lung ca. A549 


3.0 


0.0


Brain (Amygdala) Pool


0.6 


0.5 


Lung ca. NCI-H526 


0.0 


0.0


Brain (cerebellum)


0.1 


0.2 
Lung ca. NCI-H23


0.0 


0.0 


Brain (fetal) 


0.8 


0.7 


Lung ca. NCI-H460 


0.0 


0.0 


Brain (Hippocampus) 
Pool 


0.5 


0.5 


Lung ca. HOP-62 


0.0 


0.0 


Cerebral Cortex Pool 






Lung ca. NCI-H522 


0.0 


0.0 


Brain (Substantia 
nigra) Pool 


0.8 


0.7 


Liver 


0.1 


0.1 


Brain (Thalamus) Pool 


0.8 


0.9 


Fetal Liver 


4.7 


4.1 


Brain (whole) 


1.3 


1.3 


Liver ca. HepG2 


0.0


0.0 


Spinal Cord Pool






Kidney Pool 


0.7


0.8 


Adrenal Gland 




OS 


Fetal Kidney 


0.2 


0.2 


Pituitary gland Pool


0.3


0.3 


Renal ca. 786-0 


0.0 


0.0 


Salivary Gland


0.5


0.4 


Renal ca. A498 


0.2 


0.2 


Thyroid (female)


14 


0.2 


Renal ca. ACHN 


0.0 


0.0 


Pancreatic ca. 

CAPAN2


0.3


0.3 


Renal ca. UO-31


0.1 


0.0 


Pancreas Pool


3.4 


0.3 


Table YF. Panel 1.3D 


Tissue Name 


Rel. Exp.(%)
Ag1980, Run
165534458


Tissue Name 


Rel. Exp.(%)
Ag1980, Run
165534458


Liver 

adenocarcinoma 


1.4 


Kidney (fetal) 


1 — " 

!0.4 

I 




Pancreas 


0.4 


Renal ca. 786-0 


0.0




Pancreatic ca. 
CAPAN 2 


0.0 


Renal ca. A498 


2.0 


Adrenal gland 


0.6 


Renal ca. RXF 393 


0.7 


Thyroid 


0.7 


Renal ca. ACHN 


0.0 


Salivary gland 


1.0 


Renal ca. UO-31


0.0 


Pituitary gland 


0.5 


Renal ca. TK-10 


0.1 


Brain (fetal) 


0.8 


Liver 


0.5 


Brain (whole) 


3.8 


Liver (fetal) 


5.3 


Brain (amygdala) 


2.9 


Liver ca. (hepatoblast) 
HepG2 


0.0 


Brain (cerebellum) 


0.6 


Lung 


1.5 


Brain (hippocampus) 


3.2 


Lung (fetal) 


2.5 


Brain (substantia 
nigra) 


1.6 


Lung ca. (small cell) LX-1 


0.0 


Brain (thalamus) 


5.1 


Lung ca. (small cell) NCI- 
H69 


0.4 


Cerebral Cortex 


1.9 


Lung ca. (s.cell var.) SHP-
77


0.5


Spinal cord


1.6 


Lung ca. (large cell) NCI-
H460


0.0 


glio/astro U87-MG 




Lung ca. (non-sm. cell)
A549


0.0 


glio/astro U-118-MG


1.8 


Lung ca. (non-s.cell) NCI-
H23


0.0 




astrocytoma SW1783


0.0 


Lung ca. (non-s.cell) HOP-
62


0.0 


1 

neuro*; met SK-N- 
AS 


1.1 


Lung ca. (non-s.cl) NCI-
H522


0.0 


astrocytoma SF-539 


7 ' 9 


Lung ca. (squam.) SW 900


1.5 




astrocytoma SNB-75 


1.7 


Lung ca. (squam.) NCI-
H596


0.1 


glioma SNB-19 


0.0 


Mammary gland


0.7 


glioma U251


0.7 


Breast ca.* (pl.ef) MCF-7


0.1 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) MDA-
MB-231


0.3 


Heart (fetal) 


0.1 


Breast ca.* (pl.ef) T47D


0.0 


Heart 


1.2 


Breast ca. BT-549


1.1 




Skeletal muscle 
(fetal) 


0.0 


Breast ca. MDA-N


0.0 




Skeletal muscle 


1.0 


Ovary


0.3 


Bone marrow 


4.5 


Ovarian ca. OVCAR-3


0.1 


Thymus 


1.2 


Ovarian ca. OVCAR-4


0.1 


Spleen 


1.4 


Ovarian ca. OVCAR-5 


0.3 


Lymph node 


2.9 


Ovarian ca. OVCAR-8 


0.0 


Colorectal 




Ovarian ca. IGROV-1 


0.0 


Stomach 


1.4 


Ovarian ca.* (ascites) SK- 
OV-3 


0.4 


— — 1 — r r 

otnall intestine 


1 2 - 


Uterus 


1.8 


Colon ca. SW480


0.0 


Placenta 


1.3 


Colon ca.*
SW620 (SW480 met)


0.0 


Prostate 


0.3 


Colon ca. HT29 


3.0 


Prostate ca.* (bone met)PC- 

3 


0.1 


Colon ca. HCT-116


3.1 


Testis 


1.2 


Colon ca. CaCo-2 


3.0 


Melanoma Hs688(A).T 


D.l 


Colon ca. 
tissue(OD03866) 


).6 


Melanoma* (met) 
Hs688(B).T 


3.1 


Colon ca. HCC-2998 


2.0 


Melanoma UACC-62 


.0 


Gastric ca.* (liver 


OO.D 


Melanoma M14 


3.3 
Bladder


8.2 


Melanoma LOX IMVI 


0.1 


Trachea 


2.0 


Melanoma* (met) SK-MEL- 

5 


0.1 


Kidney 


0.4 


Adipose 


0 6 



Table YG. Panel 2D



Tissue Name


Rel. Exp.(%)
Ag1980, Run
169484147


Tissue Name


Rel. Exp.(%)
Ag1980, Run
169484147


Normal Colon 


14.7 


1 idn i 1 i 8120608 -1.1 


CC Well to Mod Diff 
(OD03866) 


3.2 


Kidney Cancer 8120613 


1.4 


CC Margin 
(OD03866) 


6.3 


Kidney Margin 8120614 


1.5 


CC Gr.2 rectosigmoid 
(OD03868) 


3.5 


Kidney Cancer 9010320 


6.2 


CC Margin 
(OD03868) 


3.3 


Kidney Margin 90 10321 h.6 


CC Mod Diff
(ODO3920)


1.3 


Normal Uterus


1.2


jCC Margin j 
: (ODO3920) j 


" ~" 1 

Uterus Cancer 0640 1 1 ;3.4 


CC Gr.2 ascend colon 
(OD03921) 


3.7 


Normal Thyroid 


4.7 


CC Margin 
(OD03921) 


1.8 


Thyroid Cancer 064010 


12.1 


CC from Partial 
Hepatectomy 
(ODO4309) Mets 


5.0 


Thyroid Cancer A302152


1.6 


Liver Margin 
(ODO4309) 


6.7 


Thyroid Margin A302153 


4.4 


Colon mets to lung 
(OD04451-01) 


5.0 


Normal Breast 


4.5 


Lung Margin 
(OD04451-02) 


8.4 


Breast Cancer (OD04566) 


8.1 


Normal Prostate 
6546-1 


5.2 


Breast Cancer (OD04590- 
01) 


11.8 


Prostate Cancer 
(OD04410) 


9.1 


Breast Cancer Mets
(OD04590-03) 


7.9 


Prostate Margin 
(OD04410) 


3.1 


Breast Cancer Metastasis 
(OD04655-05) 


5.3 


Prostate Cancer 
(OD04720-01) 


3.2 


Breast Cancer 064006 


18.0 
Prostate Margin
(OD04720-02) 


6.4 


Breast Cancer 1024 


2.7 


Normal Lung 061010 


14.8 


Breast Cancer 9100266 


6.2 


Lung Met to Muscle 
(OD04286) 


3.4 


Breast Margin 9100265 


1.9 


Muscle Margin 
(OD04286)


2.8 


Breast Cancer A209073 


6.5


Lung Malignant 
Cancer (OD03126) 


5.2 


Breast Margin A209073 


1.5 


Lung Margin 
(OD03126) 


21.5


Normal Liver 


2.3 


Lung Cancer 
(OD04404) 




Liver Cancer 064003 


6.2 


Lung Margin 
(OD04404) 


6.8 


Liver Cancer 1025 


6.5 


Lung Cancer 
(OD04565)


6.8 


Liver Cancer 1026


2.1 


Lung Margin
(OD04565) 


3.6 


Liver Cancer 6004-T 


6.0 


Lung Cancer j 
(OD04237-01) I 1 "' 1 


Liver Tissue 6004-N 


2.9


jLung Margin 1 , 
'(OD04237-02) h 4 ' 5 


Liver Cancer 6005-T 


3.3


Ocular Mel Met to j 
Liver (OD043 10) f 


Liver Tissue 6005-N 


1.7 


Liver Margin 
(ODO4310) 


3.5 


Normal Bladder 


100.0 


Melanoma Mets to 
Lung (OD04321)


1.8 


Bladder Cancer 1023 


0.6 


Lung Margin 
(OD04321) 


10.3 


Bladder Cancer A302173


6.3 


Normal Kidney 


11.4 


Bladder Cancer (OD04718-
01)


74.2 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


5.3 


Bladder Normal Adjacent 
(OD04718-03)


5.1 


Kidney Margin 
(OD04338) 


4.2 


Normal Ovary


3.5 


Kidney Ca Nuclear 
grade 1/2(OD04339) 


1.5 


Ovarian Cancer 064008 


11.7 


Kidney Margin 
(OD04339) 


„ 


Ovarian Cancer (OD04768- 
37) 


?9.3 


Kidney Ca, Clear cell 
type (OD04340) 


„ 


Ovary Margin (OD04768- 
)8) 


).9 


Kidney Margin ' 


.5 I 


Normal Stomach


0.7
(OD04340)








Kidney Ca, Nuclear 
grade 3 (OD04348) 


59.9 


Gastric Cancer 9060358 


3.0 


Kidney Margin 
(OD04348) 


81.2 


Stomach Margin 9060359 


9.9 


Kidney Cancer 
(OD04622-01)


6.1 


Gastric Cancer 9060395 


6.9 


Kidney Margin
(OD04622-03) 


Stomach Margin 9060394 


7.7 


Kidney Cancer 
(OD04450-01) 


1.5 


Gastric Cancer 9060397 


7.2 


Kidney Margin 
(OD04450-03) 


1.7 


Stomach Margin 9060396 


10.4 


Kidney Cancer 
8120607 


0.9 


Gastric Cancer 064005 


19.9 



Tissue Name


Rel. Exp.(%)
Ag3909, Run
170127176


Tissue Name 


Rel. Exp.(%) 
Ag3909, Run
170127176 


Secondary Th1 act


0.9


HUVEC IL-1beta


0.1


Secondary Th2 act


32.3


HUVEC IFN gamma


0.9


Secondary Tr1 act


3.6


HUVEC TNF alpha + IFN 
gamma 


10.5 


Secondary Th1 rest


5.5


HUVEC TNF alpha + IL4


0.4 


Secondary Th2 rest 


2.0 


HUVEC IL-11


0.0


Secondary Tr1 rest


4.8 


Lung Microvascular EC 
none 


0.2 


Primary Th1 act


1.3


Lung Microvascular EC
TNFalpha+ IL-1beta


0.6 


Primary Th2 act 


2.0 


Microvascular Dermal EC 
none 


0.1 


Primary Tr1 act


1.1


Microvascular Dermal EC
TNFalpha+ IL-1beta


0.4 


Primary Th1 rest


3.5


Bronchial epithelium
TNFalpha + IL-1beta


0.0 


Primary Th2 rest 


0.3 


Small airway epithelium 
none 


0.0 


Primary Tr1 rest


2.5


Small airway epithelium
TNFalpha+ IL-1beta


0.0 


CD45RA CD4 
lymphocyte act 


9.3 


Coronary artery SMC rest


0.0


CD45RO CD4
lymphocyte act


12.4


Coronary artery SMC
TNFalpha+ IL-1beta


0.0 






WO 03/010327 



PCT/US02/14199 



CD8 lymphocyte act 


1.2 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


4.7 


Astrocytes TNFalpha + IL-
1beta


0.5


Secondary CD8
lymphocyte act 


0.7 


KU-812 (Basophil) rest 


1.4 


CD4 lymphocyte 
none 




KU-812 (Basophil)
PMA/ionomycin 




2ry 

Th1/Th2/Tr1 anti-
CD95 CH11


1.2


CCD1106 (Keratinocytes)
none 


0.4 


LAK cells rest 


2.8 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


7.7 


LAK cells IL-2 


7.5 


Liver cirrhosis 


0.1 


LAK cells IL-2+IL- 
12 


7.1 


NCI-H292 none 


0.1


LAK cells IL-2+IFN 
gamma 


9.5 


NCI-H292 IL-4 


0.3 


LAK cells IL-2+ IL- 
18 


11.7 


NCI-H292 IL-9 


0.4 


LAK cells
PMA/ionomycin


6.9


NCI-H292 IL-13


0.5 


NK Cells IL-2 rest 


11.3 


NCI-H292 IFN gamma


2.0 


Two Way MLR 3 day 


14.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


5.3 


HPAEC TNF alpha + IL-1 beta


2.9 


Two Way MLR 7 day 


1.7 


Lung fibroblast none 


0.0 


PBMC rest 


0.4 


Lung fibroblast TNF alpha + 
IL-1 beta 


9.5 


PBMC PWM 


2.7 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


1.1 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN gamma 




B lymphocytes PWM 


0.9 


Dermal fibroblast CCD1070 
rest 




B lymphocytes 
CD40L and IL-4 


0.5 


Dermal fibroblast CCD 1070 
TNF alpha 


1.1 


EOL-1 dbcAMP 


0.6 


Dermal fibroblast CCD 1070 
IL-1 beta 


0.8 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


6.3 


Dendritic cells none 


0.6 


Dermal fibroblast IL-4 


0.3


Dendritic cells LPS


20.0 




Dermal Fibroblasts rest 


0.1 


Dendritic cells anti- 
CD40 


0.3 




Neutrophils TNFa+LPS 


0.1 


Monocytes rest 


2.7 


Neutrophils rest 


0.3 


Monocytes LPS 


100.0 


Colon 


0.5 


Macrophages rest 


0.9 


Lung 


0.9 


Macrophages LPS 


36.9 


Thymus 


1.0 


HUVEC none


0.0 


Kidney 


0.4 


HUVEC starved 


0.1 






Table YI. Panel 4D 




Tissue Name 


Rel. Exp.(%) 
Ag1980, Run 
161683379 


Tissue Name 


Rel. Exp.(%)
Ag1980, Run
161683379


Secondary Th1 act


1.0 


HUVEC IL-1beta


0.1 


Secondary Th2 act 


52.5 




HUVEC IFN gamma


2.4 


Secondary Tr1 act


9.4 


HUVEC TNF alpha + IFN
gamma


29.5 


Secondary Th1 rest


8.9 


HUVEC TNF alpha + IL4


1.0 


Secondary Th2 rest 


1.3 


HUVEC IL-11


0.0 


Secondary Tr1 rest


10.5 


Lung Microvascular EC
none




Primary Th1 act


4.5 


Lung Microvascular EC 
TNFalpha + IL-1beta


- - 


Primary Th2 act 


3.4 


Microvascular Dermal EC
none


_ 


Primary Tr1 act


4.5 


Microvascular Dermal EC
TNFalpha + IL-1beta




Primary Th1 rest


34.2 


Bronchial epithelium 
TNFalpha + IL-1beta




Primary Th2 rest 


3.5 


Small airway epithelium 
none 




Primary Tr1 rest


5.8 


Small airway epithelium 
TNFalpha + IL-1beta


0.2


CD45RA CD4 
lymphocyte act 


1.1 


Coronary artery SMC rest


10 


CD45RO CD4 
lymphocyte act 


1.7 


Coronary artery SMC
TNFalpha + IL-1beta


3.0 


CD8 lymphocyte act 


.2 


Astrocytes rest ( 


).0 


Secondary CD8 
lymphocyte rest 


.4 


Astrocytes TNFalpha + IL-1beta


.7 


Secondary CD8 
lymphocyte act


.1 


KU-812 (Basophil) rest ( 


.0


CD4 lymphocyte
none 


08 


KU-812 (Basophil) 
PMA/ionomycin 


15.1 


2ry Th1/Th2/Tr1 anti-CD95 CH11


1.8 


CCD1106 (Keratinocytes)
none 


0.8 


LAK cells rest 


6.0 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


2.6 


LAK cells IL-2


19.5 


Liver cirrhosis 


0.3 


LAK cells IL-2+IL-12


12.4 


Lupus kidney 


0.2 


LAK cells IL-2+IFN
gamma


53.2 


NCI-H292 none 


0.7 


LAK cells IL-2+IL-18


61.1 


NCI-H292 IL-4 


1.6 


LAK cells 
PMA/ionomycin 


12.4 


NCI-H292 IL-9 


2.2 


NK Cells IL-2 rest 


14.7 


NCI-H292 IL-13 


0.4 


Two Way MLR 3 day


?1L_ _ 


NCI-H292 IFN gamma


6.2 


Two Way MLR 5 day


7.9 


HPAEC none 


0.0 


Two Way MLR 7 day 


3.4 


HPAEC TNF alpha + IL-1 beta


0.0 




L. ._ 


Lung fibroblast none 


0.0


PBMC PWM 


17.8 


Lung fibroblast TNF alpha + 
IL-1 beta 


17.9 




3.3 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13


0.0 


B lymphocytes PWM 


12.6 


Lung fibroblast IFN gamma


11.3 


B lymphocytes 
CD40L and IL-4 


2.7 


Dermal fibroblast CCD 1070 
rest 


0.1 


EOL-1 dbcAMP 


0.1 


Dermal fibroblast CCD1070
TNF alpha 


2.2 


EOL-1 dbcAMP 
PMA/ionomycin 


0.1 


Dermal fibroblast CCD1070 
IL-1 beta 


0.5 


Dendritic cells none 


1.1 


Dermal fibroblast IFN
gamma


9.1 


Dendritic cells LPS 


48.6 


Dermal fibroblast IL-4 


0.4 


Dendritic cells anti- 
CD40 


0.4 


IBD Colitis 2 


0.1 


Monocytes rest 


5.7 


IBD Crohn's


0.3 


Monocytes LPS 


100.0 


Colon 


3.6


Macrophages rest


.3 I 


Lung


3.6


Macrophages LPS 


9.6 


Thymus


.8 


HUVEC none ( 


).0 I 


Kidney


.0 


HUVEC starved 


).2 








Table YJ. Panel 5 Islet 






Tissue Name


Rel. Exp.(%)
Ag3909, Run
242413199


Tissue Name


Rel. Exp.(%)
Ag3909, Run
242413199


97457 Patient- 
02go_adipose 


90.1 


94709_Donor 2 AM - A_adipose


3.0 


97476 Patient- 
07sk_skeletal muscle 


22.2 


94710_Donor2 AM - 
B_adipose 


1.8 


97477 Patient- 
07ut_uterus 


18.2 


94711_Donor 2 AM - C_adipose


0.0 


97478 Patient- 
07pl_placenta 


53.6 


94712_Donor 2 AD - A_adipose


1.6


99167 Bayer Patient 
1 


100.0 


94713_Donor2 AD - 
B_adipose 


1.2 


97482 Patient- 
08ut_uterus 


44.8 


94714_Donor2 AD - 
C_adipose 


0.0 


97483 Patient- 
08pl_placenta 


74.2 


94742_Donor 3 U - 

A Mesenchymal Stem Cells 


3.1


97486 Patient- 
09sk_skeletal muscle 


9.3 


94743_Donor 3 U - 

B Mesenchymal Stem Cells 


3.7 


97487 Patient- 
09ut_uterus 


24.1 


94730 Donor 3 AM - 
A_adipose 


1.7 


97488 Patient- 
09pl_placenta 


44.1 


94731_Donor 3 AM - B_adipose


3.6 


97492 Patient- 
10ut_uterus 


47.0 


94732_Donor3 AM - 
C_adipose 


0.0 


97493 Patient- 
10pl_placenta 


68.3 


94733_Donor3 AD- 
A_adipose 


0.0 


97495_Patient-11go_adipose


17.4 


94734_Donor 3 AD - B_adipose




97496_Patient-11sk_skeletal muscle


22.8 


94735_Donor3 AD- 
C_adipose 




97497_Patient-11ut_uterus


24.3 


77138_Liver_HepG2untreated


1.8 


97498_Patient-11pl_placenta


15.4 


73556_Heart_Cardiac 
stromal cells (primary) 


0.0 


97500_Patient- 
12go_adipose 


62.4 


81735_Small Intestine 


97.3 


97501_Patient-12sk_skeletal muscle


79.6


72409_Kidney_Proximal Convoluted Tubule


5.6




97502_Patient- 
12ut_uterus 


30.6 


82685_Small intestine_Duodenum


54.3 


97503 Patient- 
12pl_placenta 


29.7 


90650_Adrenal_Adrenocortical adenoma


10.0 


94721_Donor2U- 
A_Mesenchymal 
Stem Cells 


0.0 


72410 Kidney HRCE 


8.3 


94722_Donor 2 U - B_Mesenchymal Stem Cells


0.0 


72411_Kidney_HRE


28.3 


94723_Donor 2 U - C_Mesenchymal Stem Cells


0.0 


73139_Uterus_Uterine 
smooth muscle cells 


0.0 



AI_comprehensive panel_v1.0 Summary: Ag1980 Two experiments with the
same probe and primer produce results that are in excellent agreement, with highest
expression in normal tissue adjacent to psoriasis (CTs=30.5-31.2). This target is induced in
bone tissue, synovial fluid, synovial fluid cells and synovium from arthritis patients
(rheumatoid arthritis, RA, and osteoarthritis, OA). In addition, the expression of this
transcript in these samples from normal patients is much lower. Other tissues, including
skin and lung, also express this transcript. However, consistent expression in diseased
tissue, as compared to adjacent tissue or normal lung, is not apparent. This may be due to
contamination with activated monocytes, which highly express this transcript (see Panel
4.1D).

CNS_neurodegeneration_v1.0 Summary: Ag3909 This panel does not show
differential expression of the CG94235-01 gene in Alzheimer's disease. However, this
expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for
discussion of utility of this gene in the central nervous system.

General_screening_panel_v1.4 Summary: Ag3909 Two experiments with the
same probe and primer produce results that are in excellent agreement, with highest
expression of the CG94235-01 gene in a gastric cancer cell line (CTs=23.6-24.4). Thus,
expression of this gene could be used to differentiate this sample from other samples on
this panel and as a marker of gastric cancer. This gene encodes a putative thymidylate
kinase, a DNA synthesis enzyme necessary for cell growth. Thus, therapeutic modulation
of the expression or function of this gene product may be useful in the treatment of gastric
cancer.

Among tissues with metabolic function, this gene is expressed at moderate to low
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal
muscle, heart, and liver. This widespread expression among these tissues suggests that this
gene product may play a role in normal neuroendocrine and metabolic function.
Dysregulated expression of this gene may contribute to neuroendocrine disorders or
metabolic diseases, such as obesity and diabetes.

In addition, this gene is expressed at much higher levels in fetal lung, liver and
skeletal muscle tissue (CTs=28-30) when compared to expression in the adult counterpart
(CTs=32.5-35). Thus, expression of this gene may be used to differentiate between the
fetal and adult source of these tissues.

This gene is also expressed at moderate to low levels in the CNS, including the
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex.
Therefore, therapeutic modulation of the expression or function of this gene may be useful
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease,
schizophrenia, multiple sclerosis, stroke and epilepsy.

Panel 1.3D Summary: Ag1980 Highest expression of the CG94235-01 gene in
this panel is seen in a gastric cancer cell line (CT=26). Overall, expression is in reasonable
agreement with the results in Panel 1.4. Moderate to low levels of expression are seen in
metabolic tissues including adipose, adult and fetal liver, skeletal muscle, heart, pituitary,
thyroid and adrenal. Moderate to low levels of expression are seen in all CNS
regions examined.

In addition, higher levels of expression are seen in fetal liver (CT=30.2) when
compared to expression in adult liver (CT=33.7). Thus, expression of this gene could be
used to differentiate between the adult and fetal sources of this tissue.
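
The fetal-versus-adult liver comparison can be put on a linear scale with a short calculation. The sketch below assumes ideal amplification, so that the template doubles with each cycle and a CT difference of n cycles corresponds to roughly a 2^n difference in starting transcript. The function name and the `efficiency` parameter are illustrative assumptions and are not taken from the assay described here; the result is a rough guide only and does not replace normalization against a reference gene.

```python
def fold_difference(ct_low_expr: float, ct_high_expr: float,
                    efficiency: float = 2.0) -> float:
    """Estimate the fold difference in starting template between two samples.

    A lower CT means more template. `efficiency` is the assumed per-cycle
    amplification factor (2.0 = perfect doubling, an idealization).
    """
    if efficiency <= 1.0:
        raise ValueError("efficiency must be greater than 1")
    return efficiency ** (ct_low_expr - ct_high_expr)


# CT values quoted in the Panel 1.3D summary for liver.
adult_liver_ct = 33.7
fetal_liver_ct = 30.2

ratio = fold_difference(adult_liver_ct, fetal_liver_ct)
print(f"Fetal liver: about {ratio:.0f}-fold more transcript than adult liver")
```

Under these assumptions the 3.5-cycle gap corresponds to roughly an 11-fold difference; a lower real-world amplification efficiency would shrink that estimate.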

Panel 2D Summary: Ag1980 Highest expression of the CG94235-01 gene is seen
in normal bladder (CT=27.3). In addition, higher levels of expression are seen in ovarian,
bladder and lung cancers when compared to expression in normal adjacent tissue. Thus,
expression of this gene could be used as a marker of these cancers. Furthermore,
therapeutic modulation of the expression or function of this gene product may be useful in
the treatment of ovarian, bladder and lung cancers.

Panel 4.1D Summary: Ag3909 Highest expression of the CG94235-01 gene is
seen in LPS-treated monocytes (CT=25.4). Prominent levels of expression are also seen in
LPS-activated macrophages and dendritic cells. This transcript may encode a protein that
is important in the normal regulation of cytokines. Inappropriate regulation of the protein
encoded by this gene may result in the enhanced and uncontrolled expression of
inflammatory cytokines. Therefore, designing therapies that could regulate the expression
(such as antisense therapies) or function of the protein encoded by this gene may be
important in the treatment of osteoarthritis and rheumatoid arthritis as well as other
diseases.

Panel 4D Summary: Ag1980 The expression profile of the CG94235-01 gene in
this panel is similar to the expression pattern seen in Panel 4.1D using the Ag3909 probe
and primer set, except that it is also expressed in LAK cells. This may reflect differences
between Panels 4 and 4.1 or slightly different amplicons. Please see Panel 4.1D for
discussion of utility of this gene in inflammation.

Panel 5 Islet Summary: Ag3909 Highest expression of the CG94235-01 gene is 
seen in islet cells (CT=33.4). Low but significant levels of expression are seen in other 
metabolic tissues, including adipose, placenta and skeletal muscle. Please see Panel 1.4 for 
discussion of utility of this gene in metabolic disease. 

General oncology screening panel_v_2.4 Summary: Ag3909 Results from one 
experiment with the CG94235-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Z. CG94235-02: Splice Variant of CG94235-01 

Expression of gene CG94235-02 was assessed using the primer-probe sets Ag3909 
and Ag6052, described in Tables ZA and ZB. Results of the RTQ-PCR runs are shown in 
Tables ZC, ZD, ZE, ZF, ZG and ZH. 



Table ZA. Probe Name Ag3909

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-caggtgccacgtctaactagat-3' | 22 | 1529 | 299
Probe | TET-5'-tgttgtttgaaacatctacatccacca-3'-TAMRA | 27 | 1498 | 300
Reverse | 5'-gaaatttgggaacactgcataa-3' | 22 | 1472 | 301



Table ZB. Probe Name Ag6052

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-cctaggcgccgttttg-3' | 16 | 2242 | 302
Probe | TET-5'-agtgtacctcctttattcctgaagcccg-3'-TAMRA | 28 | 2210 | 303
Reverse | 5'-gtcgaccaggtcaagcact-3' | 19 | 2188 | 304
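
Because the sequences in Tables ZA and ZB pass through transcription and typesetting, a quick consistency check of each oligonucleotide against its listed length can be useful. The following is a minimal sketch; the dictionary layout and the helper names `gc_fraction` and `length_mismatches` are illustrative assumptions, and GC content is reported only as a convenience and is not a value given in the tables.

```python
# Sequences and listed lengths are copied from Tables ZA and ZB.
PRIMER_SETS = {
    "Ag3909": {
        "Forward": ("caggtgccacgtctaactagat", 22),
        "Probe": ("tgttgtttgaaacatctacatccacca", 27),
        "Reverse": ("gaaatttgggaacactgcataa", 22),
    },
    "Ag6052": {
        "Forward": ("cctaggcgccgttttg", 16),
        "Probe": ("agtgtacctcctttattcctgaagcccg", 28),
        "Reverse": ("gtcgaccaggtcaagcact", 19),
    },
}


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.lower()
    return sum(base in "gc" for base in seq) / len(seq)


def length_mismatches(primer_sets: dict) -> list:
    """List every oligo whose sequence length differs from the listed length."""
    problems = []
    for set_name, oligos in primer_sets.items():
        for role, (seq, listed_length) in oligos.items():
            if len(seq) != listed_length:
                problems.append((set_name, role, len(seq), listed_length))
    return problems


for set_name, oligos in PRIMER_SETS.items():
    for role, (seq, _) in oligos.items():
        print(f"{set_name} {role}: {len(seq)} nt, GC {gc_fraction(seq):.0%}")
print("Length mismatches:", length_mismatches(PRIMER_SETS))
```

An empty mismatch list indicates that each sequence as printed has the number of bases stated in its Length column.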




Table ZC. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag3909, Run 212248424 | Tissue Name | Rel. Exp.(%) Ag3909, Run 212248424
AD 1 Hippo | 11.7 | Control (Path) 3 Temporal Ctx | 5.6
AD 2 Hippo | 41.5 | Control (Path) 4 Temporal Ctx | 33.7
AD 3 Hippo | 7.6 | AD 1 Occipital Ctx | 12.5
AD 4 Hippo | 7.2 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 75.3 | AD 3 Occipital Ctx | 1.1
AD 6 Hippo | 36.1 | AD 4 Occipital Ctx | 23.7
Control 2 Hippo | 19.1 | AD 5 Occipital Ctx | 29.9
Control 4 Hippo | 12.9 | AD 6 Occipital Ctx | 35.4
Control (Path) 3 Hippo | 12.2 | Control 1 Occipital Ctx | 9.1
AD 1 Temporal Ctx | 9.5 | Control 2 Occipital Ctx | 69.7
AD 2 Temporal Ctx | 36.3 | Control 3 Occipital Ctx | 18.9
AD 3 Temporal Ctx | b.7 | Control 4 Occipital Ctx | 3.9
AD 4 Temporal Ctx | 31.9 | Control (Path) 1 Occipital Ctx | 71.2
AD 5 Inf Temporal Ctx | 100.0 | Control (Path) 2 Occipital Ctx | 31.9
AD 5 Sup Temporal Ctx | 56.6 | Control (Path) 3 Occipital Ctx | 4.3
AD 6 Inf Temporal Ctx | 22.4 | Control (Path) 4 Occipital Ctx | 23.0
AD 6 Sup Temporal Ctx | 32.3 | Control 1 Parietal Ctx | 8.5
Control 1 Temporal Ctx | 5.3 | Control 2 Parietal Ctx | 38.2
Control 2 Temporal Ctx | 42.9 | Control 3 Parietal Ctx | 29.1
Control 3 Temporal Ctx | 13.0 | Control (Path) 1 Parietal Ctx | 73.2
Control 4 Temporal Ctx | 4.4 | Control (Path) 2 Parietal Ctx | 44.4
Control (Path) 1 Temporal Ctx | 58.2 | Control (Path) 3 Parietal Ctx | 7.3
Control (Path) 2 Temporal Ctx | 41.8 | Control (Path) 4 Parietal Ctx | 49.0


Tissue Name


Rel. Exp.(%) 
Ag3909, Run 
217235826 


Rel. Exp.(%) 
Ag3909, Run 
219173644 


Tissue Name


Rel. Exp.(%)
Ag3909, Run
217235826


Rel. Exp.(%)
Ag3909, Run
219173644


Adipose 


0.6 


0.6 


Renal ca. TK-10 


0.0 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


0.0 


Bladder 


9.7 


11.7


Melanoma* 
Hs688(B).T 


0.0 


0.0 


Gastric ca. (liver 
met.) NCI-N87 


100.0 


100.0 


Melanoma* 
M14 


0.3 


0.4 


Gastric ca. KATO III


0.5 


0.6 


Melanoma* 
LOXIMVI 


0.1 


0.2 


Colon ca. SW-948 


0.5 


0.5 


Melanoma* 
SK-MEL-5 


0.2 


0.2 


Colon ca. SW480 


0.1 


0.1 


Squamous cell
carcinoma SCC-4


0.5


0.5


Colon ca.* (SW480 
met) SW620 


0.0 


0.0 


Testis Pool 


0.6 


0.5 


Colon ca. HT29 


0.1 


0.1 


Prostate ca.*
(bone met) 
PC-3 


0.1 


0.1 


Colon ca. HCT-116


0.3 


0.3 


Prostate Pool 


0.3 


0.3 


Colon ca. CaCo-2 


0.1 


0.0 


Placenta 


0.4 


0.5 


Colon cancer tissue 


0.6 


0.5 


Uterus Pool 


0.2 


0.2 


Colon ca. SW1116 


0.2 


0.1 


Ovarian ca. 
OVCAR-3 


0.2 


0.2 


Colon ca. Colo-205 


0.0 


0.0 


Ovarian ca. 
SK-OV-3 


0.5 


0.4 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.1 


0.1 


Colon Pool 


0.4 


0.5 


Ovarian ca. 
OVCAR-5 


0.6 


0.5 


Small Intestine Pool 


0.4 


0.4 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach Pool 


0.1 


0.3 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Bone Marrow Pool 


0.2 


0.2 


Ovary 


0.8 


0.6 


Fetal Heart


0.2


0.2 


Breast ca. 
MCF-7 


0.0 


0.0 


Heart Pool 


0.5 


0.5 


Breast ca.
MDA-MB-231


0.2


0.2


Lymph Node Pool


0.4


0.4












Breast ca. BT 
549 


2.3 


3.1 


Fetal Skeletal Muscle 


0.2 


0.2 


Breast ca. 
T47D 


1.1 


1.0 


Skeletal Muscle Pool 


1.7 


1.9 


Breast ca. 
MDA-N 


0.2 


0.2


Spleen Pool 


2.3 


2.6


Breast Pool 


0.3 


0.4 


Thymus Pool 


0.5 


0.5 


Trachea


0.9 


1.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


0.0 


Lung 


0.1 


0.1 


CNS cancer 
(glio/astro) U-118-MG


1.0 


1.3 


Fetal Lung 


0.9 


0.9 


CNS cancer 
(neuro;met) SK-N- 
AS 


1.2 


1.5 


Lung ca. NCI-N417




0.0 


CNS cancer (astro) 
SF-539 


0.3 


0.4 


Lung ca. LX- 
1 


0.0 


° : ° — 


CNS cancer (astro) 
SNB-75 


0.1 


0.1 


Lung ca. NCI- 
H146 


0.9 




CNS cancer (glio) 
SNB-19 


0.0 


0.0


Lung ca. 
SHP-77 




— ■ — 


CNS cancer (glio)
SF-295 


0.2 


0.2 


Lung ca. 
A549 






Brain (Amygdala) 
Pool 


0.6 


0.5 


Lung ca. NCI- 
H526 






Brain (cerebellum) 


0.1 


0.2 


Lung ca. NCI- 
H23 






Brain (fetal) 


0.8 


0.7 


Lung ca. NCI- 
H460 


0.0 


0.0 


Brain (Hippocampus) 
Pool 


0.5 


0.5 


Lung ca. 
HOP-62 


0.0 


0.0 


Cerebral Cortex Pool 


0.7 


0.7 


Lung ca. NCI- 


0.0 


0.0 


Brain (Substantia 
nigra) Pool 


0.8 


0.7 


Liver 


0.1 


0.1 


Brain (Thalamus)
Pool 


0.8 


0.9 


Fetal Liver 


4.7 


4.1 


Brain (whole) 


1.3 


1.3 


Liver ca. 
HepG2 


0.0 


10 


Spinal Cord Pool 


0.4 


0.5 


Kidney Pool 


0.7 


0.8


Adrenal Gland 


0.7 


3.8 


Fetal Kidney 


0.2 


12 


Pituitary gland Pool 


0.3 


3.3


Renal ca. 786-


0.0 


0.0 


Salivary Gland 


0.5 


0.4 


Renal ca. 
A498 


0.2 


0.2 


Thyroid (female) 


0.4 


0.2 


Renal ca. 
ACHN 


0.0 


0.0 


Pancreatic ca. 
CAPAN2 


0.3 


0.3 
0.3 


Renal ca. UO- 
31 


0.1 


0.0 


Pancreas Pool 


0.4 



Tissue Name 


Rel. Exp.(%) 
Ag6052, Run 
228746663 


Tissue Name 


Rel. Exp.(%)
Ag6052, Run
228746663


Adipose 


1.2 


Renal ca. TK-10 


0.0 


Melanoma*
Hs688(A).T


0.0 


Bladder 


12.9 


Melanoma*
Hs688(B).T


0.0 


Gastric ca. (liver met.) NCI- 
N87 


100.0 


Melanoma* M14


0.0


Gastric ca. KATO III


16.0 


Melanoma*
LOXIMVI


0.0


Colon ca. SW-948 


0.0 


Melanoma* SK-
MEL-5 


0.0 


Colon ca. SW480 


0.4 


Squamous cell 
carcinoma SCC-4 


2.7 


Colon ca.* (SW480 met) 
SW620 


0.0 


Testis Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone
met) PC-3


0.0 


Colon ca. HCT-116


0.4 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.4 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.9 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV- 
3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.6 


Ovarian ca. 
OVCAR-5 


0.6 


Small Intestine Pool 


0.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.4


Fetal Heart 


™ 


Breast ca MCF-7 




Heart Pool 




Breast ca. MDA- 


0.5 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


1.5 


Fetal Skeletal Muscle 


1.5 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


2.1 


Breast ca. MDA-N 


0.0 


Spleen Pool 


3.7 


Breast Pool 


0.9 


Thymus Pool 


1.2 


Trachea 


1.0


CNS cancer (glio/astro) U87- 
MG 


0.0


Lung 


0.0 


CNS cancer (glio/astro) U-118-
MG 


0.8 


Fetal Lung 


0.4 


CNS cancer (neuro;met) SK- 
N-AS 


2.3 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539


0.0 


Lung ca. LX-1


0.0 


CNS cancer (astro) SNB-75


0.0 


Lung ca. NCI-H146 


0.4 


CNS cancer (glio) SNB-19


0.0


Lung ca. SHP-77


0.0 


CNS cancer (glio) SF-295


1.9


Lung ca. A549 


0.0 


Brain (Amygdala) Pool


1.3


Lung ca. NCI-H526


0.0 


Brain (cerebellum)


0.0


Lung ca. NCI-H23


0.0 


Brain (fetal)


0.0


Lung ca. NCI-H460


0.0 


Brain (Hippocampus) Pool


1.2


Lung ca. HOP-62


0.0 


Cerebral Cortex Pool 


1.8 


Lung ca. NCI-H522


0.0 


Brain (Substantia nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.4 


Fetal Liver 


2.8 


Brain (whole) 


0.0 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


1.4 


Kidney Pool 


0.0


Adrenal Gland 


0.4 


Fetal Kidney 


1.6 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.6 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


0.3 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.6 


Table ZF. Panel 4.1D 



Tissue Name 


Rel. Exp.(%)
Ag3909, Run
170127176


Rel. Exp.(%) 
Ag6052, Run 
226202125 


Tissue Name 


Rel. Exp.(%) 
Ag3909, Run 
170127176 


Rel. Exp.(%)
Ag6052, Run
226202125


Secondary Th1 act


0.9 


0.0 


HUVEC IL-1beta


0.1 


0.0


Secondary Th2 act


32.3 


27.7 


HUVEC IFN gamma 


0.9 


0.5 








HUVEC TNF alpha +
IFN gamma 


10.5 


18.3 


Secondary Th1
rest


5.5


5.9 


HUVEC TNF alpha + 
IL4 


0.4 


0.6


Secondary Th2
rest


2.0 


1.0 


HUVEC IL-11


0.0 


2.7 


Secondary Tr1 rest


4.8 


4.3 


Lung Microvascular 
EC none 


0.2 





Primary Th1 act


1.3 


1.1 


Lung Microvascular 
EC TNFalpha + IL-1beta


0.6 


0.0 


Primary Th2 act 


2.0 


2.2 


Microvascular Dermal 
EC none 


0.1 


0.0 


Primary Tr1 act


1.1 


0.4 


Microvascular Dermal
EC TNFalpha + IL-1beta


0.4 


0.0


Primary Th1 rest


3.5 


4.4 


Bronchial epithelium 
TNFalpha + IL-1beta


0.0 


0.0 


Primary Th2 rest 


0.3 


2.1 


Small airway 
epithelium none 


0.0 


0.0 


Primary Tr1 rest


2.5 


0.3 


Small airway 
epithelium TNFalpha 
+ IL-1beta


0.0 


0.0


CD45RA CD4 
lymphocyte act 


9.3 


4.2 


Coronary artery SMC
rest 


0.0 


0.0 


CD45RO CD4 
lymphocyte act 


12.4 


11.7


Coronary artery SMC
TNFalpha + IL-1beta


0.0 


0.0 


CD8 lymphocyte
act 


1.2 


0.6 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest


12.9 


Astrocytes TNFalpha 
+ IL-1 beta 


0.5 


0.3 


Secondary CD8 
lymphocyte act 


0.7 


0.4 


KU-812 (Basophil)
rest 


1.4 


1.1 


CD4 lymphocyte 
none 


0.5 


0.0


KU-812 (Basophil)
PMA/ionomycin


4.7 


6.3 


2ry Th1/Th2/Tr1 anti-CD95 CH11


1.2


.9 


CCD1106
(Keratinocytes) none


0.4 


0.7 


LAK cells rest 


2.8 


2.8 


CCD1106
(Keratinocytes)
TNFalpha + IL-1beta


7.7 


5.5


LAK cells IL-2 


7.5


5.4 


Liver cirrhosis 


0.1


0.0


LAK cells IL-2+IL-12


7.1


.2


NCI-H292 none


0.1


L0












LAK cells IL- 
2+IFN gamma 


9.5 


2.7 


NCI-H292 IL-4 


0.3 


0.0 


LAK cells IL-2+ 
IL-18 


11.7 


7.1 


NCI-H292 IL-9 


0.4 


0.0 


LAK cells 
PMA/ionomycin 


6.9 


5.5 


NCI-H292 IL-13


0.5 


0.0 


NK Cells IL-2 rest


11.3


3.4


NCI-H292 IFN
gamma


2.0 


0.4 


Two Way MLR 3
day


14.0


12.7 


HPAEC none 


0.0 


0.6 


Two Way MLR 5 
day 


5.3 


3.6 


HPAEC TNF alpha + 
IL-1 beta 


2.9 


0.4 


Two Way MLR 7 
day 


1.7 


5.2 


Lung fibroblast none 


0.0 


0.6 


PBMC rest 


0.4 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


9.5 


18.6 


PBMC PWM


2.7 


1.5 


Lung fibroblast IL-4 


0.0 


5.2 


PBMC PHA-L


1.1 


4.2 


Lung fibroblast IL-9 


0.0


0.0


Ramos (B cell)
none


0.0 


0.0 


Lung fibroblast IL-13


0.0 


0.0 


Ramos (B cell)
ionomycin


0.0


0.0 


Lung fibroblast IFN 
gamma 


3.6 


5.7 


B lymphocytes 
PWM 


0.9 


0.7 


Dermal fibroblast 
CCD1070 rest 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.5 


1.5 


Dermal Fibroblast 
CCD1070 TNF alpha


1.1 


1.6 


EOL-1 dbcAMP 


0.6 


0.3 


CCD1070 IL-1 beta 


0.8 


1.6 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast IFN 
gamma 


6.3 


2.9 


Dendritic cells 
none 


0.6 


0.0 


Dermal fibroblast IL- 
4 


0.3 


0.0 


Dendritic cells 
LPS 


20.0 


15.1 


Dermal Fibroblasts 
rest 


0.1 


0.0 


Dendritic cells 
anti-CD40 


0.3 


0.4 


Neutrophils 
TNFa+LPS 


0.1 


1.6 


Monocytes rest 


2.7 


2.8 


Neutrophils rest 


0.3 


0.0 


Monocytes LPS 


100.0 


100.0 


Colon 


0.5 


3.0 


Macrophages rest 


0.9 


0.8 


Lung 


0.9 


1.4 


Macrophages LPS 


36.9 


24.3 


Thymus 


1.0 


1.3 


HUVEC none 


0.0 


0.0 


Kidney


0.4 


0.0 


HUVEC starved 


0.1 


0.0


Table ZG. Panel 5 Islet







Tissue Name


Rel. Exp.(%)
Ag3909, Run
242413199


Tissue Name


Rel. Exp.(%)
Ag3909, Run
242413199


97457_Patient-02go_adipose


90.1


94709_Donor 2 AM - A_adipose


3.0


97476_Patient-07sk_skeletal 
muscle 


22.2 


94710 Donor 2 AM - B_adipose 


1.8


97477_Patient-07ut_uterus 


18.2


94711_Donor 2 AM - C_adipose


0.0


97478_Patient-07pl_placenta


53.6 


94712_Donor 2 AD - A_adipose


1.6 


99167_Bayer Patient 1


100.0 


94713_Donor 2 AD - B_adipose


1.2 


97482_Patient-08ut_uterus 


44.8 


94714_Donor 2 AD - C_adipose


0.0 


97483_Patient-08pl_placenta 


74.2 


94742_Donor 3 U - 

A Mesenchymal Stem Cells 


3.1 


97486_Patient-09sk_skeletal 
muscle 


9.3 


94743_Donor 3 U - 

B Mesenchymal Stem Cells 


3.7 


97487_Patient-09ut_uterus


24.1 


94730_Donor 3 AM - A_adipose 


1.7 


97488_Patient-09pl_placenta


44.1


94731_Donor 3 AM - B_adipose


3.6 


97492_Patient-10ut_uterus 


47.0 


94732_Donor 3 AM - C_adipose


0.0


97493_Patient-10pl_placenta


68.3 


94733_Donor 3 AD - A_adipose 


0.0 


97495_Patient-11go_adipose


17.4 


94734 Donor 3 AD - B_adipose 


0.0 


97496_Patient-11sk_skeletal
muscle 


22.8 


94735_Donor 3 AD - C_adipose 


0.0 


97497_Patient-11ut_uterus


24.3 


77138_Liver_HepG2untreated


1.8 


97498_Patient-11pl_placenta


15.4 


73556_Heart_Cardiac stromal cells 
(primary) 


0.0 


97500_Patient-12go_adipose 


62.4 


81735 Small Intestine 


97.3 


97501_Patient-12sk_skeletal
muscle 


79.6 


72409_Kidney_Proximal
Convoluted Tubule 


5.6 


97502_Patient-12ut_uterus 


30.6 


82685 Small intestine_Duodenum 


54.3 


97503_Patient-12pl_placenta 


29.7 


90650_Adrenal_Adrenocortical 
adenoma 


10.0 


94721_Donor2U- 

A Mesenchymal Stem Cells 


0.0 


72410_Kidney_HRCE 


8.3 


94722_Donor 2 U - 

B Mesenchymal Stem Cells 


0.0 


72411_Kidney_HRE


28.3 


94723_Donor 2 U - 
C_Mesenchymal Stem Cells 


0.0 


73139_Uterus_Uterine smooth
muscle cells 


0.0 


Table ZH. General oncology screening panel_v_2.4 


Tissue Name


Rel. Exp.(%)
Ag3909, Run
268143664


Tissue Name


Rel. Exp.(%)
Ag3909, Run
268143664


Colon cancer 1 


0.0 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


0.0 


Bladder cancer NAT 3 


_ 


Colon cancer 2 


0.0 


Iden^ 


0.0 


Colon cancer NAT 2 


0.0 






Colon cancer 3 


0.1 


Adenocardnoi^ •> 


To¥o 


Colon cancer NAT 3 


0.0 


Adinroarc^ 





Colon malignant cancer 4 


0.0 


Adlnol!^ 

__fgggg^ng?a o t prostate___ 




Colon normal adjacent tissue 

4 


0.0 


Prostate cancer NAT 5 


— 

0.0 


Lung cancer 1 


0.0 


Adenocarcinoma of the prostate 6 


0.0 


Lung NAT 1 


0.0 


Adenocarcinoma of the prostate 7 


0.0 


Lung cancer 2 


0.0 


Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 


0.0 


Adenocarcinoma of the prostate 9 


0.0 


Squamous cell carcinoma 3 


0.0 


Prostate cancer NAT 10


0.0 


Lung NAT 3


0.0


Kidney cancer 1 


0.0 


metastatic melanoma 1 


0.0 


Kidney NAT 1


0.0 


Melanoma 2 


0.0 


Kidney cancer 2 


0.0 


Melanoma 3 


0.0 


Kidney NAT 2 


0.0 


metastatic melanoma 4 


0.0 


Kidney cancer 3 


0.0 


metastatic melanoma 5 


0.0 


Kidney NAT 3 


0.0 


Bladder cancer 1 


0.0 


Kidney cancer 4 


0.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


0.0 


Bladder cancer 2 


0.0 







CNS_neurodegeneration_v1.0 Summary: Ag3909 This panel does not show
differential expression of the CG94235-01 gene in Alzheimer's disease. However, this
expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for
discussion of utility of this gene in the central nervous system. A second experiment with
the probe and primer set Ag6052, which is specific for the CG94235-02 variant, shows
low/undetectable levels of expression in all the samples on this panel (CTs>35). (Data not
shown.)

General_screening_panel_v1.4 Summary: Ag3909 Two experiments with the
same probe and primer produce results that are in excellent agreement, with highest
expression of the CG94235-01 gene in a gastric cancer cell line (CTs=23.6-24.4). Thus,
expression of this gene could be used to differentiate this sample from other samples on
this panel and as a marker of gastric cancer. This gene encodes a putative thymidylate
kinase, a DNA synthesis enzyme necessary for cell growth. Thus, therapeutic modulation
of the expression or function of this gene product may be useful in the treatment of gastric
cancer.

Among tissues with metabolic function, this gene is expressed at moderate to low
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal
muscle, heart, and liver. This widespread expression among these tissues suggests that this
gene product may play a role in normal neuroendocrine and metabolic function and that
dysregulated expression of this gene may contribute to neuroendocrine disorders or
metabolic diseases, such as obesity and diabetes.

In addition, this gene is expressed at much higher levels in fetal lung, liver and
skeletal muscle tissue (CTs=28-30) when compared to expression in the adult counterpart
(CTs=32.5-35). Thus, expression of this gene may be used to differentiate between the
fetal and adult source of these tissues.

This gene is also expressed at moderate to low levels in the CNS, including the
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex.
Therefore, therapeutic modulation of the expression or function of this gene may be useful
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease,
schizophrenia, multiple sclerosis, stroke and epilepsy.

General_screening_panel_v1.5 Summary: Ag6052 Expression of the CG94235-02
variant appears to be restricted to a gastric cancer cell line (CT=31.2) and normal
bladder (CT=34.2) in this panel. Thus, expression of this gene could be used to
differentiate between these samples and other samples on this panel and as a marker to
detect the presence of gastric cancer. Furthermore, therapeutic modulation of the
expression or function of this gene may be effective in the treatment of gastric cancer.

Panel 4.1D Summary: Ag3909/Ag6502 Two experiments with two different 
probe and primer sets produce results that are in very good agreement. Highest expression 
of the CG94235-01 gene is seen in LPS treated monocytes (Ag3909 CT=25.4; Ag6502 
CT=31.1). Prominent levels of expression are also seen in LPS activated macrophages and 
dendritic cells. This transcript may encode a protein that is important in the normal 
regulation of cytokines. Inappropriate regulation of the protein encoded by this gene may 
result in the enhanced and uncontrolled expression of inflammatory cytokines. Therefore, 
designing therapies that could regulate the expression (such as antisense therapies) or 
function of the protein encoded by this gene may be important in the 
treatment of osteoarthritis and rheumatoid arthritis as well as other diseases. 

Panel 5 Islet Summary: Ag3909 Highest expression of the CG94235-01 gene is 
seen in islet cells (CT=33.4). Low but significant levels of expression are seen in other 
metabolic tissues, including adipose, placenta and skeletal muscle. Please see Panel 1.4 for 
discussion of utility of this gene in metabolic disease. 

General oncology screening panel_v_2.4 Summary: Ag3909 Results from one 
experiment with the CG94235-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. A second experiment with the probe and 
primer set Ag6052, which is specific for the CG94235-02 variant, shows low/undetectable 
levels of expression in all the samples on this panel (CTs>35). (Data not shown.) 

AA. CG94692-01 and CG94692-02: Carnitine/Acylcarnitine Translocase 

Expression of gene CG94692-01 and full length physical clone CG94692-02 was 
assessed using the primer-probe set Ag3941, described in Table AAA. Results of the 
RTQ-PCR runs are shown in Tables AAB, AAC, AAD and AAE. Please note that 
CG94692-02 represents a full-length physical clone of the CG94692-01 gene, validating 
the prediction of the gene sequence. 

Table AAA. Probe Name Ag3941 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-ggcttcttcaagggaatgag-3' | 20 | 160 | 305 
Probe | TET-5'-ttgccagcatagctgtggtcaactct-3'-TAMRA | 26 | 188 | 306 
Reverse | 5'-tgttgctatagaccccaaacag-3' | 22 | 217 | 307 
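
The entries of Table AAA can be sanity-checked mechanically. The sketch below is illustrative only: the `Oligo` class and both helper functions are hypothetical names introduced here, and the span calculation assumes that every start position refers to one shared coordinate system in which an oligo occupies start through start + length - 1. It checks that each stated length matches its sequence and reports the region covered by the primer-probe set.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Oligo:
    role: str
    sequence: str       # 5'->3' bases only; TET/TAMRA labels are omitted
    stated_length: int  # "Length" column of the table
    start: int          # "Start Position" column of the table


# Values transcribed from Table AAA (probe set Ag3941).
AG3941 = [
    Oligo("Forward", "ggcttcttcaagggaatgag", 20, 160),
    Oligo("Probe", "ttgccagcatagctgtggtcaactct", 26, 188),
    Oligo("Reverse", "tgttgctatagaccccaaacag", 22, 217),
]


def length_mismatches(oligos):
    """Return the roles whose sequence length differs from the stated length."""
    return [o.role for o in oligos if len(o.sequence) != o.stated_length]


def amplicon_span(oligos):
    """Return (first_base, last_base, length) covered by the oligo set.

    Assumption (not stated in the table): each oligo occupies positions
    start .. start + length - 1 on a single shared coordinate system.
    """
    first = min(o.start for o in oligos)
    last = max(o.start + o.stated_length - 1 for o in oligos)
    return first, last, last - first + 1


if __name__ == "__main__":
    print("length mismatches:", length_mismatches(AG3941))
    print("covered span:", amplicon_span(AG3941))
```

Under that coordinate assumption the three oligos do not overlap, and the same check can be applied to the other primer tables by swapping in their rows.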



Table AAB. CNS_neurodegeneration_v1.0 

Tissue Name | Rel. Exp.(%) Ag3941, Run 212344932 | Tissue Name | Rel. Exp.(%) Ag3941, Run 212344932 
AD 1 Hippo | 6.3 | Control (Path) 3 Temporal Ctx | 4.2 
AD 2 Hippo | 16.5 | Control (Path) 4 Temporal Ctx | 28.9 
AD 3 Hippo | 3.0 | AD 1 Occipital Ctx | 2.0 
AD 4 Hippo | 6.4 | AD 2 Occipital Ctx (Missing) | 0.0 
AD 5 Hippo | 100.0 | AD 3 Occipital Ctx | 0.6 
AD 6 Hippo | 10.6 | AD 4 Occipital Ctx | 17.1 
Control 2 Hippo | 17.3 | AD 5 Occipital Ctx | 28.7 
Control 4 Hippo | 8.0 | AD 6 Occipital Ctx | 
Control (Path) 3 Hippo | 4.5 | Control 1 Occipital Ctx | 2.1 
AD 1 Temporal Ctx | 6.7 | Control 2 Occipital Ctx | 39.5 
AD 2 Temporal Ctx | 27.9 | Control 3 Occipital Ctx | 17.4 
AD 3 Temporal Ctx | 3.9 | Control 4 Occipital Ctx | 5.7 
AD 4 Temporal Ctx | 18.9 | Control (Path) 1 Occipital Ctx | 51.4 
AD 5 Inf Temporal Ctx | 39.8 | Control (Path) 2 Occipital Ctx | 11.5 
AD 5 Sup Temporal Ctx | 25.0 | Control (Path) 3 Occipital Ctx | 1.6 
AD 6 Inf Temporal Ctx | 13.8 | Control (Path) 4 Occipital Ctx | 16.7 
AD 6 Sup Temporal Ctx | 11.9 | Control 1 Parietal Ctx | 4.1 
Control 1 Temporal Ctx | 5.2 | Control 2 Parietal Ctx | 13.7 
Control 2 Temporal Ctx | 33.9 | Control 3 Parietal Ctx | 13.6 
Control 3 Temporal Ctx | 6.0 | Control (Path) 1 Parietal Ctx | 36.9 
Control 3 Temporal Ctx | 10.7 | Control (Path) 2 Parietal Ctx | 21.9 
Control (Path) 1 Temporal Ctx | 45.4 | Control (Path) 3 Parietal Ctx | 2.8 
Control (Path) 2 Temporal Ctx | 23.3 | Control (Path) 4 Parietal Ctx | 42.6 



Table AAC. General_screening_panel_v1.4 

Tissue Name | Rel. Exp.(%) Ag3941, Run 219822166 | Tissue Name | Rel. Exp.(%) Ag3941, Run 219822166 
Adipose | 6.3 | Renal ca. TK-10 | 7.5 
Melanoma* Hs688(A).T | 25.3 | Bladder | 11.3 
Melanoma* Hs688(B).T | 8.1 | Gastric ca. (liver met.) NCI-N87 | 13.8 
Melanoma* M14 | 10.6 | Gastric ca. KATO III | 3.7 
Melanoma* LOXIMVI | 10.5 | Colon ca. SW-948 | 8.2 
Melanoma* SK-MEL-5 | 17.8 | Colon ca. SW480 | 21.3 
Squamous cell carcinoma SCC-4 | 1.3 | Colon ca.* (SW480 met) SW620 | 14.0 
Testis Pool | 7.6 | Colon ca. HT29 | 5.0 
Prostate ca.* (bone met) PC-3 | 32.3 | Colon ca. HCT-116 | 6.3 
Prostate Pool | 8.5 | Colon ca. CaCo-2 | [illegible] 
Placenta | 12.7 | Colon cancer tissue | 7.4 
Uterus Pool | 1.2 | Colon ca. SW1116 | 19.5 
Ovarian ca. OVCAR-3 | 16.4 | Colon ca. Colo-205 | 1.1 
Ovarian ca. SK-OV-3 | 7.2 | Colon ca. SW-48 | 6.6 
Ovarian ca. OVCAR-4 | 3.3 | Colon Pool | 12.9 
Ovarian ca. OVCAR-5 | 55.1 | Small Intestine Pool | 8.2 
Ovarian ca. IGROV-1 | 22.1 | Stomach Pool | 4.5 
Ovarian ca. OVCAR-8 | 6.8 | Bone Marrow Pool | 4.6 
Ovary | 6.8 | Fetal Heart | 7.2 
Breast ca. MCF-7 | 11.2 | Heart Pool | 5.6 
Breast ca. MDA-MB-231 | 11.5 | Lymph Node Pool | 11.0 
Breast ca. BT 549 | 9.1 | Fetal Skeletal Muscle | 15.5 
Breast ca. T47D | 100.0 | Skeletal Muscle Pool | 12.2 
Breast ca. MDA-N | 32.5 | Spleen Pool | 18.6 
Breast Pool | 9.1 | Thymus Pool | 23.0 
Trachea | 16.4 | CNS cancer (glio/astro) U87-MG | 19.8 
Lung | 1.0 | CNS cancer (glio/astro) U-118-MG | 15.6 
Fetal Lung | 29.7 | CNS cancer (neuro;met) SK-N-AS | 5.0 
Lung ca. NCI-N417 | 2.9 | CNS cancer (astro) SF-539 | 3.7 
Lung ca. LX-1 | 18.3 | CNS cancer (astro) SNB-75 | 13.7 
Lung ca. NCI-H146 | 8.3 | CNS cancer (glio) SNB-19 | 18.0 
Lung ca. SHP-77 | 12.7 | CNS cancer (glio) SF-295 | 14.6 
Lung ca. A549 | 18.8 | Brain (Amygdala) Pool | 7.2 
Lung ca. NCI-H526 | 5.2 | Brain (cerebellum) | 5.2 
Lung ca. NCI-H23 | 24.1 | Brain (fetal) | 1.5 
Lung ca. NCI-H460 | 6.3 | Brain (Hippocampus) Pool | 6.2 
Lung ca. HOP-62 | 7.0 | Cerebral Cortex Pool | 8.4 
Lung ca. NCI-H522 | 15.4 | Brain (Substantia nigra) Pool | 12.3 
Liver | 5.2 | Brain (Thalamus) Pool | 9.2 
Fetal Liver | 15.9 | Brain (whole) | 7.7 
Liver ca. HepG2 | ?.5 | Spinal Cord Pool | 3.0 
Kidney Pool | 8.6 | Adrenal Gland | 9.5 
Fetal Kidney | >.0 | Pituitary gland Pool | 4.3 
Renal ca. 786-0 | 5.4 | Salivary Gland | 3.7 
Renal ca. A498 | .5 | Thyroid (female) | 13.7 
Renal ca. ACHN | .5 | Pancreatic ca. CAPAN2 | [illegible] 
Renal ca. UO-31 | .3 | Pancreas Pool | 7.0 



Table AAD. Panel 4.1D 

Tissue Name | Rel. Exp.(%) Ag3941, Run 70684829 | Tissue Name | Rel. Exp.(%) Ag3941, Run 70684829 
Secondary Th1 act | 36.9 | HUVEC IL-1beta | ).3 
Secondary Th2 act | 43.5 | HUVEC IFN gamma | 5.4 
Secondary Tr1 act | 47.3 | HUVEC TNF alpha + IFN gamma | 5.9 
Secondary Th1 rest | 46.0 | HUVEC TNF alpha + IL4 | 8.7 
Secondary Th2 rest | 45.1 | HUVEC IL-11 | 7.0 
Secondary Tr1 rest | 58.6 | Lung Microvascular EC none | 11.7 
Primary Th1 act | 31.6 | Lung Microvascular EC TNFalpha + IL-1beta | 14.2 
Primary Th2 act | 72.7 | Microvascular Dermal EC none | 6.0 
Primary Tr1 act | 47.0 | Microvascular Dermal EC TNFalpha + IL-1beta | 4.8 
Primary Th1 rest | 40.6 | Bronchial epithelium TNFalpha + IL1beta | 2.4 
Primary Th2 rest | 26.2 | Small airway epithelium none | Y.4 
Primary Tr1 rest | 100.0 | Small airway epithelium TNFalpha + IL-1beta | 4.1 
CD45RA CD4 lymphocyte act | 27.9 | Coronary artery SMC rest | 9.4 
CD45RO CD4 lymphocyte act | 45.1 | Coronary artery SMC TNFalpha + IL-1beta | 14.2 
CD8 lymphocyte act | 44.1 | Astrocytes rest | 4.2 
Secondary CD8 lymphocyte rest | 37.1 | Astrocytes TNFalpha + IL-1beta | 7.1 
Secondary CD8 lymphocyte act | 33.2 | KU-812 (Basophil) rest | 16.8 
CD4 lymphocyte none | 63.7 | KU-812 (Basophil) PMA/ionomycin | 11.7 
2ry Th1/Th2/Tr1 anti-CD95 CH11 | 85.9 | CCD1106 (Keratinocytes) none | 2.6 
LAK cells rest | 73.7 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 0.8 
LAK cells IL-2 | 68.3 | Liver cirrhosis | 8.1 
LAK cells IL-2+IL-12 | 42.9 | NCI-H292 none | 19.2 
LAK cells IL-2+IFN gamma | 35.8 | NCI-H292 IL-4 | 17.9 
LAK cells IL-2+IL-18 | 59.5 | NCI-H292 IL-9 | 38.2 
LAK cells PMA/ionomycin | 14.7 | NCI-H292 IL-13 | ll 1 9 
NK Cells IL-2 rest | 81.2 | NCI-H292 IFN gamma | 26.1 
Two Way MLR 3 day | 63.3 | HPAEC none | 4.0 
Two Way MLR 5 day | 31.9 | HPAEC TNF alpha + IL-1 beta | 13.3 
Two Way MLR 7 day | 28.9 | Lung fibroblast none | 6.8 
PBMC rest | 26.8 | Lung fibroblast TNF alpha + IL-1 beta | 9.0 
PBMC PWM | 18.2 | Lung fibroblast IL-4 | 7.6 
PBMC PHA-L | 33.0 | Lung fibroblast IL-9 | 19.6 
Ramos (B cell) none | 7.2 | Lung fibroblast IL-13 | 7.9 
Ramos (B cell) ionomycin | 20.6 | Lung fibroblast IFN gamma | 7.6 
B lymphocytes PWM | 25.0 | Dermal fibroblast CCD1070 rest | 11.3 
B lymphocytes CD40L and IL-4 | 74.2 | Dermal fibroblast CCD1070 TNF alpha | 40.1 
EOL-1 dbcAMP | 2.5 | Dermal fibroblast CCD1070 IL-1 beta | 1.0 
EOL-1 dbcAMP PMA/ionomycin | 1.6 | Dermal fibroblast IFN gamma | 4.9 
Dendritic cells none | 23.8 | Dermal fibroblast IL-4 | 15.4 
Dendritic cells LPS | 23.2 | Dermal Fibroblasts rest | 8.7 
Dendritic cells anti-CD40 | 36.3 | Neutrophils TNFa+LPS | 2.9 
Monocytes rest | 31.6 | Neutrophils rest | 11.3 
Monocytes LPS | 16.4 | Colon | 40.9 
Macrophages rest | 30.1 | Lung | 37.6 
Macrophages LPS | 13.8 | Thymus | 69.7 
HUVEC none | 4.1 | Kidney | 85.9 
HUVEC starved | 5.8 | | 







Table AAE. General oncology screening panel_v_2.4 

Tissue Name | Rel. Exp.(%) Ag3941, Run 268035089 | Tissue Name | Rel. Exp.(%) Ag3941, Run 268035089 
Colon cancer 1 | 7.9 | Bladder cancer NAT 2 | 0.0 
Colon cancer NAT 1 | 1.2 | Bladder cancer NAT 3 | 0.0 
Colon cancer 2 | 6.5 | Bladder cancer NAT 4 | 2.0 
Colon cancer NAT 2 | 7.9 | Adenocarcinoma of the prostate 1 | 13.4 
Colon cancer 3 | 8.0 | Adenocarcinoma of the prostate 2 | 0.2 
Colon cancer NAT 3 | 4.8 | Adenocarcinoma of the prostate 3 | 7.4 
Colon malignant cancer 4 | 9.3 | Adenocarcinoma of the prostate 4 | 6.9 
Colon normal adjacent tissue 4 | 2.7 | Prostate cancer NAT 5 | 0.4 
Lung cancer 1 | 7.5 | Adenocarcinoma of the prostate 6 | [illegible] 
Lung NAT 1 | 1.2 | Adenocarcinoma of the prostate 7 | 3.4 
Lung cancer 2 | 21.5 | Adenocarcinoma of the prostate 8 | 1.3 
Lung NAT 2 | 1.1 | Adenocarcinoma of the prostate 9 | 15.8 
Squamous cell carcinoma 3 | 15.7 | Prostate cancer NAT 10 | 1.1 
Lung NAT 3 | 0.4 | Kidney cancer 1 | 16.6 
metastatic melanoma 1 | 7.9 | Kidney NAT 1 | 4.2 
Melanoma 2 | 0.2 | Kidney cancer 2 | 100.0 
Melanoma 3 | 0.8 | Kidney NAT 2 | 5.2 
metastatic melanoma 4 | 19.2 | Kidney cancer 3 | 16.0 
metastatic melanoma 5 | 23.5 | Kidney NAT 3 | 4.0 
Bladder cancer 1 | 0.5 | Kidney cancer 4 | 14.6 
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 18.3 
Bladder cancer 2 | 1.9 | | 





CNS_neurodegeneration_v1.0 Summary: Ag3941 The CG94692-01 gene 
appears to be slightly downregulated in the brains of Alzheimer's patients, when compared 
to expression in normal control brains. Therefore, up-regulation of this gene or its protein 
product, or treatment with specific agonists for this receptor may be of use in reversing the 
dementia/memory loss and neuronal death associated with this disease. 

General_screening_panel_v1.4 Summary: Ag3941 This gene is widely 
expressed in this panel, with highest expression in a breast cancer cell line (CT=28.6). 
Prominent levels of expression are also seen in samples derived from prostate cancer, 
ovarian cancer and melanoma. Thus, expression of this gene could be used to differentiate 
these samples from the rest of the samples on this panel and as a marker of these cancers. 
Furthermore, therapeutic modulation of the expression or function of this gene product 
may be useful in the treatment of breast, ovarian, prostate and melanoma cancers. Overall, 
the widespread expression of this gene suggests a role for this gene in cell survival and 
growth. 
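
The Rel. Exp.(%) columns scale every sample to the highest-expressing sample on the panel, which is listed as 100.0. The sketch below shows one common way to relate such percentages to CT values. It assumes, as a simplification not stated in this section, that the template doubles with every cycle, so that relative expression is 100 x 2^(CTmin - CT). The function names are hypothetical.

```python
import math


def relative_expression(ct: float, ct_min: float) -> float:
    """Percent expression relative to the sample with the lowest CT.

    Assumes perfect doubling per cycle (a hypothetical simplification).
    """
    return 100.0 * 2.0 ** (ct_min - ct)


def ct_from_relative(rel_pct: float, ct_min: float) -> float:
    """Invert relative_expression: the CT implied by a Rel. Exp.(%) value."""
    if rel_pct <= 0:
        raise ValueError("a relative expression of 0 has no finite CT")
    return ct_min + math.log2(100.0 / rel_pct)


if __name__ == "__main__":
    ct_min = 28.6  # CT quoted in the summary for the top sample on the panel
    # Rel. Exp.(%) values for two samples, taken from the panel table.
    for name, rel in [("Fetal Lung", 29.7), ("Lung", 1.0)]:
        print(f"{name}: implied CT ~ {ct_from_relative(rel, ct_min):.1f}")
```

Under this assumption the CT values implied by the tabulated percentages for fetal and adult lung come out close to the CT values quoted for those tissues in the summary, which suggests the tables were scaled in a similar way.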

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic function and that 
dysregulated expression of this gene may contribute to neuroendocrine disorders or 
metabolic diseases, such as obesity and diabetes. 

In addition, this gene is expressed at much higher levels in fetal lung (CT=30.5) 
when compared to expression in the adult counterpart (CT=35.2). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 
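
To put the fetal and adult lung CT values on a more intuitive scale, a CT difference can be converted into an approximate fold difference. This is a minimal sketch, assuming equal RNA input for both samples and a fixed per-cycle amplification factor. The function name and the 1.9 efficiency value are illustrative choices, not parameters taken from this document.

```python
def fold_difference(ct_lower_expr: float, ct_higher_expr: float,
                    efficiency: float = 2.0) -> float:
    """Approximate fold difference in template implied by two CT values.

    `efficiency` is the assumed per-cycle amplification factor
    (2.0 means perfect doubling); it is a hypothetical parameter.
    """
    if efficiency <= 1.0:
        raise ValueError("efficiency must be greater than 1")
    return efficiency ** (ct_lower_expr - ct_higher_expr)


if __name__ == "__main__":
    fetal_lung_ct = 30.5  # from the summary above
    adult_lung_ct = 35.2  # from the summary above
    for eff in (2.0, 1.9):  # perfect doubling vs. a slightly lower efficiency
        ratio = fold_difference(adult_lung_ct, fetal_lung_ct, eff)
        print(f"efficiency {eff}: fetal/adult ~ {ratio:.0f}-fold")
```

With perfect doubling, the 4.7-cycle gap corresponds to a difference of roughly 26-fold. A lower assumed efficiency gives a smaller estimate, so the figure is best read as an order of magnitude.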

This gene is also expressed at moderate to low levels in the CNS, including the 
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 4.1D Summary: Ag3941 Highest expression of the CG94692-01 gene is 
seen in primary resting Tr1 cells (CT=30.9). The transcript is expressed at higher levels in 
lymphocytes, which is consistent with the expression in the thymus and lymph node on 
Panel 1.4. Therefore, therapeutics designed with this sequence or the protein it encodes 
could be important in regulating T cell activation and be important for immune modulation 
and in treating T and B cell mediated diseases such as asthma, allergy, COPD, arthritis, 
psoriasis, lupus and IBD. 

General oncology screening panel_v_2.4 Summary: Ag3941 Highest 
expression of the CG94692-01 gene is seen in kidney cancer (CT=29.5). In addition, 
higher levels of expression are seen in kidney cancers and lung cancers than in the 
corresponding normal adjacent tissue. Thus, expression of this gene could be used to 
differentiate these samples from the rest of the samples on this panel and as a marker of 
these cancers. Furthermore, therapeutic modulation of the expression or function of this 
gene product may be useful in the treatment of kidney and lung cancers. 
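
The tumor versus normal adjacent tissue (NAT) comparison can be made explicit by dividing each tumor value by its NAT value. The sketch below uses the Rel. Exp.(%) values listed for Ag3941 on the oncology panel. It assumes that "cancer N" and "NAT N" come from the same specimen pair, which this section does not state, and the helper names are hypothetical.

```python
# (tumor, NAT) Rel. Exp.(%) values transcribed from the oncology panel
# for Ag3941. Pairing by sample number is an assumption.
PAIRS = {
    "Kidney 1": (16.6, 4.2),
    "Kidney 2": (100.0, 5.2),
    "Kidney 3": (16.0, 4.0),
    "Kidney 4": (14.6, 18.3),
    "Lung 1": (7.5, 1.2),
    "Lung 2": (21.5, 1.1),
}


def tumor_to_nat_ratio(tumor: float, nat: float) -> float:
    """Ratio of tumor to NAT relative expression."""
    if nat <= 0:
        raise ValueError("NAT value must be positive to form a ratio")
    return tumor / nat


def summarize(pairs):
    """Map each pair name to its tumor/NAT ratio, rounded to one decimal."""
    return {name: round(tumor_to_nat_ratio(t, n), 1)
            for name, (t, n) in pairs.items()}


if __name__ == "__main__":
    for name, ratio in summarize(PAIRS).items():
        direction = "higher in tumor" if ratio > 1 else "not higher in tumor"
        print(f"{name}: {ratio}x ({direction})")
```

Five of the six listed pairs show higher expression in the tumor sample. The fourth kidney pair runs the other way, so the trend is not uniform across specimens.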



AB. CG94724-01: Acylcarnitine Translocase 

Expression of gene CG94724-01 was assessed using the primer-probe set Ag4071, 
described in Table ABA. Results of the RTQ-PCR runs are shown in Table ABB. 

Table ABA. Probe Name Ag4071 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-ctggtacattcaccacaggc-3' | 20 | 363 | 308 
Probe | TET-5'-cctggagaacccatcaagtccttgtt-3'-TAMRA | 26 | 392 | 309 
Reverse | 5'-acttggtttcccctgaagaa-3' | 20 | 430 | 310 



Table ABB. Panel 4.1D 

Tissue Name | Rel. Exp.(%) Ag4071, Run 171807825 | Tissue Name | Rel. Exp.(%) Ag4071, Run 171807825 
Secondary Th1 act | 0.0 | HUVEC IL-1beta | 0.0 
Secondary Th2 act | 0.0 | HUVEC IFN gamma | 0.0 
Secondary Tr1 act | 0.0 | HUVEC TNF alpha + IFN gamma | 0.0 
Secondary Th1 rest | 0.0 | HUVEC TNF alpha + IL4 | 0.0 
Secondary Th2 rest | 0.0 | HUVEC IL-11 | 0.0 
Secondary Tr1 rest | 0.0 | Lung Microvascular EC none | 0.0 
Primary Th1 act | 0.0 | Lung Microvascular EC TNFalpha + IL-1beta | 0.0 
Primary Th2 act | 0.0 | Microvascular Dermal EC none | 0.0 
Primary Tr1 act | 0.0 | Microvascular Dermal EC TNFalpha + IL-1beta | 0.0 
Primary Th1 rest | 0.0 | Bronchial epithelium TNFalpha + IL1beta | 0.0 
Primary Th2 rest | 0.0 | Small airway epithelium none | 0.0 
Primary Tr1 rest | 0.0 | Small airway epithelium TNFalpha + IL-1beta | 0.0 
CD45RA CD4 lymphocyte act | 0.0 | Coronary artery SMC rest | 0.0 
CD45RO CD4 lymphocyte act | 0.0 | Coronary artery SMC TNFalpha + IL-1beta | 0.0 
CD8 lymphocyte act | 0.0 | Astrocytes rest | 0.0 
Secondary CD8 lymphocyte rest | 3.4 | Astrocytes TNFalpha + IL-1beta | 0.0 
Secondary CD8 lymphocyte act | 0.0 | KU-812 (Basophil) rest | 0.0 
CD4 lymphocyte none | 4.9 | KU-812 (Basophil) PMA/ionomycin | 0.0 
2ry Th1/Th2/Tr1 anti-CD95 CH11 | D.O | CCD1106 (Keratinocytes) none | D.O 
LAK cells rest | 3.0 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 3.0 
LAK cells IL-2 | ).0 | Liver cirrhosis | ).0 
LAK cells IL-2+IL-12 | .0 | NCI-H292 none | ).0 
LAK cells IL-2+IFN gamma | 0.7 | NCI-H292 IL-4 | .0 
LAK cells IL-2+IL-18 | .0 | NCI-H292 IL-9 | .0 
LAK cells PMA/ionomycin | -0 | NCI-H292 IL-13 | .0 
NK Cells IL-2 rest | 0.0 | NCI-H292 IFN gamma | 0.0 
Two Way MLR 3 day | 0.0 | HPAEC none | 0.0 
Two Way MLR 5 day | 0.0 | HPAEC TNF alpha + IL-1 beta | 0.0 
Two Way MLR 7 day | 4.2 | Lung fibroblast none | 0.0 
PBMC rest | 0.0 | Lung fibroblast TNF alpha + IL-1 beta | [illegible] 
PBMC PWM | 0.0 | Lung fibroblast IL-4 | 0.0 
PBMC PHA-L | 0.0 | Lung fibroblast IL-9 | 0.0 
Ramos (B cell) none | 0.0 | Lung fibroblast IL-13 | 7.9 
Ramos (B cell) ionomycin | 0.0 | Lung fibroblast IFN gamma | 0.0 
B lymphocytes PWM | 0.0 | Dermal fibroblast CCD1070 rest | 0.0 
B lymphocytes CD40L and IL-4 | 0.0 | Dermal fibroblast CCD1070 TNF alpha | 5.1 
EOL-1 dbcAMP | 0.0 | Dermal fibroblast CCD1070 IL-1 beta | 0.0 
EOL-1 dbcAMP PMA/ionomycin | 0.0 | Dermal fibroblast IFN gamma | 0.0 
Dendritic cells none | 0.0 | Dermal fibroblast IL-4 | 0.0 
Dendritic cells LPS | 0.0 | Dermal Fibroblasts rest | 0.0 
Dendritic cells anti-CD40 | 0.0 | Neutrophils TNFa+LPS | 0.0 
Monocytes rest | 0.0 | Neutrophils rest | 0.0 
Monocytes LPS | 0.0 | Colon | 0.0 
Macrophages rest | 0.0 | Lung | 9.5 
Macrophages LPS | 0.0 | Thymus | 6.9 
HUVEC none | 0.0 | Kidney | 100.0 
HUVEC starved | 3.7 | | 







CNS_neurodegeneration_v1.0 Summary: Ag4071 Expression of the CG94724-01 
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

General_screening_panel_v1.4 Summary: Ag4071 Expression of the CG94724-01 
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

Panel 4.1D Summary: Ag4071 Expression of the CG94724-01 gene is 
exclusively seen in kidney (CT=32.9). Therefore, expression of this gene can be used to 
distinguish kidney samples from the other samples in this panel. In addition, therapeutic 
modulation of this gene through the use of small molecule drugs may be beneficial in the 
treatment of autoimmune and inflammatory diseases that affect the kidney, including lupus 
and glomerulonephritis. 

The CG94724-01 gene codes for a carnitine/acylcarnitine translocase (CACT) 
homologue. Carnitine-acylcarnitine translocase is 1 of 10 closely related mitochondrial-membrane 
carrier proteins that shuttle substrates between the cytosol and the 
intramitochondrial matrix space. Deficiency in CACT causes a defect in the co-transport of 
free and esterified carnitine across the inner mitochondrial membrane. Recently, Choong 
et al (2001, Pediatr Dev Pathol 4(6):573-9, PMID: 11826365) reported a newborn with 
CACT deficiency who died at 72 h of age from severe, intractable cardiac 
tachyarrhythmia, despite an improvement in his neurological and biochemical 
status. Postmortem examination showed marked steatosis 
of myocardium, liver, and kidney. Thus, the CG94724-01 gene may also play a role in the 
pathology of disorders associated with CACT deficiency, and therapeutic modulation of 
this gene product could be useful in the treatment of these disorders, including lethal 
cardiac tachyarrhythmia. 

Panel 5 Islet Summary: Ag4071 Expression of the CG94724-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4071 Expression of the 
CG94724-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 
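
The summaries above treat CT values greater than 35 as low/undetectable. A minimal sketch of that rule as a filter is shown below. Only the cutoff of 35 and the kidney CT of 32.9 come from the text. The other sample names and CT values are hypothetical placeholders, as are the function names.

```python
CT_CUTOFF = 35.0  # CTs above this value are described as low/undetectable


def is_detectable(ct: float, cutoff: float = CT_CUTOFF) -> bool:
    """True when the CT is at or below the cutoff."""
    return ct <= cutoff


def expressed_samples(ct_by_sample: dict, cutoff: float = CT_CUTOFF) -> list:
    """Names of the samples whose CT passes the cutoff, sorted alphabetically."""
    return sorted(name for name, ct in ct_by_sample.items()
                  if is_detectable(ct, cutoff))


if __name__ == "__main__":
    example = {
        "Kidney": 32.9,                   # CT quoted in the Panel 4.1D summary
        "Sample A (hypothetical)": 36.4,  # placeholder value
        "Sample B (hypothetical)": 38.0,  # placeholder value
    }
    print(expressed_samples(example))
```

Applied to a full panel, a filter of this kind would reproduce the "exclusively seen in kidney" reading only if every other sample on that panel had a CT above the cutoff.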

AC. CG94946-01 and CG94946-02 and CG94946-04 and CG94946-05 and CG94946-06 
and CG94946-07: Agrin Precursor 

Expression of gene CG94946-01 and variants CG94946-02, CG94946-04, 
CG94946-05, CG94946-06 and CG94946-07 was assessed using the primer-probe sets 
Ag3605 and Ag3974, described in Tables ACA and ACB. Results of the RTQ-PCR runs 
are shown in Tables ACC, ACD, ACE, ACF, ACG and ACH. Please note that variants 
CG94946-02, CG94946-04 and CG94946-07 correspond to the probe and primer set 
Ag3974 only. 

Table ACA. Probe Name Ag3605 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-gaccccaagtcagaactgttc-3' | 21 | 3514 | 311 
Probe | TET-5'-attgagagcaccctggacgacctctt-3'-TAMRA | 26 | 3553 | 312 
Reverse | 5'-gaaatccttcttgacgtctgaa-3' | 22 | 3585 | 313 



Table ACB. Probe Name Ag3974 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-gacaccaggatcttctttgtga-3' | 22 | 379 | 314 
Probe | TET-5'-catacctgtggccagcccacaag-3'-TAMRA | 23 | 413 | 315 
Reverse | 5'-gagttgagcatcagctcgtt-3' | 20 | 436 | 316 



Table ACC. CNS_neurodegeneration_v1.0 

Tissue Name | Rel. Exp.(%) Ag3605, Run 210997601 | Rel. Exp.(%) Ag3974, Run 212348647 | Tissue Name | Rel. Exp.(%) Ag3605, Run 210997601 | Rel. Exp.(%) Ag3974, Run 212348647 
AD 1 Hippo | 12.2 | 28.7 | Control (Path) 3 Temporal Ctx | 10.9 | 15.4 
AD 2 Hippo | 27.2 | 36.3 | Control (Path) 4 Temporal Ctx | 57.0 | 46.0 
AD 3 Hippo | 15.3 | 19.2 | AD 1 Occipital Ctx | 26A | 28.9 
AD 4 Hippo | 34.4 | [illegible] | AD 2 Occipital Ctx (Missing) | 0.0 | 0.0 
AD 5 Hippo | 100.0 | 80.7 | AD 3 Occipital Ctx | 12.0 | 19.5 
AD 6 Hippo | 32.3 | 42.6 | AD 4 Occipital Ctx | 24.7 | 21.0 
Control 2 Hippo | 37.9 | 42.3 | AD 5 Occipital Ctx | 44.4 | 47.6 
Control 4 Hippo | 20.7 | 34.9 | AD 6 Occipital Ctx | 13.5 | 14.' 
Control (Path) 3 Hippo | 7.5 | 12.4 | Control 1 Occipital Ctx | 10.0 | 20.4 
AD 1 Temporal Ctx | 28.7 | 32.1 | Control 2 Occipital Ctx | 51.8 | 55.1 
AD 2 Temporal Ctx | 31.9 | 31.2 | Control 3 Occipital Ctx | 17.3 | 22.5 
AD 3 Temporal Ctx | 17.4 | 20.2 | Control 4 Occipital Ctx | 14.9 | 22.2 
AD 4 Temporal Ctx | 31.2 | 24.0 | Control (Path) 1 Occipital Ctx | 90.8 | 71.2 
AD 5 Inf Temporal Ctx | 87.1 | 100.0 | Control (Path) 2 Occipital Ctx | 19.8 | 17.4 
AD 5 Sup Temporal Ctx | 51.4 | 58.6 | Control (Path) 3 Occipital Ctx | 7.4 | 15.4 
AD 6 Inf Temporal Ctx | 42.9 | 40.6 | Control (Path) 4 Occipital Ctx | 44.8 | 38.7 
AD 6 Sup Temporal Ctx | 51.1 | 35.1 | Control 1 Parietal Ctx | 14.6 | 18.2 
Control 1 Temporal Ctx | 17.3 | 19.8 | Control 2 Parietal Ctx | 53.6 | 
Control 2 Temporal Ctx | 50.3 | 48.3 | Control 3 Parietal Ctx | 18.6 | 21.2 
Control 3 Temporal Ctx | 23.7 | | Control (Path) 1 Parietal Ctx | 76.3 | 51.4 
Control 3 Temporal Ctx | 20.3 | 25.0 | Control (Path) 2 Parietal Ctx | 36.6 | 32.3 
Control (Path) 1 Temporal Ctx | 78.5 | 63.7 | Control (Path) 3 Parietal Ctx | 12.0 | 11.9 
Control (Path) 2 Temporal Ctx | 50.3 | 43.5 | Control (Path) 4 Parietal Ctx | 64.2 | 58.6 



Tissue Name | Rel. Exp.(%) Ag3605, Run 213406184 | Rel. Exp.(%) Ag3974, Run 217508632 | Tissue Name | Rel. Exp.(%) Ag3605, Run 213406184 | Rel. Exp.(%) Ag3974, Run 217508632 
Adipose | 1.4 | 1.5 | Renal ca. TK-10 | 19.3 | 6.4 
Melanoma* Hs688(A).T | 2.6 | 3.2 | Bladder | 3.4 | 9.0 
Melanoma* Hs688(B).T | 4.6 | 4.2 | Gastric ca. (liver met.) NCI-N87 | 37.7 | 80.7 
Melanoma* M14 | 6.7 | 6.4 | Gastric ca. KATO III | 17.8 | 17.7 
Melanoma* LOXIMVI | 4.8 | 4.0 | Colon ca. SW-948 | 9.2 | 7.8 
Melanoma* SK-MEL-5 | 2.6 | 4.2 | Colon ca. SW480 | 25.0 | 32.3 
Squamous cell carcinoma SCC-4 | 9.0 | 8.4 | Colon ca.* (SW480 met) SW620 | 5.0 | 4.6 
Testis Pool | 1.3 | 1.1 | Colon ca. HT29 | 26.8 | 30.6 
Prostate ca.* (bone met) PC-3 | 21.0 | 24.8 | Colon ca. HCT-116 | 4.9 | 5.8 
Prostate Pool | 0.9 | 0.8 | Colon ca. CaCo-2 | 13.6 | 10.4 
Placenta | 0.9 | 1.3 | Colon cancer tissue | 10.2 | 10.0 
Uterus Pool | 0.4 | 0.4 | Colon ca. SW1116 | 5.1 | 3.6 
Ovarian ca. OVCAR-3 | 77.4 | 66.9 | Colon ca. Colo-205 | 1.8 | 1.5 
Ovarian ca. SK-OV-3 | 42.0 | 36.3 | Colon ca. SW-48 | 1.2 | 0.7 
Ovarian ca. OVCAR-4 | 9.5 | 12.7 | Colon Pool | 1.9 | 1.3 
Ovarian ca. OVCAR-5 | 39.2 | 44.4 | Small Intestine Pool | 0.6 | 1.0 
Ovarian ca. IGROV-1 | 22.1 | 27.7 | Stomach Pool | 1.5 | 1.2 
Ovarian ca. OVCAR-8 | 18.0 | 14.9 | Bone Marrow Pool | 0.6 | 0.5 
Ovary | 1.5 | 1.9 | Fetal Heart | 1.4 | 1.0 
Breast ca. MCF-7 | 7.7 | 9.7 | Heart Pool | 0.7 | [illegible] 
Breast ca. MDA-MB-231 | 22.7 | 31.2 | Lymph Node Pool | 2.1 | 2.0 
Breast ca. BT 549 | 13.2 | 10.1 | Fetal Skeletal Muscle | 0.9 | 0.5 
Breast ca. T47D | 100.0 | 100.0 | Skeletal Muscle Pool | 0.4 | 0.5 
Breast ca. MDA-N | 4.8 | 4.2 | Spleen Pool | 0.7 | 0.7 
Breast Pool | | 1.6 | Thymus Pool | 1.8 | 2.2 
Trachea | 2.8 | 2.6 | CNS cancer (glio/astro) U87-MG | 5.9 | 6.0 
Lung | 0.2 | 0.1 | CNS cancer (glio/astro) U-118-MG | 11.7 | 11.2 
Fetal Lung | | | CNS cancer (neuro;met) SK-N-AS | 1.2 | 0.9 
Lung ca. NCI-N417 | 1.4 | 0.7 | CNS cancer (astro) SF-539 | 6.7 | 5.0 
Lung ca. LX-1 | 10.5 | 11.0 | CNS cancer (astro) SNB-75 | 22.8 | 32.3 
Lung ca. NCI-H146 | 0.1 | 0.1 | CNS cancer (glio) SNB-19 | 25.0 | 20.2 
Lung ca. SHP-77 | 1.1 | 0.8 | CNS cancer (glio) SF-295 | 35.1 | 38.2 
Lung ca. A549 | 15.6 | 10.4 | Brain (Amygdala) Pool | 1.7 | 1.3 
Lung ca. NCI-H526 | 5.4 | 1.6 | Brain (cerebellum) | 1.4 | 1.0 
Lung ca. NCI-H23 | 18.9 | 20.6 | Brain (fetal) | 7.0 | 2.8 
Lung ca. NCI-H460 | 11.5 | 9.3 | Brain (Hippocampus) Pool | 1.6 | 0.9 
Lung ca. HOP-62 | 23.7 | 23.0 | Cerebral Cortex Pool | 1.9 | 0.9 
Lung ca. NCI-H522 | 1.8 | 2.3 | Brain (Substantia nigra) Pool | 2.8 | 1.7 
Liver | 0.5 | 0.6 | Brain (Thalamus) Pool | 2.7 | 1.6 
Fetal Liver | 0.8 | 1.4 | Brain (whole) | 3.4 | 1.1 
Liver ca. HepG2 | 15.5 | 12.6 | Spinal Cord Pool | 1.8 | 1.4 
Kidney Pool | 1.5 | 2.5 | Adrenal Gland | 0.2 | 0.4 
Fetal Kidney | 5.8 | 4.6 | Pituitary gland Pool | 0.3 | 0.2 
Renal ca. 786-0 | 46.3 | 39.5 | Salivary Gland | 1.1 | 1.3 
Renal ca. A498 | 13.8 | 7.9 | Thyroid (female) | 3.3 | 3.7 
Renal ca. ACHN | 14.3 | 15.9 | Pancreatic ca. CAPAN2 | 23.7 | 27.7 
Renal ca. UO-31 | 41.5 | 38.7 | Pancreas Pool | 3.0 | 4.1 

Table ACE. Panel 2.2 

Tissue Name | Rel. Exp.(%) Ag3605, Run 173764229 | Tissue Name | Rel. Exp.(%) Ag3605, Run 173764229 
Normal Colon | 4.7 | Kidney Margin (OD04348) | 100.0 
Colon cancer (OD06064) | [illegible] | Kidney malignant cancer (OD06204B) | 21.6 
Colon Margin (OD06064) | [illegible] | Kidney normal adjacent tissue (OD06204E) | 23.2 
Colon cancer (OD06159) | 2.1 | Kidney Cancer (OD04450-01) | 53.2 
Colon Margin (OD06159) | 1.8 | Kidney Margin (OD04450-03) | 21.3 
Colon cancer (OD06297-04) | 2.0 | Kidney Cancer 8120613 | 1.1 
Colon Margin (OD06297-05) | 3.0 | Kidney Margin 8120614 | 14.1 
CC Gr.2 ascend colon (OD03921) | 3.6 | Kidney Cancer 9010320 | 20.3 
CC Margin (OD03921) | 1.3 | Kidney Margin 9010321 | 15.0 
Colon cancer metastasis (OD06104) | 1.1 | Kidney Cancer 8120607 | 71.7 
Lung Margin (OD06104) | 1.0 | Kidney Margin 8120608 | 12.2 
Colon mets to lung (OD04451-01) | 4.5 | Normal Uterus | 7.3 
Lung Margin (OD04451-02) | 6.7 | Uterine Cancer 064011 | 6.9 
Normal Prostate | 2.1 | Normal Thyroid | 4.3 
Prostate Cancer (OD04410) | 3.9 | Thyroid Cancer 064010 | 27.0 
Prostate Margin (OD04410) | 3.4 | Thyroid Cancer A302152 | 19.1 
Normal Ovary | 7.6 | Thyroid Margin A302153 | 8.1 
Ovarian cancer (OD06283-03) | 27.9 | Normal Breast | 14.0 
Ovarian Margin (OD06283-07) | 2.0 | Breast Cancer (OD04566) | 13.4 
Ovarian Cancer 064008 | 16.0 | Breast Cancer 1024 | 35.1 
Ovarian cancer (OD06145) | 10.1 | Breast Cancer (OD04590-01) | 31.6 
Ovarian Margin (OD06145) | 8.2 | Breast Cancer Mets (OD04590-03) | 8.7 
Ovarian cancer (OD06455-03) | 28.9 | Breast Cancer Metastasis (OD04655-05) | 13.3 
Ovarian Margin (OD06455-07) | 1.9 | Breast Cancer 064006 | 21.5 
Normal Lung | 2.9 | Breast Cancer 9100266 | 17.7 
Invasive poor diff. lung adeno (OD04945-01) | 9.4 | Breast Margin 9100265 | 16.5 
Lung Margin (OD04945-03) | 7.9 | Breast Cancer A209073 | 13.2 
Lung Malignant Cancer (OD03126) | 7.5 | Breast Margin A2090734 | 35.4 
Lung Margin (OD03126) | 7.0 | Breast cancer (OD06083) | 24.5 
Lung Cancer (OD05014A) | 17.0 | Breast cancer node metastasis (OD06083) | 21.5 
Lung Margin (OD05014B) | 11.7 | Normal Liver | 5.0 
Lung cancer (OD06081) | 12.2 | Liver Cancer 1026 | 15.5 
Lung Margin (OD06081) | 2.4 | Liver Cancer 1025 | 12.2 
Lung Cancer (OD04237-01) | 1.8 | Liver Cancer 6004-T | 7.8 
Lung Margin (OD04237-02) | 16.2 | Liver Tissue 6004-N | 6.1 
Ocular Melanoma Metastasis | 8.4 | Liver Cancer 6005-T | 25.0 
Ocular Melanoma Margin (Liver) | 2.9 | Liver Tissue 6005-N | 12.4 
Melanoma Metastasis | 4.0 | Liver Cancer 064003 | 12.9 
Melanoma Margin (Lung) | 3.8 | Normal Bladder | 14.2 
Normal Kidney | 10.9 | Bladder Cancer 1023 | 9.5 
Kidney Ca, Nuclear grade 2 (OD04338) | 35.4 | Bladder Cancer A302173 | 12.2 
Kidney Margin (OD04338) | 20.4 | Normal Stomach | 8.7 
Kidney Ca Nuclear grade 1/2 (OD04339) | 52.9 | Gastric Cancer 9060397 | 8.9 
Kidney Margin (OD04339) | 16.6 | Stomach Margin 9060396 | 7.4 
Kidney Ca, Clear cell type (OD04340) | 16.6 | Gastric Cancer 9060395 | 7.0 
Kidney Margin (OD04340) | 7.4 | Stomach Margin 9060394 | 7.5 
Kidney Ca, Nuclear grade 3 (OD04348) | 11.2 | Gastric Cancer 064005 | 6.9 



Table ACF. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) Ag3605, Run 169943454 


Rel. Exp.(%) Ag3974, Run 170739806 


Tissue Name 


Rel. Exp.(%) Ag3605, Run 169943454 


Rel. Exp.(%) Ag3974, Run 170739806 


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Secondary Th 1 act 


' 0 


1.2 


HUVEC IL-1 beta 


15.6 


18.9 


Secondary Th2 act 


^ 1 


8.0 


HUVEC I FN gamma 


12.9 


16.7 


Secondary Trl act 


2.5 


3.5 


HUVEC TNF alpha + 
I FN gamma 


37.6 


34.9 


Secondary Th 1 rest 


0.0 


|0.7 


HUVEC TNF alpha + 
1L4 


31.4 


31.4 
13.9 


Secondary Th2 rest 


!0.2 iHUVECIL-ll 


14.9 


Secondary Trl rest 


04 |j 2 jLung Microvascular F.C ^ () 
i ' jnone 


100.0 
97.9 


Primary Th1 act


3.6 !3.2 


Lung Microvascular EC . 
TNFalpha+IL-lbeta j U " U 


Primary Th2 act 


1.1 


2.0 


Microvascular Dermal 
EC none 


49.7 


48.3 


Primary Trl act 


3.4 


2.9 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


56.6 


47.0 


Primary Thl rest 


0.9 


0.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


78.5 


90.1 


Primary Th2 rest 


0, 


0.2 


Small airway epithelium 
none 


31.0 


32.5 

1 


Primary Trl rest 


0.2 


0.3 


Small airway epithelium 
TNFalpha+ IL-lbeta 


81 .8 


93.3 


CD45RACD4 j 
lymphocyte act 1 J ' 


22.7 


Coronery artery SMC 
rest 


17.2 


28.5 


CD45RO CD4 
lymphocyte act 


5.0 


5.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


22.2 


28.7 


CDS lymphocyte act 


3.9 


3.3 


Astrocytes rest 


80.1 


55.1 


Secondary CDS 
lymphocyte rest 


3.7 


3.3 


Astrocytes TNFalpha + 
IL-lbeta 


82.9 


66.4 


Secondary CDS 
lymphocyte act 


3.3 


3.5 


KU-812 (Basophil) rest 


2.6 


1.9 


CD4 lymphocyte 
none 


0.3 


0.1 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.6 


2.8 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


0.3 


0.4 


CCD 11 06 

(Keratinocytes) none 


70.7 


82.4 


LAK cells rest 


5.0 


6.4 


CCD 1106 

^Keratinocytes) 

TNFalpha+IL-lbeta 


77.4 


72.7 


LAK cells IL-2 


2.8 


1.7 


Liver cirrhosis 


13.2 


14.4 


LAK cells IL-2+IL- 
12 


1.4 


1.8 


NCI-H292 none 


57.8 


54.0 


LAK cells IL-2+IFN 
gamma 


2.3 


1.1 


NCI-H292 IL-4 


62.4 


78.5


LAK cells IL-2+IL-18


2.9 


1.6 


NCI-H292 IL-9 


61.6 


79.6 


LAK cells 
PMA/ionomycin 


6.8 


4.6 


— 

NCI-H292 IL-13 


j •- 

53.6 


59.9 


NK Cells IL-2 rest 


1.6 




NCI-H292 IFN gamma


167.4 


71.7 


Two Way MLR 3 day 


10.7 


~ 


HPAEC none


21.5 




2L3 


Two Way MLR 5 day 


6.5 


5.3 


HPAEC TNF alpha + IL-1 beta


37.6 


45.4 


Two Way MLR 7 day 


4.3 


4.0 


Lung fibroblast none 


22.4 


29.3 


PBMC rest 


0.0 


0.6 


Lung fibroblast TNF 
alpha + IL-1 beta 


71.2 


87.7 


PBMC PWM 






Lung fibroblast IL-4 


16.2 




PBMC PHA-L 


5.5 


3.4 


Lung fibroblast IL-9 


31.9 


30.4 


Ramos (B cell) none 


0.4 


0.4 


Lung fibroblast IL-13 


18.7 


36.6 


Ramos (B cell) 
ionomycin 


0.2 


0.2 


Lung fibroblast I FN 
gamma 


-3.0 


29.7 


B lymphocytes PWM 


2.4 


3.0 


Dermal fibroblast 
CCD 1070 rest 


15.9 


27.2 


B lymphocytes 
CD40L and IL-4 


1.5 


3.7 


Dermal fibroblast 
CCD 1070 TNF alpha 


15.9 


20.6 1 


EOL-1 dbcAMP 


3.4 


3.1 




Dermal fibroblast 
CCD 1070 IL-1 beta 


17.2 


22.4 ; 


EOL-1 dbcAMP 
PMA/ionomycin 


18.3 


8.0 


Dermal fibroblast 1FN 
gamma 


7.2 


10.3 


Dendritic cells none 


14.8 


9.0 


Dermal fibroblast IL-4 


7.2 


8.0 


Dendritic cells LPS 


48.3 


32.8 


Dermal Fibroblasts rest 


4.3 


6.3 


Dendritic cells anti- 
CD40 




8.8 


Neutrophils TNFa+LPS 


0.0 


0.9 


Monocytes rest 


0.9 


1.4 


Neutrophils rest 


0.2 


1.0 


Monocytes LPS 


66.4 


81.2 


Colon 


7.0 


5.8 


Macrophages rest 


16.2 


9.7 


Lung 


23.3 


23.3 


Macrophages LPS 


54.7 


43.8 


Thymus 


5.8 


7.3 


HUVEC none 


?.3 


2.6 


Kidney 


23.2 


53.2 


HUVEC starved 


4.4 


25.2 









Table ACG. Panel CNS_1

Tissue Name | Rel. Exp.(%) Ag3605, Run 171648697 | Tissue Name | Rel. Exp.(%) Ag3605, Run 171648697
BA4 Control | 23.8 | BA17 PSP | 21.9
BA4 Control2 | 44.4 | BA17 PSP2 | 14.4
BA4 Alzheimer's2 | 7.5 | Sub Nigra Control | 37.9
BA4 Parkinson's | 66.0 | Sub Nigra Control2 | 31.0
BA4 Parkinson's2 | 80.7 | Sub Nigra Alzheimer's2 | [illegible]
BA4 Huntington's | 23.5 | Sub Nigra Parkinson's2 | 80.1
BA4 Huntington's2 | 49.3 | Sub Nigra Huntington's | 76.3
BA4 PSP | 19.5 | Sub Nigra Huntington's2 | 29.7
BA4 PSP2 | 34.9 | Sub Nigra PSP2 | 11.1
BA4 Depression | 20.6 | Sub Nigra Depression | 34.4
BA4 Depression2 | [illegible] | Sub Nigra Depression2 | 18.8
BA7 Control | 53.2 | Glob Palladus Control | 40.3
BA7 Control2 | 47.6 | Glob Palladus Control2 | 35.8
BA7 Alzheimer's2 | 13.6 | Glob Palladus Alzheimer's | 20.0
BA7 Parkinson's | 39.8 | Glob Palladus Alzheimer's2 | 21.8
BA7 Parkinson's2 | 60.7 | Glob Palladus Parkinson's | 100.0
BA7 Huntington's | 41.8 | Glob Palladus Parkinson's2 | 25.0
BA7 Huntington's2 | 62.9 | Glob Palladus PSP | 17.9
BA7 PSP | 36.1 | Glob Palladus PSP2 | 7.2
BA7 PSP2 | 25.0 | Glob Palladus Depression | 15.5
BA7 Depression | 20.0 | Temp Pole Control | 15.3
BA9 Control | 36.6 | Temp Pole Control2 | 76.8
BA9 Control2 | 83.5 | Temp Pole Alzheimer's | 14.3
BA9 Alzheimer's | 17.1 | Temp Pole Alzheimer's2 | 14.7
BA9 Alzheimer's2 | 34.4 | Temp Pole Parkinson's | 76.3
BA9 Parkinson's | 70.7 | Temp Pole Parkinson's2 | 77.4
BA9 Parkinson's2 | 74.2 | Temp Pole Huntington's | 39.8
BA9 Huntington's | 55.5 | Temp Pole PSP | 7.4
BA9 Huntington's2 | 45.1 | Temp Pole PSP2 | 8.1
BA9 PSP | 28.7 | Temp Pole Depression2 | 31.6
BA9 PSP2 | 8.2 | Cing Gyr Control | 82.4
BA9 Depression | 18.0 | Cing Gyr Control2 | 82.4
BA9 Depression2 | 0.0 | Cing Gyr Alzheimer's | 27.4
BA17 Control | 74.7 | Cing Gyr Alzheimer's2 | 36.3
BA17 Control2 | 86.5 | Cing Gyr Parkinson's | 46.3
BA17 Alzheimer's2 | 20.3 | Cing Gyr Parkinson's2 | 42.6
BA17 Parkinson's | 75.3 | Cing Gyr Huntington's | 70.7
BA17 Parkinson's2 | 85.3 | Cing Gyr Huntington's2 | 37.6
BA17 Huntington's | 47.0 | Cing Gyr PSP | 21.6
BA17 Huntington's2 | 26.6 | Cing Gyr PSP2 | 13.9
BA17 Depression | 24.8 | Cing Gyr Depression | 21.3
BA17 Depression2 | 41.2 | Cing Gyr Depression2 | [illegible]


Table ACH. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag3605, Run 260268655 | Tissue Name | Rel. Exp.(%) Ag3605, Run 260268655
Colon cancer 1 | 11.6 | Bladder cancer NAT 2 | 0.2
Colon cancer NAT 1 | 5.3 | Bladder cancer NAT 3 | 1.7
Colon cancer 2 | 18.6 | Bladder cancer NAT 4 | 1.8
Colon cancer NAT 2 | 3.2 | Adenocarcinoma of the prostate 1 | 2.4
Colon cancer 3 | 22.2 | Adenocarcinoma of the prostate 2 | 2.6
Colon cancer NAT 3 | 7.7 | Adenocarcinoma of the prostate 3 | 5.2
Colon malignant cancer 4 | 23.5 | Adenocarcinoma of the prostate 4 | 21.9
Colon normal adjacent tissue 4 | 2.0 | Prostate cancer NAT 5 | 5.4
Lung cancer 1 | 57.0 | Adenocarcinoma of the prostate 6 |
Lung NAT 1 | 3.1 | Adenocarcinoma of the prostate 7 | 3.8
Lung cancer 2 | 100.0 | Adenocarcinoma of the prostate 8 | 1.8
Lung NAT 2 | 3.9 | Adenocarcinoma of the prostate 9 | 10.7
Squamous cell carcinoma 3 | 41.2 | Prostate cancer NAT 10 | 0.7
Lung NAT 3 | 1.7 | Kidney cancer 1 | 38.7
metastatic melanoma 1 | 5.4 | Kidney NAT 1 | 20.6
Melanoma 2 | 3.2 | Kidney cancer 2 | 66.4
Melanoma 3 | 3.7 | Kidney NAT 2 | 39.8
metastatic melanoma 4 | 12.5 | Kidney cancer 3 | 64.6
metastatic melanoma 5 | 7.9 | Kidney NAT 3 | 19.6
Bladder cancer 1 | 0.5 | Kidney cancer 4 | 16.4
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 15.6
Bladder cancer 2 | 6.0 | |







CNS_neurodegeneration_v1.0 Summary: Ag3605/Ag3974 Two experiments
with two different probe and primer sets confirm the expression of this gene at moderate
levels in the brains of an independent group of individuals. However, no differential
expression of this gene was detected between Alzheimer's diseased postmortem brains and
those of non-demented controls in this experiment. Please see Panel 1.4 for a discussion of
the potential utility of this gene in treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag3605/Ag3974 Two experiments
with the same probe and primer set produce results that are in excellent agreement. The
expression of the CG94946-01 gene appears to be highest in a sample derived from a
breast cancer cell line (T47D) (CTs=22.5-25.3). In addition, there appears to be substantial
expression in other samples derived from breast cancer cell lines, ovarian cancer cell lines,
kidney cancer cell lines, lung cancer cell lines, colon cancer cell lines and brain cancer cell
lines. Thus, the expression of this gene could be used to distinguish T47D cells from other
samples in the panel. Moreover, therapeutic modulation of this gene, through the use of
small molecule drugs, protein therapeutics, or antibodies could be of benefit in the
treatment of breast, ovarian, kidney, lung, colon or brain cancer.

Among metabolic tissues, this gene has low-to-moderate levels of expression in
adrenal, pituitary, adult and fetal heart, adult and fetal liver, adult and fetal skeletal
muscle, and adipose. This gene product has high levels of expression (CT values = 27) in
pancreas and thyroid. Thus, this gene product may be important for the pathogenesis,
diagnosis, and/or treatment of metabolic and endocrine diseases, including obesity, Types
1 and 2 diabetes and thyroidopathies. In support of this hypothesis, decreased glomerular
expression of agrin has been observed in diabetic nephropathy (Yard BA, Exp Nephrol
2001;9(3):214-22).

In addition, this gene is expressed at moderate levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. The CG94946-01 gene encodes a protein
with homology to agrin, a neuronal aggregating factor that induces the aggregation of
acetylcholine receptors and other postsynaptic proteins on muscle fibers and is crucial for
the formation of the neuromuscular junction. Agrin also plays an important role in
defining neuronal responses to excitatory neurotransmitters both in vitro and in vivo
(Hilgenberg LG, Mol Cell Neurosci 2002 Jan;19(1):97-110 and Bixby JL, J Neurobiol
2002 Feb 5;50(2):164-79). The CG59841-01 gene expression in the central nervous
system is consistent with the hypothesis that this protein may have functions similar to
those of agrin. Therefore, this gene may play a role in central nervous system disorders
such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis,
schizophrenia and depression.

Agrin has also been implicated in the formation of senile plaques in Alzheimer's
disease and in the acetylcholine synapse/neuromuscular junction (van Horssen J, Acta
Neuropathol (Berl) 2001 Dec;102(6):604-14). In addition, an agrin minigene rescued
dystrophic symptoms in a mouse model of muscular dystrophy (Moll J, Nature. 2001 Sep
20;413(6853):302-7). Therefore, this gene product may be used as a treatment or cure for
congenital muscular dystrophies. Furthermore, this gene product is also an excellent drug
target in AD or in any disease involving the neuromuscular junction or the acetylcholine
system.

Panel 2.2 Summary: Ag3605 Expression of the CG94946-01 gene is highest in a
sample of normal kidney (CT = 27.4). In addition, expression of this gene appears to be
upregulated in a number of ovarian and renal cancers when compared to the matched
control margins. Thus, expression of this gene could be used as a marker for ovarian and
renal carcinoma. Furthermore, therapeutic modulation of the activity of this gene or its
protein product, using small molecule drugs, antibodies or protein therapeutics, could be
of benefit in the treatment of renal and ovarian cancer.
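
The tumor-versus-margin comparison described for Panel 2.2 can be illustrated with a short Python sketch. The values are the Ag3605 relative expression percentages listed in the Panel 2.2 table for matched specimens; pairing the ovarian margin with specimen OD06455 is a reading of the table layout, and the ratio itself is only an illustrative metric (the function name `tumor_to_margin_ratio` is hypothetical), not the analysis used in the source.

```python
# Relative expression (%) for matched tumor/margin pairs, as read from the
# Panel 2.2 table for probe set Ag3605: (tumor value, margin value).
pairs = {
    "OD06455 (ovarian)": (28.9, 1.9),
    "OD04338 (kidney)": (35.4, 20.4),
    "OD04339 (kidney)": (52.9, 16.6),
    "OD04340 (kidney)": (16.6, 7.4),
}


def tumor_to_margin_ratio(tumor, margin):
    """Ratio of tumor to margin relative expression (illustrative only)."""
    return tumor / margin


ratios = {k: tumor_to_margin_ratio(t, m) for k, (t, m) in pairs.items()}
for specimen, ratio in ratios.items():
    print(f"{specimen}: {ratio:.1f}-fold")
```

A ratio above 1 in every listed pair is what the summary means by expression being upregulated relative to the matched control margin.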

Panel 4.1D Summary: Ag3605/Ag3974 Two experiments with two different
probe and primer sets produce results that are in very good agreement. Expression
of the CG94946-01 gene is highest in lung microvascular endothelial cells (CTs=27.3-28.5),
microvascular dermal endothelial cells, mucoepidermoid cell line NCI-H292,
astrocytes, and keratinocytes. This gene encodes a protein with homology to agrin.
Recently, it has been demonstrated that agrin, an aggregating protein crucial for formation
of the neuromuscular junction, is also important for T cell signaling in the immune system
(Khan AA, Science 2001 Jun 1;292(5522):1681-6). In addition, agrin has been identified as
a potential disease target for autoimmune disorders at the neuromuscular junction,
including multiple sclerosis (Liyanage Y, Muscle Nerve 2002 Jan;25(1):4-16). Therefore,
small molecule drug, antibody or protein therapeutics designed against the protein
encoded by the CG59841-01 gene could reduce or inhibit inflammation in asthma,
emphysema, allergy, psoriasis, muscular dystrophy and multiple sclerosis.

Panel CNS_1 Summary: Ag3605 This panel confirms the expression of the
CG94946-01 gene at low levels in the brains of an independent group of individuals.
Please see Panel 1.4 for a discussion of the potential utility of this gene in treatment of
central nervous system disorders.

General oncology screening panel_v_2.4 Summary: Ag3605 Highest
expression of the CG94946-01 gene is seen in lung cancer (CT=26.8). In addition, higher
levels of expression are seen in lung and kidney cancers when compared to expression in
normal adjacent tissue. Thus, expression of this gene could be used as a marker of lung
and kidney cancers. Furthermore, therapeutic modulation of the expression or function of
this gene product may be useful in the treatment of lung and kidney cancers.

AD. CG95165-01: Adenylate Cyclase, Type II

Expression of gene CG95165-01 was assessed using the primer-probe set Ag3991,
described in Table ADA. Results of the RTQ-PCR runs are shown in Tables ADB, ADC
and ADD.





Table ADA. Probe Name Ag3991

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-aacgtggcatcctgaagagt-3' | 20 | 3261 | 317
Probe | TET-5'-caccttcattttggcaagaagactgt-3'-TAMRA | 26 | 3281 | 318
Reverse | 5'-gttgcagtcagaaagtgtgtga-3' | 22 | 3322 | 319
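
The layout of the Ag3991 primer-probe set in Table ADA can be sanity-checked with a few lines of Python. This is a minimal sketch under two assumptions that the source does not state: that the start positions are 1-based coordinates on a single strand, and that the listed start of the reverse primer is its leftmost coordinate. The helper name `check_layout` is hypothetical.

```python
# Primer-probe set Ag3991 as listed in Table ADA: (sequence 5'->3', start).
AG3991 = {
    "forward": ("aacgtggcatcctgaagagt", 3261),
    "probe": ("caccttcattttggcaagaagactgt", 3281),
    "reverse": ("gttgcagtcagaaagtgtgtga", 3322),
}


def check_layout(oligos):
    """Return (name, length, start, end) tuples ordered by start position.

    Assumption (not stated in the source): positions are 1-based and
    inclusive, so end = start + length - 1.
    """
    rows = []
    for name, (seq, start) in oligos.items():
        rows.append((name, len(seq), start, start + len(seq) - 1))
    return sorted(rows, key=lambda row: row[2])


layout = check_layout(AG3991)
for name, length, start, end in layout:
    print(f"{name:8s} length={length:2d} span={start}-{end}")

# Under the stated assumptions, consecutive oligos should not overlap.
non_overlapping = all(a[3] < b[2] for a, b in zip(layout, layout[1:]))
# Span from the first base of the forward primer to the last of the reverse.
amplicon_length = layout[-1][3] - layout[0][2] + 1
```

The sequence lengths computed here agree with the Length column for the forward primer and the probe, and the probe begins immediately after the forward primer ends.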



Table ADB. CNS_neurodegeneration_v1.0



Tissue Name 


Rel. Exp.(%) 
Ag3991,Run 
212349022 


Tissue Name 


Rel. Exp.(%) 
Ag3991, Run 
212349022 


AD 1 Hippo 


27.9 


Control (Path) 3 Temporal Ctx 


15.9 


AD 2 Hippo 


46.0 


Control (Path) 4 Temporal Ctx 


35.4 


AD 3 Hippo 


14.9 


AD 1 Occipital Ctx 


26.2 


AD 4 Hippo 


13.8 


AD 2 Occipital Ctx (Missing) 


0.0


AD 5 Hippo 


61.6 


AD 3 Occipital Ctx 


11.9


AD 6 Hippo 


97.9 


AD 4 Occipital Ctx 


31.2 


Control 2 Hippo


43.8 


AD 5 Occipital Ctx 


61.6 


Control 4 Hippo 


:<. l 


AD 6 Occipital Ctx 


26.4 


Control (Path) 3 Hippo 


15.9 


Control 1 Occipital Ctx 


'.'0 


AD 1 Temporal Ctx 


39.0 


Control 2 Occipital Ctx 


71.2 


AD 2 Temporal Ctx 


55.9 


Control 3 Occipital Ctx 


21.9 


AD 3 Temporal Ctx 


15.0 


Control 4 Occipital Ctx 


16.5 


AD 4 Temporal Ctx 


33.2 


Control (Path) 1 Occipital Ctx 


81.2


AD 5 Inf Temporal Ctx


no i) c 


ontrol (Path) 2 Occipital Ctx 9 


3 


AD 5 Sup Temporal ( 
Ctx 


0.7 ( 


:ontrol (Path) 3 Occipital Ctx 7 


4 


AD 6 Inf Temporal Ctx 


74.7 ( 


:ontrol (Path) 4 Occipital Ctx 9 


.6 


AD 6 Sup Temporal 
Ctx 


J0.1 < 


:ontroI 1 Parietal Ctx |l 


5.0 


Control 1 Temporal Ctx 


5.1 jControl 2 Parietal Ctx 5 


6.3 


iControl 2 Temporal Ctx 


54.2 Control 3 Parietal Ctx 


3.2 


'Control 3 Temporal Ctx 


25.0 Control (F ttln 1 Parietal Ctx c 


9.3 


Control 3 Temporal Ctx 


15.5 < onl '>! Path : 1 metal Ctx - 


8.3 


Control (Path) 1 
Temporal Ctx 


65.1 


Control (Path) 3 Parietal Ctx 


1.6 


Control (Path) 2 
Temporal Ctx 


43.5 


Control (Path) 4 Parietal Ctx 1 


14.4 


Table ADC. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag3991, Run 170739808 | Tissue Name | Rel. Exp.(%) Ag3991, Run 170739808
Secondary Th1 act | 0.0 | HUVEC IL-1beta | 0.0
Secondary Th2 act | 0.0 | HUVEC IFN gamma | 0.0
Secondary Tr1 act | 1.5 | HUVEC TNF alpha + IFN gamma | 4.5
Secondary Th1 rest | 0.0 | HUVEC TNF alpha + IL4 | 0.0
Secondary Th2 rest | 0.0 | HUVEC IL-11 | 0.0
Secondary Tr1 rest | 0.0 | Lung Microvascular EC none | 0.0
Primary Th1 act | 0.0 | Lung Microvascular EC TNFalpha + IL-1beta | 0.0
Primary Th2 act | 0.0 | Microvascular Dermal EC none | 0.0
Primary Tr1 act | 0.0 | Microvascular Dermal EC TNFalpha + IL-1beta | 0.0
Primary Th1 rest | 0.0 | Bronchial epithelium TNFalpha + IL1beta | 0.0
Primary Th2 rest | 0.0 | Small airway epithelium none | 0.0
Primary Tr1 rest | 0.0 | Small airway epithelium TNFalpha + IL-1beta | 0.0
CD45RA CD4 lymphocyte act | 0.0 | Coronary artery SMC rest | 0.0
CD45RO CD4 lymphocyte act | 0.0 | Coronary artery SMC TNFalpha + IL-1beta | 0.0
CD8 lymphocyte act | 0.0 | Astrocytes rest | 37.4
Secondary CD8 lymphocyte rest | 0.0 | Astrocytes TNFalpha + IL-1beta | 15.8
Secondary CD8 lymphocyte act | 0.0 | KU-812 (Basophil) rest | 0.0
CD4 lymphocyte none | 0.0 | KU-812 (Basophil) PMA/ionomycin | 0.0
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 0.0 | CCD1106 (Keratinocytes) none | 0.0
LAK cells rest | 0.0 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 0.0
LAK cells IL-2 | 0.0 | Liver cirrhosis | 2.8
LAK cells IL-2+IL-12 | 0.0 | NCI-H292 none | 1.8
LAK cells IL-2+IFN gamma | 0.0 | NCI-H292 IL-4 | 0.0
LAK cells IL-2+IL-18 | 0.0 | NCI-H292 IL-9 |
LAK cells PMA/ionomycin | 0.0 | NCI-H292 IL-13 | 0.0
NK Cells IL-2 rest | 0.0 | NCI-H292 IFN gamma | [illegible]
Two Way MLR 3 day | 0.0 | HPAEC none | [illegible]
Two Way MLR 5 day | 0.0 | HPAEC TNF alpha + IL-1 beta | [illegible]
Two Way MLR 7 day | 0.0 | Lung fibroblast none | 0.0
PBMC rest | 0.0 | Lung fibroblast TNF alpha + IL-1 beta | 0.0
PBMC PWM | 0.0 | Lung fibroblast IL-4 | 0.0
PBMC PHA-L | 0.0 | Lung fibroblast IL-9 | 2.8
Ramos (B cell) none | 0.0 | Lung fibroblast IL-13 | 0.0
Ramos (B cell) ionomycin | 0.0 | Lung fibroblast IFN gamma | 0.0
B lymphocytes PWM | 0.0 | Dermal fibroblast CCD1070 rest | 0.0
B lymphocytes CD40L and IL-4 | 0.0 | Dermal fibroblast CCD1070 TNF alpha | 0.0
EOL-1 dbcAMP | 0.0 | Dermal fibroblast CCD1070 IL-1 beta | 0.0
EOL-1 dbcAMP PMA/ionomycin | 0.0 | Dermal fibroblast IFN gamma | 8.2
Dendritic cells none | 0.0 | Dermal fibroblast IL-4 | 42.3
Dendritic cells LPS | 0.0 | Dermal Fibroblasts rest | 39.5
Dendritic cells anti-CD40 | 0.0 | Neutrophils TNFa+LPS | 0.0
Monocytes rest | 0.0 | Neutrophils rest | 2.0
Monocytes LPS | 0.0 | Colon | 17.7
Macrophages rest | 2.4 | Lung | [illegible]
Macrophages LPS | 0.0 | Thymus | 15.2
HUVEC none | 0.0 | Kidney | 100.0
HUVEC starved | 0.0 | |







Table ADD. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag3991, Run 268168218 | Tissue Name | Rel. Exp.(%) Ag3991, Run 268168218
Colon cancer 1 | 0.2 | Bladder cancer NAT 2 | 0.1
Colon cancer NAT 1 | 1.1 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 0.3 | Bladder cancer NAT 4 | 0.9
Colon cancer NAT 2 | 0.5 | Adenocarcinoma of the prostate 1 | 7.0
Colon cancer 3 | 0.6 | Adenocarcinoma of the prostate 2 | 0.9
Colon cancer NAT 3 | 1.4 | Adenocarcinoma of the prostate 3 | 4.8
Colon malignant cancer 4 | [illegible] | Adenocarcinoma of the prostate 4 | 0.8
Colon normal adjacent tissue 4 | [illegible] | Prostate cancer NAT 5 | 2.0
Lung cancer 1 | 0.1 | Adenocarcinoma of the prostate 6 | 1.0
Lung NAT 1 | 0.1 | Adenocarcinoma of the prostate 7 | 1.4
Lung cancer 2 | 0.2 | Adenocarcinoma of the prostate 8 | 0.9
Lung NAT 2 | 0.1 | Adenocarcinoma of the prostate 9 | 8.3
Squamous cell carcinoma 3 | 1.7 | Prostate cancer NAT 10 | 0.5
Lung NAT 3 | 0.2 | Kidney cancer 1 | 0.6
metastatic melanoma 1 | 3.8 | Kidney NAT 1 | 0.4
Melanoma 2 | 2.3 | Kidney cancer 2 | 100.0
Melanoma 3 | 2.0 | Kidney NAT 2 | 0.0
metastatic melanoma 4 | 4.6 | Kidney cancer 3 | 0.1
metastatic melanoma 5 | 1.6 | Kidney NAT 3 | [illegible]
Bladder cancer 1 | 0.3 | Kidney cancer 4 | 0.6
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 0.4
Bladder cancer 2 | 1.2 | |







CNS_neurodegeneration_v1.0 Summary: Ag3991 This panel confirms the
expression of the CG95165-01 gene at low levels in the brains of an independent group of
individuals, but no differential expression of this gene was detected between Alzheimer's
diseased postmortem brains and those of non-demented controls in this experiment.
However, expression in the brain suggests that this gene may play a role in central
nervous system disorders such as Parkinson's disease, epilepsy, multiple sclerosis,
schizophrenia and depression.

The CG95165-01 gene codes for a homologue of rat adenylyl cyclase. In
Drosophila, mutations in adenylyl cyclase and other members of the cAMP pathway have
been shown to reduce the ability of fruit flies to learn or to remember (Waddell S and
Quinn WG, 2001, Trends Genet 17(12):719-26, PMID: 11718926). In addition,
derangement of the second messenger adenylyl cyclase system closely parallels ischemic
neuronal damage, and persistent enhancement of this cAMP signaling pathway is important
for neuronal survival in acute cerebral ischemia (Tanaka K, 2001, Prog Neurobiol
65(2):173-207, PMID: 11403878). Therefore, therapeutic modulation of adenylyl cyclase
encoded by this gene may be useful in the treatment of learning disorders and acute
cerebral ischemia.

General_screening_panel_v1.4 Summary: Ag3991 Results from one
experiment with the CG95165-01 gene are not included. The amp plot indicates that there
were experimental difficulties with this run.

Panel 4.1D Summary: Ag3991 Highest expression of the CG95165-01 gene is
detected in kidney (CT=32). In addition, moderate to low expression of this gene is seen in
thymus, colon, dermal fibroblast and astrocytes. Thus, expression of this gene can be used
to distinguish these samples from other samples used in this panel. In addition, therapeutic
modulation of this gene may be beneficial in the treatment of autoimmune and
inflammatory diseases that affect brain, colon and kidney, including inflammatory bowel
diseases, lupus and glomerulonephritis.

General oncology screening panel_v_2.4 Summary: Ag3991 Highest
expression of the CG95165-01 gene is detected in kidney cancer (CT=27.8). Interestingly,
expression of this gene is higher in the cancer sample than in the adjacent control sample
(CT=40). In addition, moderate expression of this gene is seen in prostate adenocarcinoma
and melanoma. This expression is low/undetectable in the normal samples used in this
panel. Therefore, therapeutic modulation of this gene product through the use of small
molecule drugs or antibodies could be beneficial in the treatment of melanoma, prostate
and kidney cancers.
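
The relationship between the two CT values quoted above and the percentages in Table ADD can be sketched in Python. The sketch assumes perfect amplification efficiency (one doubling per cycle) and scaling to the highest-expressing sample; the source does not spell out its normalization in this section, so the function `relative_expression` and the sample labels attached to the CT values are illustrative assumptions.

```python
def relative_expression(ct_values):
    """Scale CT values to percent of the highest-expressing sample.

    Assumption (not taken from the source): template doubles every cycle,
    so expression is proportional to 2 ** (-CT).
    """
    ct_min = min(ct_values.values())
    return {
        name: 100.0 * 2.0 ** (ct_min - ct)
        for name, ct in ct_values.items()
    }


# CT values quoted in the summary; the sample labels are an assumed mapping
# onto the kidney cancer and adjacent control rows of Table ADD.
cts = {"Kidney cancer 2": 27.8, "Kidney NAT 2": 40.0}
rel = relative_expression(cts)
for sample, pct in rel.items():
    print(f"{sample}: {pct:.1f}%")
```

Under these assumptions a gap of about 12 cycles corresponds to a difference of several thousand fold, which is why the adjacent control rounds to 0.0 when the cancer sample is set to 100.0.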
AE. CG95175-01: Ephrin Type-A Receptor 7 Precursor

Expression of gene CG95175-01 was assessed using the primer-probe sets Ag3992
and Ag612, described in Tables AEA and AEB. Results of the RTQ-PCR runs are shown
in Table AEC.

Table AEA. Probe Name Ag3992

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-accactatggtgaggctacaga-3' | 22 | 2427 | 320
Probe | TET-5'-ctatgggccgctcccgagacact-3'-TAMRA | 23 | 2466 | 321
Reverse | 5'-agagctgaagtggccaaact-3' | 20 | 2491 | 322



Table AEB. Probe Name Ag612

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-gccgctcccgagacactt-3' | 18 | | 323
Probe | TET-5'-ccacttcagctctgccagtgacgtg-3'-TAMRA | 25 | 2498 |
Reverse | 5'-cccacatgatgatgccgaa-3' | 19 | 2529 |



Table AEC. Panel 4D

Tissue Name | Rel. Exp.(%) Ag612, Run 145645058 | Tissue Name | Rel. Exp.(%) Ag612, Run 145645058
Secondary Th1 act | 0.0 | HUVEC IL-1beta | 0.0
Secondary Th2 act | 12.2 | HUVEC IFN gamma | 0.0
Secondary Tr1 act | 0.0 | HUVEC TNF alpha + IFN gamma | 0.0
Secondary Th1 rest | 0.0 | HUVEC TNF alpha + IL4 | 0.0
Secondary Th2 rest | 0.0 | HUVEC IL-11 | 0.0
Secondary Tr1 rest | 0.0 | Lung Microvascular EC none | 0.0
Primary Th1 act | 0.0 | Lung Microvascular EC TNFalpha + IL-1beta | 0.0
Primary Th2 act | 0.0 | Microvascular Dermal EC none | 0.0
Primary Tr1 act | 0.0 | Microvascular Dermal EC TNFalpha + IL-1beta | 0.0
Primary Th1 rest | 0.0 | Bronchial epithelium TNFalpha + IL1beta | 0.0
Primary Th2 rest | 0.0 | Small airway epithelium none | 0.0
Primary Tr1 rest | 0.0 | Small airway epithelium TNFalpha + IL-1beta | 4.6
CD45RA CD4 lymphocyte act | 0.0 | Coronary artery SMC rest | 0.0
CD45RO CD4 lymphocyte act | 4.6 | Coronary artery SMC TNFalpha + IL-1beta | 0.0
CD8 lymphocyte act | 5.3 | Astrocytes rest | 0.0
Secondary CD8 lymphocyte rest | 5.3 | Astrocytes TNFalpha + IL-1beta | 0.0
Secondary CD8 lymphocyte act | 0.0 | KU-812 (Basophil) rest | 0.0
CD4 lymphocyte none | 0.0 | KU-812 (Basophil) PMA/ionomycin | 0.0
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 0.0 | CCD1106 (Keratinocytes) none | 0.0
LAK cells rest | 9.7 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 1.0
LAK cells IL-2 | 0.0 | Liver cirrhosis | 61.6
LAK cells IL-2+IL-12 | 0.0 | Lupus kidney | 0.0
LAK cells IL-2+IFN gamma | 5.1 | NCI-H292 none | 32.5
LAK cells IL-2+IL-18 | 10.0 | NCI-H292 IL-4 | 46.7
LAK cells PMA/ionomycin | | NCI-H292 IL-9 | 58.6
NK Cells IL-2 rest | 10.0 | NCI-H292 IL-13 |
Two Way MLR 3 day | 0.0 | NCI-H292 IFN gamma | 100.0
Two Way MLR 5 day | 0.0 | |
Two Way MLR 7 day | 0.0 | HPAEC TNF alpha + IL-1 beta | 0.0
PBMC rest | 0.0 | Lung fibroblast none | 0.0
PBMC PWM | 0.0 | Lung fibroblast TNF alpha + IL-1 beta | 0.0
PBMC PHA-L | 8.9 | Lung fibroblast IL-4 | 0.0
Ramos (B cell) none | 0.0 | Lung fibroblast IL-9 | 0.0
Ramos (B cell) ionomycin | 0.0 | Lung fibroblast IL-13 | 9.0
B lymphocytes PWM | 8.5 | Lung fibroblast IFN gamma | 0.0
B lymphocytes CD40L and IL-4 | 0.0 | Dermal fibroblast CCD1070 rest | 0.0
EOL-1 dbcAMP | 0.0 | Dermal fibroblast CCD1070 TNF alpha | 0.0
EOL-1 dbcAMP PMA/ionomycin | 0.0 | Dermal fibroblast CCD1070 IL-1 beta | 0.0
Dendritic cells none | 5.3 | Dermal fibroblast IFN gamma | 0.0
Dendritic cells LPS | 4.5 | Dermal fibroblast IL-4 | 0.0
Dendritic cells anti-CD40 | 0.0 | IBD Colitis 2 | 10.7
Monocytes rest | 0.0 | IBD Crohn's | 0.0
Monocytes LPS | [illegible] | Colon | 51.8
Macrophages rest | 0.0 | Lung | 24.7
Macrophages LPS | 0.0 | Thymus | 4.1
HUVEC none | 0.0 | Kidney | 46
HUVEC starved | 0.0 | |







CNS_neurodegeneration_v1.0 Summary: Ag3992 Expression of the CG95175-01
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not
shown).



General_screening_panel_v1.4 Summary: Ag3992 Expression of the CG95175-01
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not
shown).

Panel 4.1D Summary: Ag3992 Expression of the CG95175-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

Panel 4D Summary: Ag612 Highest expression of the CG95175-01 gene is
detected in IFN gamma treated NCI-H292 cells (CT=33). Moderate to low expression of this
gene is also seen in cytokine treated and untreated NCI-H292 cells, liver cirrhosis and
colon tissue samples. Therefore, expression of this gene can be used to distinguish these
samples from other samples used in this panel. In addition, therapeutic modulation of this
gene can be used for the treatment of chronic obstructive pulmonary disease, asthma,
allergy, emphysema, liver cirrhosis, and autoimmune and inflammatory diseases affecting
the colon, including Crohn's disease and ulcerative colitis.

Panel CNS_1 Summary: Ag3992 Expression of the CG95175-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag3992 Expression of the
CG95175-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel
(data not shown).

AF. CG95693-01: DPY-19 Protein I Like Protein

Expression of gene CG95693-01 was assessed using the primer-probe set Ag4026,
described in Table AFA. Results of the RTQ-PCR runs are shown in Tables AFB, AFC,
AFD and AFE.

Table AFA. Probe Name Ag4026

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tggcattctaacagtgatgtca-3' | 22 | 54 | 326
Probe | TET-5'-tgcaaacctctgtaatcaatggagca-3'-TAMRA | 26 | 87 | 327
Reverse | 5'-aaagttcttcctgaggcaaatc-3' | 22 | 130 | 328



Table AFB. CNS_neurodegeneration_v1.0



Tissue Name 


Rel. Exp.(%) 
Ag4026, Run 
212391720 


Tissue Name


Rel. Exp.(%)
Ag4026, Run
212391720


AD 1 Hippo 


7.6 


Control (Path) 3 Temporal Ctx 


2.0 


AD 2 Hippo 


24.0 


Control (Path) 4 Temporal Ctx 


27.7 


AD 3 Hippo 


12.4 


AD 1 Occipital Ctx 


15.3 


AD 4 Hippo 


9.3 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo


77.4 


AD 3 Occipital Ctx 


4.8 


AD 6 Hippo 


38.7 


AD 4 Occipital Ctx 


24.5 


Control 2 Hippo 


11.2 


AD 5 Occipital Ctx 


23.8 


Control 4 Hippo 


23.8 


AD 6 Occipital Ctx 


30.6 


Control (Path) 3 Hippo 


12.8 


Control 1 Occipital Ctx 


1.6 


AD 1 Temporal Ctx 


12.2 


Control 2 Occipital Ctx 


47.0 


AD 2 Temporal Ctx 


26.4 


Control 3 Occipital Ctx 


13.2 


AD 3 Temporal Ctx 


5.6 


Control 4 Occipital Ctx 


7.9 


AD 4 Temporal Ctx




Control (Path) 1 Occipital Ctx 


52.1 


AD 5 Inf Temporal Ctx 


84.7 


Control (Path) 2 Occipital Ctx 


13.4 


AD 5 Sup Temporal Ctx


52.5 


Control (Path) 3 Occipital Ctx 


4.0 


AD 6 Inf Temporal Ctx 


55.5 


Control (Path) 4 Occipital Ctx 


14.7 


AD 6 Sup Temporal Ctx 


100.0 


Control 1 Parietal Ctx 


5.1 


Control 1 Temporal Ctx 


7.9 


Control 2 Parietal Ctx 


47.6 


Control 2 Temporal Ctx 


18.0 


Control 3 Parietal Ctx 


6.6 


Control 3 Temporal Ctx 


12.9 


Control (Path) 1 Parietal Ctx


51.1 


Control 4 Temporal Ctx 


6.0 


Control (Path) 2 Parietal Ctx 


17.9 


Control (Path) 1 Temporal 
Ctx 


47.0 


Control (Path) 3 Parietal Ctx 


4.8 


Control (Path) 2 Temporal 
Ctx 


22.8 


Control (Path) 4 Parietal Ctx 


33.4 



WO 03/010327 PCT/US02/14199

Table AFC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4026, Run 218425757 | Tissue Name | Rel. Exp.(%) Ag4026, Run 218425757
Adipose | 0.9 | Renal ca. TK-10 | 12.7
Melanoma* Hs688(A).T | | Bladder | 7.6
Melanoma* Hs688(B).T | | Gastric ca. (liver met.) NCI-N87 | 14.1
Melanoma* M14 | 2.5 | Gastric ca. KATO III | 31.0
Melanoma* LOXIMVI | 5.4 | Colon ca. SW-948 | 4.3
Melanoma* SK-MEL-5 | 16.8 | Colon ca. SW480 | 49.0
Squamous cell carcinoma SCC-4 | 0.8 | Colon ca.* (SW480 met) SW620 | 25.9
Testis Pool | | Colon ca. HT29 | 1.6
Prostate ca.* (bone met) PC-3 | 5.6 | Colon ca. HCT-116 | 0.9
Prostate Pool | 2.6 | Colon ca. CaCo-2 | 6.3
Placenta | 0.4 | Colon cancer tissue | 20.4
Uterus Pool | 0.8 | Colon ca. SW1116 | 0.9
Ovarian ca. OVCAR-3 | 3.3 | Colon ca. Colo-205 | 6.2
Ovarian ca. SK-OV-3 | 31.2 | Colon ca. SW-48 | 6.5
Ovarian ca. OVCAR-4 | 2.0 | Colon Pool | 5.5
Ovarian ca. OVCAR-5 | 10.2 | Small Intestine Pool | 7.2
Ovarian ca. IGROV-1 | 11.6 | Stomach Pool | 3.8
Ovarian ca. OVCAR-8 | 3.8 | Bone Marrow Pool | 0.0
Ovary | | Fetal Heart |
Breast ca. MCF-7 | 12.4 | Heart Pool | 2.1
Breast ca. MDA-MB-231 | 10.9 | Lymph Node Pool | 7.7
Breast ca. BT 549 | 23.7 | Fetal Skeletal Muscle | 1.9
Breast ca. T47D | 10.2 | Skeletal Muscle Pool | 2.0
Breast ca. MDA-N | 7.2 | Spleen Pool | 6.2
Breast Pool | 7.6 | Thymus Pool | 7.6
Trachea | 8.4 | CNS cancer (glio/astro) U87-MG | 7.0
Lung | 5.2 | CNS cancer (glio/astro) U-118-MG | 27.9
Fetal Lung | 15.6 | CNS cancer (neuro;met) SK-N-AS | 6.8
Lung ca. NCI-N417 | 0.4 | CNS cancer (astro) SF-539 | 10.7
Lung ca. LX-1 | 29.9 | CNS cancer (astro) SNB-75 | 60.7
Lung ca. NCI-H146 | 1.8 | CNS cancer (glio) SNB-19 | 14.9
Lung ca. SHP-77 | 1.6 | CNS cancer (glio) SF-295 | 100.0
Lung ca. A549 | 0.9 | Brain (Amygdala) Pool | 4.8
Lung ca. NCI-H526 | 2.1 | Brain (cerebellum) | 0.4
Lung ca. NCI-H23 | 52.9 | Brain (fetal) | 3.8
Lung ca. NCI-H460 | 3.8 | Brain (Hippocampus) Pool | 6.2
Lung ca. HOP-62 | 2.4 | Cerebral Cortex Pool | 6.0
Lung ca. NCI-H522 | 4.2 | Brain (Substantia nigra) Pool | 6.6
Liver | 0.0 | Brain (Thalamus) Pool | 8.8
Fetal Liver | 14.8 | Brain (whole) | 2.9
Liver ca. HepG2 | 1.8 | Spinal Cord Pool | 9.3
Kidney Pool | 7.9 | Adrenal Gland | 1.6
Fetal Kidney | 10.6 | Pituitary gland Pool | 3.4
Renal ca. 786-0 | 14.4 | Salivary Gland | 0.7
Renal ca. A498 | 3.3 | Thyroid (female) | 2.5
Renal ca. ACHN | 3.1 | Pancreatic ca. CAPAN2 | 9.0
Renal ca. UO-31 | 2.9 | Pancreas Pool | 5.1


Table AFD. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4026, Run 171613290 | Tissue Name | Rel. Exp.(%) Ag4026, Run 171613290
Secondary Th1 act | 1.3 | HUVEC IL-1beta | 0.7
Secondary Th2 act | 4.2 | HUVEC IFN gamma | 0.9
Secondary Tr1 act | 2.0 | HUVEC TNF alpha + IFN gamma | 1.2
Secondary Th1 rest | 2.9 | HUVEC TNF alpha + IL4 | 0.0
Secondary Th2 rest | 3.3 | HUVEC IL-11 | 0.0
Secondary Tr1 rest | 2.6 | Lung Microvascular EC none | 1.0
Primary Th1 act | 3.4 | Lung Microvascular EC TNFalpha + IL-1beta | 1.0
Primary Th2 act | 9.2 | Microvascular Dermal EC none | 0.0
Primary Tr1 act | 3.9 | Microvascular Dermal EC TNFalpha + IL-1beta | 0.0
Primary Th1 rest | 1.1 | Bronchial epithelium TNFalpha + IL1beta | 0.5
Primary Th2 rest | 0.5 | Small airway epithelium none | 0.0
Primary Tr1 rest | 2.0 | Small airway epithelium TNFalpha + IL-1beta | 1.5
CD45RA CD4 lymphocyte act | 7.3 | Coronary artery SMC rest | 0.0
CD45RO CD4 lymphocyte act | 7.7 | Coronary artery SMC TNFalpha + IL-1beta | 0.4
CD8 lymphocyte act | 8.1 | Astrocytes rest | 2.0
Secondary CD8 lymphocyte rest | 2.9 | Astrocytes TNFalpha + IL-1beta | 1.4
Secondary CD8 lymphocyte act | 2.7 | KU-812 (Basophil) rest | 74.2
CD4 lymphocyte none | 1.3 | KU-812 (Basophil) PMA/ionomycin | 100.0
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 7.6 | CCD1106 (Keratinocytes) none | 3.7
LAK cells rest | 3.4 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 2.8
LAK cells IL-2 | 10.0 | Liver cirrhosis | 0.0
LAK cells IL-2+IL-12 | 2.2 | NCI-H292 none | 4.0
LAK cells IL-2+IFN gamma | 5.8 | NCI-H292 IL-4 |
LAK cells IL-2+IL-18 | 5.7 | NCI-H292 IL-9 | 3.0
LAK cells PMA/ionomycin | 1.4 | NCI-H292 IL-13 | 3.0
NK Cells IL-2 rest | 12.9 | NCI-H292 IFN gamma | 2.5
Two Way MLR 3 day | 4.8 | HPAEC none | 0.5
Two Way MLR 5 day | 2.1 | HPAEC TNF alpha + IL-1 beta | 1.8
Two Way MLR 7 day | 3.1 | Lung fibroblast none | 0.0
PBMC rest | 0.5 | Lung fibroblast TNF alpha + IL-1 beta | 1.4
PBMC PWM | 5.6 | Lung fibroblast IL-4 | 0.4
PBMC PHA-L | 5.3 | Lung fibroblast IL-9 | 1.0
Ramos (B cell) none | 23.0 | Lung fibroblast IL-13 | 2.2
Ramos (B cell) ionomycin | 24.0 | Lung fibroblast IFN gamma | 2.3
B lymphocytes PWM | 3.3 | Dermal fibroblast CCD1070 rest | 3.3
B lymphocytes CD40L and IL-4 | 12.2 | Dermal fibroblast CCD1070 TNF alpha | 12.9
EOL-1 dbcAMP | 0.0 | Dermal fibroblast CCD1070 IL-1 beta | 0.0
EOL-1 dbcAMP PMA/ionomycin | 0.0 | Dermal fibroblast IFN gamma | 1.4
Dendritic cells none | 1.0 | Dermal fibroblast IL-4 | 1.2
Dendritic cells LPS | 0.8 | Dermal Fibroblasts rest | 3.7
Dendritic cells anti-CD40 | 0.0 | Neutrophils TNFa+LPS | 0.0
Monocytes rest | 0.6 | Neutrophils rest | 0.0
Monocytes LPS | 0.0 | Colon | 11.7
Macrophages rest | 4.0 | Lung | 6.6
Macrophages LPS | 0.0 | Thymus | 22.4
HUVEC none | 0.9 | Kidney | 49.3
HUVEC starved | 1.7 | |






Table AFE. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4026, Run 268362901 | Tissue Name | Rel. Exp.(%) Ag4026, Run 268362901
Colon cancer 1 | 23.8 | Bladder cancer NAT 2 | 0.6
Colon NAT 1 | 0.7 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 3.9 | Bladder cancer NAT 4 | 2.2
Colon cancer NAT 2 | 2.0 | Adenocarcinoma of the prostate 1 |
Colon cancer 3 | 63.3 | Adenocarcinoma of the prostate 2 | 1.3
Colon cancer NAT 3 | 4.9 | Adenocarcinoma of the prostate 3 |
Colon malignant cancer 4 | 16.8 | Adenocarcinoma of the prostate 4 | 38.7
Colon normal adjacent tissue 4 | 1.2 | Prostate cancer NAT 5 | 0.4
Lung cancer 1 | 13.9 | Adenocarcinoma of the prostate 6 | 1.6
Lung NAT 1 | 0.9 | Adenocarcinoma of the prostate 7 | 0.4
Lung cancer 2 | 16.3 | Adenocarcinoma of the prostate 8 | 0.0
Lung NAT 2 | 0.3 | Adenocarcinoma of the prostate 9 | 6.1
Squamous cell carcinoma 3 | 4.0 | Prostate cancer NAT 10 | 0.0
Lung NAT 3 | 0.5 | Kidney cancer 1 | 12.8
metastatic melanoma 1 | 30.1 | Kidney NAT 1 | 5.0
Melanoma 2 | 0.0 | Kidney cancer 2 | 100.0
Melanoma 3 | 1.5 | Kidney NAT 2 | 6.3
metastatic melanoma 4 | 12.4 | Kidney cancer 3 | 52.9
metastatic melanoma 5 | 26.8 | Kidney NAT 3 | 4.2
Bladder cancer 1 | 0.9 | Kidney cancer 4 | 12.1
Bladder cancer NAT 1 | 9.0 | Kidney NAT 4 | 2.6
Bladder cancer 2 | 1.0 | |







CNS_neurodegeneration_v1.0 Summary: Ag4026 This panel confirms the
expression of the CG95693-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4026 Highest expression of the
CG95693-01 gene is detected in the CNS cancer SF-295 cell line (CT=28.3). Significant
expression of this gene is associated with a cluster of cancer cell lines including CNS, colon,
renal, lung, breast, ovarian, pancreatic, melanoma and prostate cancer cell lines.
Therefore, therapeutic modulation of this gene could be beneficial in the treatment of
these cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adrenal gland, thyroid, pituitary gland, skeletal muscle,
heart, fetal liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

Interestingly, this gene is expressed at much higher levels in fetal liver (CT=31) than
in adult liver (CT=40). This observation suggests that expression of this gene
can be used to distinguish fetal from adult liver. In addition, the relative overexpression of 
this gene in fetal liver suggests that the protein product may enhance liver growth or 
development in the fetus and thus may also act in a regenerative capacity in the adult. 
Therefore, therapeutic modulation of the protein encoded by this gene could be useful in 
treatment of liver related diseases. 
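
The fetal versus adult comparison above can be put on a quantitative footing. The following is a minimal sketch, assuming ideal amplification in which the PCR product doubles every cycle (an assumption that is not stated in this section), so that a difference in CT values corresponds to a fold difference of 2 raised to that difference. The function names are illustrative only and are not taken from the specification.

```python
def fold_difference(ct_high_expr: float, ct_low_expr: float,
                    efficiency: float = 2.0) -> float:
    """Fold excess of the sample with the lower CT over the sample with
    the higher CT, assuming `efficiency`-fold amplification per cycle."""
    return efficiency ** (ct_low_expr - ct_high_expr)


def relative_expression_percent(ct: float, ct_min: float,
                                efficiency: float = 2.0) -> float:
    """Expression as a percentage of the panel's highest-expressing
    sample, i.e. the sample with the lowest CT (ct_min)."""
    return 100.0 * efficiency ** (ct_min - ct)


# Fetal liver (CT=31) versus adult liver (CT=40), as quoted in the text.
fetal_vs_adult = fold_difference(31.0, 40.0)

# Fetal liver relative to the panel maximum, SF-295 (CT=28.3).
fetal_liver_pct = relative_expression_percent(31.0, 28.3)

print(f"fetal/adult fold difference: {fetal_vs_adult:.0f}")
print(f"fetal liver relative expression: {fetal_liver_pct:.1f}%")
```

Under this assumption the nine-cycle gap between fetal and adult liver corresponds to a difference of roughly 500-fold, and a CT of 31 against the panel maximum of 28.3 gives about 15%, in line with the value tabulated for fetal liver in the General_screening_panel_v1.4 results.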

In addition, this gene is expressed at high levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4026 Highest expression of the CG95693-01 gene is 
detected in PMA/ionomycin treated basophils (CT=31). In addition, this gene is expressed 
to a lesser extent in untreated KU-812 cells. Therefore, antibody or small molecule 
therapies designed with the protein encoded for by this gene could block or inhibit 
inflammation or tissue damage due to basophil activation in response to asthma, allergies, 
hypersensitivity reactions, psoriasis, and viral infections. 

Moderate to low levels of expression of this gene are also detected in Ramos B cells,
activated lymphocytes, IL-2 treated LAK cells, TNF alpha treated dermal fibroblasts, and
normal tissues represented by colon, lung, thymus and kidney. Therefore, modulation of
the gene product with a functional therapeutic may lead to the alteration of functions
associated with these cell types and lead to improvement of the symptoms of patients
suffering from autoimmune and inflammatory diseases such as asthma, allergies,
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and
osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4026 In this panel the
highest expression of the CG95693-01 gene is detected in a kidney cancer sample
(CT=31.4). Interestingly, expression of this gene is higher in kidney, prostate, and lung
cancer samples as compared to adjacent control samples as well as the melanoma samples
in this panel. Therapeutic modulation of this gene, through the use of small molecule
drugs, protein therapeutics or antibodies could be of benefit in the treatment of melanomas
as well as kidney, prostate and lung cancers.
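
The tumor versus adjacent-tissue comparison described above can be illustrated with a short calculation. This is a sketch only: the numbers are the Rel. Exp.(%) values for the four kidney cancer samples and their matched normal adjacent tissue (NAT) samples as listed in Table AFE, and the ratio used here is an illustrative summary, not a statistic reported in the specification.

```python
# Rel. Exp.(%) for Ag4026, transcribed from Table AFE:
# pair number -> (kidney cancer sample, matched kidney NAT sample)
KIDNEY_PAIRS = {
    1: (12.8, 5.0),
    2: (100.0, 6.3),
    3: (52.9, 4.2),
    4: (12.1, 2.6),
}


def tumor_to_nat_ratio(tumor, nat):
    """Return the tumor/NAT expression ratio, or None if NAT is zero
    (the ratio is undefined when no signal was detected in the control)."""
    if nat == 0:
        return None
    return tumor / nat


ratios = {pair: tumor_to_nat_ratio(tumor, nat)
          for pair, (tumor, nat) in KIDNEY_PAIRS.items()}

for pair, ratio in sorted(ratios.items()):
    print(f"Kidney pair {pair}: tumor/NAT = {ratio:.1f}")
```

Every kidney pair gives a ratio above 1, which agrees with the statement that expression is higher in the cancer samples than in the adjacent controls. The same helper could be applied to the lung and prostate rows of the table.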



AG. CG95814-01: Rho-GTPase-Activating Protein 4 

Expression of gene CG95814-01 was assessed using the primer-probe set Ag4031,
described in Table AGA. Results of the RTQ-PCR runs are shown in Tables AGB, AGC, 
AGD, AGE and AGF. 

Table AGA. Probe Name Ag4031

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-aaccaatgcatctgtcttcaag-3' | 22 | 1053 | 329
Probe | TET-5'-tccatgacctatctgaccttattgatca-3'-TAMRA | 28 | 1082 | 330
Reverse | 5'-tgcatggtagcctaagtcaca-3' | 21 | 1114 | 331
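
The entries of Table AGA can be checked for internal consistency with a few lines of code. The sketch below verifies that each listed length matches its sequence, that the three oligonucleotides do not overlap, and estimates the amplicon span. It assumes that "Start Position" is the lowest-numbered target coordinate covered by each oligonucleotide; that convention is an assumption made here for illustration, and the helper names are hypothetical.

```python
# (sequence, listed length, listed start position) from Table AGA
AG4031 = {
    "Forward": ("aaccaatgcatctgtcttcaag", 22, 1053),
    "Probe": ("tccatgacctatctgaccttattgatca", 28, 1082),
    "Reverse": ("tgcatggtagcctaagtcaca", 21, 1114),
}


def length_matches(seq: str, listed_length: int) -> bool:
    """True when the listed length equals the sequence length."""
    return len(seq) == listed_length


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = seq.lower()
    return (s.count("g") + s.count("c")) / len(s)


def end_position(start: int, length: int) -> int:
    """Last target coordinate covered, under the assumed convention."""
    return start + length - 1


# Oligonucleotides ordered by start position must not overlap.
ordered = sorted(AG4031.values(), key=lambda entry: entry[2])
no_overlap = all(
    end_position(a[2], a[1]) < b[2] for a, b in zip(ordered, ordered[1:])
)

# Span from the first base of the forward primer to the last base
# covered by the reverse primer (assumed convention, see above).
fwd, rev = AG4031["Forward"], AG4031["Reverse"]
amplicon_span = end_position(rev[2], rev[1]) - fwd[2] + 1

for name, (seq, listed, start) in AG4031.items():
    print(name, length_matches(seq, listed), f"GC={gc_fraction(seq):.2f}")
print("no overlap:", no_overlap, "| amplicon span:", amplicon_span)
```

With the tabulated values all three lengths agree with their sequences, and the probe lies between the two primers, as expected for a primer-probe set of this kind.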



Table AGB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4031, Run 212396115 | Tissue Name | Rel. Exp.(%) Ag4031, Run 212396115
AD 1 Hippo | 18.2 | Control (Path) 3 Temporal Ctx | 10.4
AD 2 Hippo | 42.0 | Control (Path) 4 Temporal Ctx | 29.3
AD 3 Hippo | 13.2 | AD 1 Occipital Ctx | 25.9
AD 4 Hippo | 11.4 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 96.6 | AD 3 Occipital Ctx | 7.7
AD 6 Hippo | 83.f | AD 4 Occipital Ctx | 19.9
Control 2 Hippo | 42.9 | AD 5 Occipital Ctx | 21.8
Control 4 Hippo | 20.4 | AD 6 Occipital Ctx | 55.9
Control (Path) 3 Hippo | 5.0 | Control 1 Occipital Ctx | 6.5
AD 1 Temporal Ctx | 26.2 | Control 2 Occipital Ctx | 68.3
AD 2 Temporal Ctx | 29.1 | Control 3 Occipital Ctx | 15.1
AD 3 Temporal Ctx | 8.9 | Control 4 Occipital Ctx | 10.9
AD 4 Temporal Ctx | 0.0 | Control (Path) 1 Occipital Ctx | 88.3
AD 5 Inf Temporal Ctx | 100.0 | Control (Path) 2 Occipital Ctx | 6.5
AD 5 Sup Temporal Ctx | 61.1 | Control (Path) 3 Occipital Ctx | 5.6
AD 6 Inf Temporal Ctx | 71.2 | Control (Path) 4 Occipital Ctx | .0 )
AD 6 Sup Temporal Ctx | 66.9 | Control 1 Parietal Ctx |
Control 1 Temporal Ctx | 5.8 | Control 2 Parietal Ctx | UL6
Control 2 Temporal Ctx | 47.6 | Control 3 Parietal Ctx | 3.4
Control 3 Temporal Ctx | 12.9 | Control (Path) 1 Parietal Ctx | 58.3
Control 4 Temporal Ctx | 9.9 | Control (Path) 2 Parietal Ctx | 5.4
Control (Path) 1 Temporal Ctx | | Control (Path) 3 Parietal Ctx | 5.7
Control (Path) 2 Temporal Ctx | 26.2 | Control (Path) 4 Parietal Ctx | 45.7


Table AGC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4031, Run 218525121 | Tissue Name | Rel. Exp.(%) Ag4031, Run 218525121
Adipose | 10.7 | Renal ca. TK-10 | 34.9
Melanoma* Hs688(A).T | 28.1 | Bladder | 13.4
Melanoma* Hs688(B).T | 24.8 | Gastric ca. (liver met.) NCI-N87 |
Melanoma* M14 | 50.3 | Gastric ca. KATO III | 27.9
Melanoma* LOXIMVI | | Colon ca. SW-948 | 1.9
Melanoma* SK-MEL-5 | 66.0 | Colon ca. SW480 | 66.0
Squamous cell carcinoma SCC-4 | 14.3 | Colon ca.* (SW480 met) SW620 | 14.1
Testis Pool | 4.5 | Colon ca. HT29 | 6.1
Prostate ca.* (bone met) PC-3 | 34.4 | Colon ca. HCT-116 | 26.8
Prostate Pool | 3.1 | Colon ca. CaCo-2 | 6.8
Placenta | 15.6 | Colon cancer tissue | 8.0
Uterus Pool | 4.5 | Colon ca. SW1116 | 1.
Ovarian ca. OVCAR-3 | 15.8 | Colon ca. Colo-205 | 3.5
Ovarian ca. SK-OV-3 | 34.6 | Colon ca. SW-48 | 3.4
Ovarian ca. OVCAR-4 | 15.7 | Colon Pool | 12.9
Ovarian ca. OVCAR-5 | 25.3 | Small Intestine Pool | 9.6
Ovarian ca. IGROV-1 | 28.9 | Stomach Pool | 5.8
Ovarian ca. OVCAR-8 | 14.7 | Bone Marrow Pool | 3.2
Ovary | 9.8 | Fetal Heart | H2
Breast ca. MCF-7 | 17.1 | Heart Pool | 3.9
Breast ca. MDA-MB-231 | 55.1 | Lymph Node Pool | 8.4
Breast ca. BT 549 | | Fetal Skeletal Muscle | 12.3
Breast ca. T47D | 43.8 | Skeletal Muscle Pool | 3.7
Breast ca. MDA-N | 26.8 | Spleen Pool | 16.8
Breast Pool | 10.2 | Thymus Pool | 8.8
Trachea | 6.7 | CNS cancer (glio/astro) U87-MG | 37.4
Lung | 3.1 | CNS cancer (glio/astro) U-118-MG | 48.0
Fetal Lung | 37.4 | CNS cancer (neuro;met) SK-N-AS | 30.1
Lung ca. NCI-N417 | 4.6 | CNS cancer (astro) SF-539 | 45.1
Lung ca. LX-1 | 15.5 | CNS cancer (astro) SNB-75 | 83.5
Lung ca. NCI-H146 | 7.0 | CNS cancer (glio) SNB-19 | 32.5
Lung ca. SHP-77 | 73.2 | CNS cancer (glio) SF-295 | 46.7
Lung ca. A549 | 20.7 | Brain (Amygdala) Pool | 14.5
Lung ca. NCI-H526 | 4.2 | Brain (cerebellum) | 100.0
Lung ca. NCI-H23 | 37.4 | Brain (fetal) | 55.9
Lung ca. NCI-H460 | 20.0 | Brain (Hippocampus) Pool | 16.3
Lung ca. HOP-62 | 24.1 | Cerebral Cortex Pool | 15.5
Lung ca. NCI-H522 | 31.0 | Brain (Substantia nigra) Pool | 18.0
Liver | 0.9 | Brain (Thalamus) Pool | 23.7
Fetal Liver | 12.6 | Brain (whole) | 32.1
Liver ca. HepG2 | 15.4 | Spinal Cord Pool | 18.9
Kidney Pool | 18.0 | Adrenal Gland | 9.3
Fetal Kidney | 13.7 | Pituitary gland Pool | 2.2
Renal ca. 786-0 | 54.0 | Salivary Gland | 1.1
Renal ca. A498 | 8.0 | Thyroid (female) | 2.1
Renal ca. ACHN | 19.1 | Pancreatic ca. CAPAN2 | 31.9
Renal ca. UO-31 | 24.5 | Pancreas Pool |



Table AGD. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4031, Run 171616912 | Tissue Name | Rel. Exp.(%) Ag4031, Run 171616912
Secondary Th1 act | 49.3 | HUVEC IL-1beta | 49.7
Secondary Th2 act | 100.0 | HUVEC IFN gamma | 43.5
Secondary Tr1 act | 62.0 | HUVEC TNF alpha + IFN gamma | 32.5
Secondary Th1 rest | 13.2 | HUVEC TNF alpha + IL4 | 28.3
Secondary Th2 rest | 13.9 | HUVEC IL-11 | 25.2
Secondary Tr1 rest | 11.9 | Lung Microvascular EC none | 55.9
Primary Th1 act | | Lung Microvascular EC TNFalpha + IL-1beta | 31.9
Primary Th2 act | 23.0 | Microvascular Dermal EC none | 42.6
Primary Tr1 act | 22.5 | Microvascular Dermal EC TNFalpha + IL-1beta | 27.5
Primary Th1 rest | 13.6 | Bronchial epithelium TNFalpha + IL1beta | 19.8
Primary Th2 rest | 9.2 | Small airway epithelium none | 6.9
Primary Tr1 rest | 16.3 | Small airway epithelium TNFalpha + IL-1beta | 13.3
CD45RA CD4 lymphocyte act | 26.8 | Coronary artery SMC rest | 22,
CD45RO CD4 lymphocyte act | 22.7 | Coronary artery SMC TNFalpha + IL-1beta | 24.7
CD8 lymphocyte act | 19.3 | Astrocytes rest | 38.4
Secondary CD8 lymphocyte rest | 14.4 | Astrocytes TNFalpha + IL-1beta | 26.8
Secondary CD8 lymphocyte act | 21.5 | KU-812 (Basophil) rest | 9.3
CD4 lymphocyte none | 4.9 | KU-812 (Basophil) PMA/ionomycin | 49.7
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 21.9 | CCD1106 (Keratinocytes) none | 38.2
LAK cells rest | 24.7 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 33.2
LAK cells IL-2 | 19.3 | Liver cirrhosis | 5.0
LAK cells IL-2+IL-12 | 12.4 | NCI-H292 none | 20.6
LAK cells IL-2+IFN gamma | 17.0 | NCI-H292 IL-4 | 24.5
LAK cells IL-2+IL-18 | 22.8 | NCI-H292 IL-9 | 37.1
LAK cells PMA/ionomycin | 8.4 | NCI-H292 IL-13 | 25.2
NK Cells IL-2 rest | 30.1 | NCI-H292 IFN gamma | 28.9
Two Way MLR 3 day | 22.2 | HPAEC none | 34.9
Two Way MLR 5 day | 19.5 | HPAEC TNF alpha + IL-1 beta | 43.8
Two Way MLR 7 day | 22.5 | Lung fibroblast none | 26.6
PBMC rest | 6.6 | Lung fibroblast TNF alpha + IL-1 beta | 24.7
PBMC PWM | 13.7 | Lung fibroblast IL-4 | 13.6
PBMC PHA-L | 19.6 | Lung fibroblast IL-9 | 23.0
Ramos (B cell) none | 20.2 | Lung fibroblast IL-13 | 14.9
Ramos (B cell) ionomycin | 23.7 | Lung fibroblast IFN gamma |
B lymphocytes PWM | 22.7 | Dermal fibroblast CCD1070 rest | 37.9
B lymphocytes CD40L and IL-4 | 22.7 | Dermal fibroblast CCD1070 TNF alpha | 54.7
EOL-1 dbcAMP | 25.9 | Dermal fibroblast CCD1070 IL-1 beta | 21.5
EOL-1 dbcAMP PMA/ionomycin | 27.2 | Dermal fibroblast IFN gamma | 29.7
Dendritic cells none | 36.1 | Dermal fibroblast IL-4 | 46.3
Dendritic cells LPS | 76.8 | Dermal Fibroblasts rest | 26.4
Dendritic cells anti-CD40 | 32.5 | Neutrophils TNFa+LPS | 2.6
Monocytes rest | 33.4 | Neutrophils rest | 9.7
Monocytes LPS | 42.0 | Colon | 4.7
Macrophages rest | 18.6 | Lung | 27.0
Macrophages LPS | 26.8 | Thymus | 20.6
HUVEC none | 37.1 | Kidney | 16.6
HUVEC starved | 45.1 | |







Table AGE. Panel CNS_1

Tissue Name | Rel. Exp.(%) Ag4031, Run 180912028 | Tissue Name | Rel. Exp.(%) Ag4031, Run 180912028
BA4 Control | 24.1 | BA17 PSP | 30.1
BA4 Control | 50.7 | BA17 PSP2 | 7.1
BA4 Alzheimer's2 | 3.5 | Sub Nigra Control | 37.9
BA4 Parkinson's | 68.3 | Sub Nigra Control2 | 34.4
BA4 Parkinson's2 | 75.8 | Sub Nigra Alzheimer's2 | 25.5
BA4 Huntington's | 42.3 | Sub Nigra Parkinson's2 | 77.9
BA4 Huntington's2 | 6.5 | Sub Nigra Huntington's | 100.0
BA4 PSP | 8.2 | Sub Nigra Huntington's2 | 69.7
BA4 PSP2 | 30.4 | Sub Nigra PSP2 | 12.8
BA4 Depression | 49.7 | Sub Nigra Depression | 20.7
BA4 Depression2 | 15 ■ | Sub Nigra Depression2 | 14.9
BA7 Control | 31.9 | Glob Palladus Control | 20.2
BA7 Control | 31.6 | Glob Palladus Control | 15.1
BA7 Alzheimer's2 | 7.4 | Glob Palladus Alzheimer's | 18.6
BA7 Parkinson's | 27.5 | Glob Palladus Alzheimer's2 | 8.3
BA7 Parkinson's2 | 47.6 | Glob Palladus Parkinson's | 81.8
BA7 Huntington's | 75.3 | Glob Palladus Parkinson's2 | 19.9
BA7 Huntington's2 | 48.6 | Glob Palladus PSP |
BA7 PSP | 38.7 | Glob Palladus PSP2 | 8.0
BA7 PSP2 | 23.3 | Glob Palladus Depression | 10.2
BA7 Depression | 13.6 | Temp Pole Control | 12.4
BA9 Control | 39.0 | Temp Pole Control2 | 40.1
BA9 Control2 | 83.5 | Temp Pole Alzheimer's | 5.5
BA9 Alzheimer's | 5.0 | Temp Pole Alzheimer's2 | 6.1
BA9 Alzheimer's2 | 19.8 | Temp Pole Parkinson's | 35.1
BA9 Parkinson's | 42.6 | Temp Pole Parkinson's2 | 38.4
BA9 Parkinson's2 | 50.3 | Temp Pole Huntington's | 49.0
BA9 Huntington's | 59.9 | Temp Pole PSP | 3.3
BA9 Huntington's2 | 11.8 | Temp Pole PSP2 | 5.3
BA9 PSP | 15.4 | Temp Pole Depression2 | 11.2
BA9 PSP2 | 4.4 | Cing Gyr Control | 57.8
BA9 Depression | 10.7 | Cing Gyr Control2 | 32.1
BA9 Depression2 | 8.2 | Cing Gyr Alzheimer's | 20.9
BA17 Control | 44.8 | Cing Gyr Alzheimer's2 | 10.6
BA17 Control | 46.0 | Cing Gyr Parkinson's | 38.7
BA17 Alzheimer's2 | | Cing Gyr Parkinson's2 | 46.0
BA17 Parkinson's | 40.0 | Cing Gyr Huntington's | 76.]
BA17 Parkinson's2 | 57.8 | Cing Gyr Huntington's2 | 38.2
BA17 Huntington's | 44.4 | Cing Gyr PSP | 20.2
BA17 Huntington's2 | 24.7 | Cing Gyr PSP2 |
BA17 Depression | 12.7 | Cing Gyr Depression | 10.2
BA17 Depression2 | 32.5 | Cing Gyr Depression2 | 19.6


Table AGF. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4031, Run 268362913 | Tissue Name | Rel. Exp.(%) Ag4031, Run 268362913
Colon cancer 1 | 13.4 | Bladder cancer NAT 2 | 1.0
Colon NAT 1 | 2.2 | Bladder cancer NAT 3 | 1.3
Colon cancer 2 | 27.4 | Bladder cancer NAT 4 | 5.8
Colon cancer NAT 2 | 5.6 | Adenocarcinoma of the prostate 1 | 9.3
Colon cancer 3 | 15.2 | Adenocarcinoma of the prostate 2 | 2.1
Colon cancer NAT 3 | 15.8 | Adenocarcinoma of the prostate 3 | 9.0
Colon malignant cancer 4 | 35.4 | Adenocarcinoma of the prostate 4 | 8.8
Colon normal adjacent tissue 4 | 2.7 | Prostate cancer NAT 5 | 6.3
Lung cancer 1 | 13.9 | Adenocarcinoma of the prostate 6 | 3.1
Lung NAT 1 | 4.3 | Adenocarcinoma of the prostate 7 | 6.1
Lung cancer 2 | 100.0 | Adenocarcinoma of the prostate 8 | 1.2
Lung NAT 2 | 9.3 | Adenocarcinoma of the prostate 9 | 25.2
Squamous cell carcinoma 3 | 29.5 | Prostate cancer NAT 10 | 1.7
Lung NAT 3 | 1.6 | Kidney cancer 1 | 34.2
metastatic melanoma 1 | 20.2 | Kidney NAT 1 | 19.1
Melanoma 2 | 8.5 | Kidney cancer 2 | 42.0
Melanoma 3 | 8.0 | Kidney NAT 2 | 38.4
metastatic melanoma 4 | 93.3 | Kidney cancer 3 | 33.4
metastatic melanoma 5 | 65.1 | Kidney NAT 3 | 19.1
Bladder cancer 1 | 1.6 | Kidney cancer 4 | 15.5
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 8.4
Bladder cancer 2 | 6.0 | |





CNS_neurodegeneration_v1.0 Summary: Ag4031 This panel confirms the
expression of the CG95814-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4031 Highest expression of the
CG95814-01 gene is detected in a brain (cerebellum) sample (CT=25.6). High expression of
this gene is seen in all regions of the central nervous system examined, including
amygdala, hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and
spinal cord. Therefore, this gene may play a role in central nervous system disorders such
as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and
depression.

The CG95814-01 gene codes for a homolog of RHO-GTPase activating protein
(RHOGAP). RHOGAP stimulates the GTPase activity of Rho protein. Since Rho proteins
have been implicated in ischemia (Trapp et al., 2001, Mol Cell Neurosci 17(5):883-94,
PMID: 11358485), inhibitors for the RHOGAP encoded by this gene may have a
therapeutic utility in the treatment of pathological aspects following brain damage
including neuronal death.

High expression of this gene is also observed in a cluster of cancer cell lines
including pancreatic, CNS, colon, renal, lung, breast, ovarian, prostate, squamous cell
carcinoma, and melanoma cancer cell lines. Therefore, therapeutic modulation of this gene
can be useful in treatment of these cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

Interestingly, this gene is expressed at much higher levels in fetal lung and liver (CT=27-28)
than in adult lung and liver (CTs=30-32). This observation suggests that
expression of this gene can be used to distinguish fetal from adult lung and liver. In
addition, the relative overexpression of this gene in fetal tissue suggests that the protein
product may enhance growth or development of lung and liver in the fetus and thus may
also act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the
protein encoded by this gene could be useful in treatment of liver and lung related
diseases.

Panel 4.1D Summary: Ag4031 Highest expression of the CG95814-01 gene is
detected in activated secondary Th2 cells (CT=27). This gene is expressed at high to
moderate levels in a wide range of cell types of significance in the immune response in
health and disease. These cells include members of the T-cell, B-cell, endothelial cell,
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial
and fibroblast cell types from lung and skin, and normal tissues represented by colon,
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene
product may be involved in homeostatic processes for these and other cell types and
tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.

Panel CNS_1 Summary: Ag4031 This panel confirms the expression of the
CG95814-01 gene at low levels in the brains of an independent group of individuals.
Please see Panel 1.4 for a discussion of the potential utility of this gene in treatment of
central nervous system disorders.

General oncology screening panel_v_2.4 Summary: Ag4031 Highest
expression of the CG95814-01 gene is detected in a lung cancer sample (CT=27). Higher
expression of this gene is also seen in melanoma, colon, kidney, and prostate cancers, and this
expression is lower in corresponding control samples. Thus, expression of this gene can be
used as a diagnostic marker, and therapeutic modulation of this gene product may be useful
in the treatment of these cancers.

AH. CG95824-01: Rho GAP 

Expression of gene CG95824-01 was assessed using the primer-probe set Ag4032,
described in Table AHA. Results of the RTQ-PCR runs are shown in Tables AHB, AHC, 
AHD and AHE. 

Table AHA. Probe Name Ag4032

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-ctcctacctggaaggctacatc-3' | 22 | 2475 | 332
Probe | TET-5'-atggaccccgtcaatgacactaaagg-3'-TAMRA | 26 | 2506 | 333
Reverse | 5'-cagtacttggcatacgtgctta-3' | 22 | 2540 | 334



Table AHB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4032, Run 212396260 | Tissue Name | Rel. Exp.(%) Ag4032, Run 212396260
AD 1 Hippo | 9.9 | Control (Path) 3 Temporal Ctx | 5.9
AD 2 Hippo | 14.8 | Control (Path) 4 Temporal Ctx | 27.0
AD 3 Hippo | 5.2 | AD 1 Occipital Ctx | 14.9
AD 4 Hippo | 3.5 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 100.0 | AD 3 Occipital Ctx | 5.2
AD 6 Hippo | 29.3 | AD 4 Occipital Ctx | 8.2
Control 2 Hippo | 22.8 | AD 5 Occipital Ctx | 45.4
Control 4 Hippo | 2.5 | AD 6 Occipital Ctx | 16.0
Control (Path) 3 Hippo | 3.5 | Control 1 Occipital Ctx | 1.4
AD 1 Temporal Ctx | 1 0.5 | Control 2 Occipital Ctx | 84.7
AD 2 Temporal Ctx | 19.2 | Control 3 Occipital Ctx | 21.8
AD 3 Temporal Ctx | 4.7 | Control 4 Occipital Ctx | 1.6
AD 4 Temporal Ctx | 1 1.7 | Control (Path) 1 Occipital Ctx | 53.2
AD 5 Inf Temporal Ctx | 73.2 | Control (Path) 2 Occipital Ctx | 14.2
AD 5 Sup Temporal Ctx | 27.0 | Control (Path) 3 Occipital Ctx | 3.3
AD 6 Inf Temporal Ctx | | Control (Path) 4 Occipital Ctx | 17.2
AD 6 Sup Temporal Ctx | 39.2 | Control 1 Parietal Ctx | 5.6
Control 1 Temporal Ctx | 4.2 | Control 2 Parietal Ctx | 32.8
Control 2 Temporal Ctx | 35.6 | Control 3 Parietal Ctx | 23.0
Control 3 Temporal Ctx | 15.5 | Control (Path) 1 Parietal Ctx | 62.0
Control 4 Temporal Ctx | 6.7 | Control (Path) 2 Parietal Ctx | 18.8
Control (Path) 1 Temporal Ctx | 41.8 | Control (Path) 3 Parietal Ctx | 3.0
Control (Path) 2 Temporal Ctx | 32.8 | Control (Path) 4 Parietal Ctx | 37.4


Table AHC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4032, Run 218525131 | Tissue Name | Rel. Exp.(%) Ag4032, Run 218525131
Adipose | 0.5 | Renal ca. TK-10 | 22.7
Melanoma* Hs688(A).T | 2.7 | Bladder | 4.2
Melanoma* Hs688(B).T | 2.4 | Gastric ca. (liver met.) NCI-N87 | 31.4
Melanoma* M14 | 4.4 | Gastric ca. KATO III | 41.8
Melanoma* LOXIMVI | 12.4 | Colon ca. SW-948 | 8.6
Melanoma* SK-MEL-5 | 25.3 | Colon ca. SW480 | 17.2
Squamous cell carcinoma SCC-4 | 5.0 | Colon ca.* (SW480 met) SW620 | 11.3
Testis Pool | 19.6 | Colon ca. HT29 | b 8
Prostate ca.* (bone met) PC-3 | 30.8 | Colon ca. HCT-116 | 40.9
Prostate Pool | 2.0 | Colon ca. CaCo-2 | 9.7
Placenta | 2.5 | Colon cancer tissue | 8.8
Uterus Pool | 0.0 | Colon ca. SW1116 | 2.5
Ovarian ca. OVCAR-3 | | Colon ca. Colo-205 | 2.3
Ovarian ca. SK-OV-3 | 16.5 | Colon ca. SW-48 | 5.2
Ovarian ca. OVCAR-4 | 10.1 | Colon Pool | 2.1
Ovarian ca. OVCAR-5 | 35.1 | Small Intestine Pool | 1.1
Ovarian ca. IGROV-1 | 4.2 | Stomach Pool | 1.6
Ovarian ca. OVCAR-8 | 15.8 | Bone Marrow Pool | 0.5
Ovary | 2.6 | Fetal Heart | 7.3
Breast ca. MCF-7 | 37.9 | Heart Pool | 1.5
Breast ca. MDA-MB-231 | 18.6 | Lymph Node Pool | 2.3
Breast ca. BT 549 | 84.7 | Fetal Skeletal Muscle | 5.9
Breast ca. T47D | 58.2 | Skeletal Muscle Pool | 0.3
Breast ca. MDA-N | 3.0 | Spleen Pool | 1.7
Breast Pool | 3.0 | Thymus Pool | 3.9
Trachea | 10.2 | CNS cancer (glio/astro) U87-MG | 9.9
Lung | 0.1 | CNS cancer (glio/astro) U-118-MG | 10/t
Fetal Lung | 17.9 | CNS cancer (neuro;met) SK-N-AS | 20.9
Lung ca. NCI-N417 | 3.2 | CNS cancer (astro) SF-539 | :5.()
Lung ca. LX-1 | 9.9 | CNS cancer (astro) SNB-75 | 1 2.6
Lung ca. NCI-H146 | 10.4 | CNS cancer (glio) SNB-19 | 120
Lung ca. SHP-77 | 50.0 | CNS cancer (glio) SF-295 | nT.2
Lung ca. A549 | 16.8 | Brain (Amygdala) Pool | 33.0
Lung ca. NCI-H526 | 18.4 | Brain (cerebellum) | 87.7
Lung ca. NCI-H23 | | Brain (fetal) | 100.0
Lung ca. NCI-H460 | 5.5 | Brain (Hippocampus) Pool | 33.4
Lung ca. HOP-62 | 5.1 | Cerebral Cortex Pool | 69.3
Lung ca. NCI-H522 | 33.7 | Brain (Substantia nigra) Pool | 55.5
Liver | 0.2 | Brain (Thalamus) Pool | 51.1
Fetal Liver | 1.6 | Brain (whole) | 69.3
Liver ca. HepG2 | 4.8 | Spinal Cord Pool | 23.2
Kidney Pool | 2.0 | Adrenal Gland | 4.9
Fetal Kidney | 15.8 | Pituitary gland Pool | 2.7
Renal ca. 786-0 | 4.5 | Salivary Gland | 2.5
Renal ca. A498 | 1.6 | Thyroid (female) | 3.1


Renal ca. ACHN 


4.9 


Pancreatic ca. CAPAN2 


10.2 


Renal ca. UO-31 


6.5 


Pancreas Pool 


6.8 


Table AHD. Panel 4.1D 


Tissue Name


Rel. Exp.(%)
Ag4032, Run
171616913


Tissue Name


Rel. Exp.(%)
Ag4032, Run
171616913


Secondary Th1 act


0.0


HUVEC IL-1beta


11.1 


Secondary Th2 act 


1.4 


HUVEC IFN gamma 


24.7 


Secondary Tr1 act


0.0 


HUVEC TNF alpha + IFN gamma 


7.7 


Secondary Th1 rest


0.0


HUVEC TNF alpha + IL4


17.9 


Secondary Th2 rest 




HUVEC IL-11


14.3


Secondary Tr1 rest


0.0 


Lung Microvascular EC none 


11.0 


Primary Th1 act


0.0


Lung Microvascular EC TNFalpha +
IL-1beta


20.7 


Primary Th2 act 


1.3 


Microvascular Dermal EC none


20.2


Primary Tr1 act


1.0


Microvascular Dermal EC TNFalpha
+ IL-1beta


10.2 


Primary Th1 rest


0.0


Bronchial epithelium TNFalpha +
IL-1beta


19.5 


Primary Th2 rest 


0.0 


Small airway epithelium none


6.0


Primary Tr1 rest


0.0


Small airway epithelium TNFalpha +
IL-1beta


10.5 


CD45RA CD4 lymphocyte 
act 


15.4 


Coronary artery SMC rest


,5, I 


CD45RO CD4 lymphocyte 


0.0 


Coronary artery SMC TNFalpha +
IL-1beta


1 

6.7 


CD8 lymphocyte act


0.0 


Astrocytes rest 


31.0 


Secondary CD8 
lymphocyte rest 


6.4 


Astrocytes TNFalpha + IL-1 beta 


22.2 


Secondary CD8 lymphocyte act


CD4 lymphocyte none


0.0 


KU-812 (Basophil) rest


13.7


KU-812 (Basophil) PMA/ionomycin


14.5


2ry Th1/Th2/Tr1 anti-
CD95 CH11


0.0


CCD1106 (Keratinocytes) none


40.9 


LAK cells rest 


0.0 


CCD1106 (Keratinocytes) TNFalpha
+ IL-1beta


22.1 


LAK cells IL-2 




Liver cirrhosis 


8.8 


LAK cells IL-2+IL-12 




NCI-H292 none


40.3 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


56.6 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-9 


51.8 


LAK cells 
PMA/ionomycin 


0.0


NCI-H292 IL-13


31.9


NK Cells IL-2 rest


0.0


NCI-H292 IFN gamma


100.0


Two Way MLR 3 day


10 


HPAEC none 


9.2 


Two Way MLR 5 day 


3.0


HPAEC TNF alpha + IL-1 beta


32.3


Two Way MLR 7 day


0.0 


Lung fibroblast none 


35.1 


PBMC rest 


1.3 


Lung fibroblast TNF alpha + IL-1 
beta 




PBMC PWM 


0.0 


Lung fibroblast IL-4 


17.8 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


34.6 


Ramos (B cell) none
Ramos (B cell) ionomycin


10.8


Lung fibroblast IL-13




12.9


Lung fibroblast IFN gamma


31.2

24.5


B lymphocytes PWM


0.0


Dermal fibroblast CCD1070 rest


B lymphocytes CD40L and 
IL-4 




0.0 


Dermal fibroblast CCD1070 TNF alpha


15.7 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1
beta


12.6


EOL-1 dbcAMP
PMA/ionomycin


0.0


Dermal fibroblast IFN gamma


11.0


Dendritic cells none


3.5


Dermal fibroblast IL-4


40.6


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


25.2 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.0


Colon


13.9


Macrophages rest


0.0


Lung


6.0 


Macrophages LPS 


9.0 


Thymus 


15.5 


HUVEC none


15.2


Kidney


26.6


HUVEC starved


10.7







Table AHE. Panel CNS_1

Tissue Name | Rel. Exp.(%) Ag4032, Run 180912030 | Tissue Name | Rel. Exp.(%) Ag4032, Run 180912030
BA4 Control | 19.1 | BA17 PSP | 31.6
BA4 Control2 | 36.9 | BA17 PSP2 | 17.1
BA4 Alzheimer's2 | 13.0 | Sub Nigra Control | 10.1
BA4 Parkinson's | 56.6 | Sub Nigra Control2 | 24.3
BA4 Parkinson's2 | 88.9 | Sub Nigra Alzheimer's2 | 3.8
BA4 Huntington's | 69.3 | Sub Nigra Parkinson's2 | 21.0
BA4 Huntington's2 | 11.9 | Sub Nigra Huntington's | 23.8
BA4 PSP | 13.8 | Sub Nigra Huntington's2 | 19.3
BA4 PSP2 | 29.7 | Sub Nigra PSP2 | 6.6
BA4 Depression | 14.0 | Sub Nigra Depression | 3.0
BA4 Depression2 | 13.1 | Sub Nigra Depression2 | 2.8
BA7 Control | 71.7 | Glob Palladus Control | 0.9
BA7 Control2 | 39.5 | Glob Palladus Control2 | 0.0
BA7 Alzheimer's2 | 15.0 | Glob Palladus Alzheimer's | 2.5
BA7 Parkinson's | 38.4 | Glob Palladus Alzheimer's2 | 2.9
BA7 Parkinson's2 | 64.0 | Glob Palladus Parkinson's | 76.3
BA7 Huntington's | [illegible] | Glob Palladus Parkinson's2 | 3.5
BA7 Huntington's2 | 42.9 | Glob Palladus PSP | 0.0
BA7 PSP | 44.1 | Glob Palladus PSP2 | 3.2
BA7 PSP2 | 37.6 | Glob Palladus Depression | 0.0
BA7 Depression | 8.8 | Temp Pole Control | 10.2
BA9 Control | 17.7 | Temp Pole Control2 | 36.1
BA9 Control2 | 100.0 | Temp Pole Alzheimer's | 8.7
BA9 Alzheimer's | 7.1 | Temp Pole Alzheimer's2 | 6.4
BA9 Alzheimer's2 | 25.9 | Temp Pole Parkinson's | 32.8
BA9 Parkinson's | 65.1 | Temp Pole Parkinson's2 | 18.7
BA9 Parkinson's2 | 49.3 | Temp Pole Huntington's | 31.0
BA9 Huntington's | 41.5 | Temp Pole PSP | 8.1
BA9 Huntington's2 | 34.9 | Temp Pole PSP2 | 3.8
BA9 PSP | 11.4 | Temp Pole Depression2 | 5.9
BA9 PSP2 | 3.6 | Cing Gyr Control | 37.4
BA9 Depression | 18.8 | Cing Gyr Control2 | 28.5
BA9 Depression2 | 13.7 | Cing Gyr Alzheimer's | 9.9
BA17 Control | 92.7 | Cing Gyr Alzheimer's2 | [illegible]
BA17 Control2 | 57.8 | Cing Gyr Parkinson's | 26.2
BA17 Alzheimer's2 | 27.2 | Cing Gyr Parkinson's2 | 8.5
BA17 Parkinson's | 62.9 | Cing Gyr Huntington's | 43.5
BA17 Parkinson's2 | 74.7 | Cing Gyr Huntington's2 | 5.6
BA17 Huntington's | 77.4 | Cing Gyr PSP | 2.8
BA17 Huntington's2 | 26.1 | Cing Gyr PSP2 | 1.4
BA17 Depression | 18.4 | Cing Gyr Depression | 0.0
BA17 Depression2 | 39.2 | Cing Gyr Depression2 | 10.8



CNS_neurodegeneration_v1.0 Summary: Ag4032 This panel confirms the
expression of the CG95824-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4032 Highest expression of the
CG95824-01 gene is detected in fetal brain (CT=29.8). High expression of this gene is
seen in all the regions of the central nervous system examined, including amygdala,
hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord.
Therefore, this gene may play a role in central nervous system disorders such as
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and
depression, and therapeutic modulation of this gene product may be beneficial in the
treatment of these CNS disorders.

The CG95824-01 gene codes for a RHO-GTPase activating protein (RHOGAP).
RHOGAP stimulates the GTPase activity of Rho proteins. Since Rho proteins have been
implicated in ischemia (Trapp et al., 2001, Mol Cell Neurosci 17(5):883-94, PMID:
11358485), inhibitors of the RHOGAP encoded by this gene may have therapeutic
utility in the treatment of pathological aspects following brain damage, including neuronal
death.

Moderate levels of expression of this gene are also seen in a cluster of cancer cell
lines, including melanoma, CNS, colon, breast, ovarian, lung, gastric, renal, pancreatic and
prostate cancer cell lines. Therefore, therapeutic modulation of this gene product through
the use of small molecule drugs may be useful in the treatment of these cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adrenal gland, thyroid, fetal heart, and fetal skeletal
muscle. Therefore, therapeutic modulation of the activity of this gene may prove useful in
the treatment of endocrine/metabolically related diseases, such as obesity and diabetes.

Interestingly, expression of this gene is higher in fetal (CTs=32-33) as compared to
adult lung, heart and skeletal muscle (CTs>36). Therefore, expression of this gene can be
used to distinguish fetal from adult samples of these tissues. In addition, the relative
overexpression of this gene in fetal tissue suggests that the protein product may enhance growth
or development of these tissues in the fetus and thus may also act in a regenerative
capacity in the adult. Therefore, therapeutic modulation of the RHOGAP encoded by this
gene could be useful in treatment of muscle, lung and heart related diseases.
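
As a rough check on the fetal versus adult CT comparison, the following Python sketch converts a CT difference into an approximate fold difference in transcript level. It assumes ideal amplification (template doubling each cycle) and equal input across samples; these assumptions and the function name `fold_difference` are illustrative and are not taken from this specification.

```python
def fold_difference(ct_lower_expr: float, ct_higher_expr: float,
                    efficiency: float = 2.0) -> float:
    """Approximate fold difference implied by two CT values.

    A lower CT means more starting template. With an assumed per-cycle
    amplification factor `efficiency` (2.0 = perfect doubling), each cycle
    of difference corresponds to one factor of `efficiency`.
    """
    return efficiency ** (ct_lower_expr - ct_higher_expr)


# Fetal tissues are reported at CT = 32-33, adult tissues at CT > 36.
# Taking the closest bounds (fetal 33, adult 36) gives a minimum estimate.
min_fold = fold_difference(ct_lower_expr=36.0, ct_higher_expr=33.0)
print(f"fetal expression is at least ~{min_fold:.0f}-fold higher")  # 8-fold
```

Because the adult CTs are reported only as greater than 36, the result is a lower bound rather than a point estimate.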

Panel 4.1D Summary: Ag4032 Highest expression of the CG95824-01 gene is
detected in IFN gamma treated NCI-H292 cells (CT=33.4). Moderate to low expression of
this gene is seen in NCI-H292 cells, HPAEC, lung fibroblasts, and IL-4 treated dermal
fibroblasts. Therefore, therapeutics designed with the protein encoded by this gene may
reduce or eliminate symptoms caused by inflammation in lung epithelia, in chronic
obstructive pulmonary disease, asthma, allergy, and emphysema.

Panel CNS_1 Summary: Ag4032 This panel confirms the expression of the
CG95824-01 gene at low levels in the brains of an independent group of individuals.
Please see Panel 1.4 for a discussion of the potential utility of this gene in treatment of
central nervous system disorders.



AI. CG96231-01 and CG96231-02: OTU-Like Cysteine Protease 

Expression of gene CG96231-01 and full length physical clone CG96231-02 was
assessed using the primer-probe set Ag4043, described in Table AIA. Results of the
RTQ-PCR runs are shown in Tables AIB, AIC, AID and AIE. Please note that CG96231-02
represents a full-length physical clone of the CG96231-01 gene, validating the prediction
of the gene sequence.

Table AIA. Probe Name Ag4043

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-agcgtaacttccctgatcca-3' | 20 | 815 | 335
Probe | TET-5'-cacctcctctgaccattttctcctct-3'-TAMRA | 26 | 839 | 336
Reverse | 5'-tcatctgctaattccagtgctt-3' | 22 | 887 | 337
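
Because the Ag4043 sequences passed through text extraction, a quick consistency check against the stated lengths can catch dropped or inserted bases. The sketch below is only an illustration: the dictionary layout and the helper name `check_oligos` are hypothetical, while the sequences and lengths are those listed for Table AIA.

```python
# Oligo name -> (sequence as listed, length as listed in Table AIA)
AG4043 = {
    "Forward": ("agcgtaacttccctgatcca", 20),
    "Probe": ("cacctcctctgaccattttctcctct", 26),
    "Reverse": ("tcatctgctaattccagtgctt", 22),
}


def check_oligos(table):
    """Return, per oligo, whether it uses only a/c/g/t and matches its stated length."""
    results = {}
    for name, (seq, stated_length) in table.items():
        valid_bases = set(seq) <= set("acgt")
        results[name] = valid_bases and len(seq) == stated_length
    return results


for name, ok in check_oligos(AG4043).items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

All three entries are consistent with their stated lengths, which suggests the sequences survived extraction intact.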



Table AIB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4043, Run 214151952 | Tissue Name | Rel. Exp.(%) Ag4043, Run 214151952
AD 1 Hippo | 13.8 | Control (Path) 3 Temporal Ctx | 4.5
AD 2 Hippo | 15.4 | Control (Path) 4 Temporal Ctx | 28.3
AD 3 Hippo | 8.6 | AD 1 Occipital Ctx | 40.6
AD 4 Hippo | 8.1 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 98.6 | AD 3 Occipital Ctx | 10.2
AD 6 Hippo | 72.2 | AD 4 Occipital Ctx | 19.8
Control 2 Hippo | 20.6 | AD 5 Occipital Ctx | 41.2
Control 4 Hippo | [illegible] | AD 6 Occipital Ctx | 31.4
Control (Path) 3 Hippo | 6.3 | Control 1 Occipital Ctx | 3.3
AD 1 Temporal Ctx | 16.6 | Control 2 Occipital Ctx | 43.8
AD 2 Temporal Ctx | 19.3 | Control 3 Occipital Ctx | 11.1
AD 3 Temporal Ctx | 8.5 | Control 4 Occipital Ctx | 7.4
AD 4 Temporal Ctx | 20.6 | Control (Path) 1 Occipital Ctx | 100.0
AD 5 Inf Temporal Ctx | 74.2 | Control (Path) 2 Occipital Ctx | [illegible]
AD 5 Sup Temporal Ctx | 20.4 | Control (Path) 3 Occipital Ctx | 2.0
AD 6 Inf Temporal Ctx | 87.7 | Control (Path) 4 Occipital Ctx | 12.9
AD 6 Sup Temporal Ctx | 70.2 | Control 1 Parietal Ctx | 6.2
Control 1 Temporal Ctx | 4.5 | Control 2 Parietal Ctx | 44.1
Control 2 Temporal Ctx | 16.7 | Control 3 Parietal Ctx | 
Control 3 Temporal Ctx | 11.5 | Control (Path) 1 Parietal Ctx | 62.0
Control 4 Temporal Ctx | 5.0 | Control (Path) 2 Parietal Ctx | 20.6
Control (Path) 1 Temporal Ctx | 35.1 | Control (Path) 3 Parietal Ctx | 1.4
Control (Path) 2 Temporal Ctx | [illegible] | Control (Path) 4 Parietal Ctx | 40.3


Table AIC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4043, Run 218426211 | Tissue Name | Rel. Exp.(%) Ag4043, Run 218426211
Adipose | 4.4 | Renal ca. TK-10 | 19.8
Melanoma* Hs688(A).T | 5.9 | Bladder | 10.4
Melanoma* Hs688(B).T | 4.5 | Gastric ca. (liver met.) NCI-N87 | 33.9
Melanoma* M14 | 8.8 | Gastric ca. KATO III | 27.9
Melanoma* LOXIMVI | 15.6 | Colon ca. SW-948 | 
Melanoma* SK-MEL-5 | 19.1 | Colon ca. SW480 | 
Squamous cell carcinoma SCC-4 | 13.7 | Colon ca.* (SW480 met) SW620 | 11.0
Testis Pool | 4.9 | Colon ca. HT29 | 10.5
Prostate ca.* (bone met) PC-3 | 46.3 | Colon ca. HCT-116 | 27.7
Prostate Pool | 2.2 | Colon ca. CaCo-2 | 18.0
Placenta | 3.5 | Colon cancer tissue | 6.5
Uterus Pool | 2.3 | Colon ca. SW1116 | 2.5
Ovarian ca. OVCAR-3 | 7.3 | Colon ca. Colo-205 | 4.7
Ovarian ca. SK-OV-3 | [illegible] | Colon ca. SW-48 | 4.0
Ovarian ca. OVCAR-4 | 3.7 | Colon Pool | 4.5
Ovarian ca. OVCAR-5 | 7.3 | Small Intestine Pool | 5.3
Ovarian ca. IGROV-1 | [illegible] | Stomach Pool | 5.5
Ovarian ca. OVCAR-8 | [illegible] | Bone Marrow Pool | [illegible]
Ovary | [illegible] | Fetal Heart | [illegible]
Breast ca. MCF-7 | [illegible] | Heart Pool | [illegible]
Breast ca. MDA-MB-231 | 29.9 | Lymph Node Pool | 6.7
Breast ca. BT 549 | 14.7 | Fetal Skeletal Muscle | 1.8
Breast ca. T47D | 36.3 | Skeletal Muscle Pool | 1.4
Breast ca. MDA-N | 4.1 | Spleen Pool | 7.9
Breast Pool | 4.7 | Thymus Pool | 11.5
Trachea | 6.7 | CNS cancer (glio/astro) U87-MG | 15.4
Lung | 0.9 | CNS cancer (glio/astro) U-118-MG | 
Fetal Lung | 34.4 | CNS cancer (neuro;met) SK-N-AS | 16.3
Lung ca. NCI-N417 | 3.2 | CNS cancer (astro) SF-539 | 5.8
Lung ca. LX-1 | 27.5 | CNS cancer (astro) SNB-75 | 14.5
Lung ca. NCI-H146 | 5.0 | CNS cancer (glio) SNB-19 | 8.9
Lung ca. SHP-77 | 12.2 | CNS cancer (glio) SF-295 | 18.3
Lung ca. A549 | 14.3 | Brain (Amygdala) Pool | 4.5
Lung ca. NCI-H526 | 2.3 | Brain (cerebellum) | 4.2
Lung ca. NCI-H23 | 10.7 | Brain (fetal) | 8.1
Lung ca. NCI-H460 | 9.9 | Brain (Hippocampus) Pool | 5.5
Lung ca. HOP-62 | 6.8 | Cerebral Cortex Pool | 7.6
Lung ca. NCI-H522 | 7.3 | Brain (Substantia nigra) Pool | 8.0
Liver | 0.5 | Brain (Thalamus) Pool | 8.8
Fetal Liver | 100.0 | Brain (whole) | 9.3
Liver ca. HepG2 | 5.0 | Spinal Cord Pool | 8.3
Kidney Pool | 7.6 | Adrenal Gland | 4.2
Fetal Kidney | 7.9 | Pituitary gland Pool | 2.3
Renal ca. 786-0 | 15.9 | Salivary Gland | 1.6
Renal ca. A498 | 2.3 | Thyroid (female) | 3.5
Renal ca. ACHN | 20.7 | Pancreatic ca. CAPAN2 | 16.8
Renal ca. UO-31 | 8.3 | Pancreas Pool | 6.0


Table AID. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 
171616960 


Tissue Name 


Rel. Exp.(%) 
Ag4043, Run 
171616960 


Secondary Th1 act


19.2


HUVEC IL-1beta


9.2 


Secondary Th2 act 


25.5 


HUVEC IFN gamma


8.9


Secondary Tr1 act


14.3


HUVEC TNF alpha + IFN
gamma


9.3


Secondary Th1 rest


5.2 


HUVEC TNF alpha + IL4 


5.3 
Secondary Th2 rest


4.9


HUVEC IL-11


3.4


Secondary Tr1 rest


8.5 


Lung Microvascular EC none 




Primary Th1 act


10.7 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


3.5 


Primary Th2 act 


12.8 


Microvascular Dermal EC none 


5.1 


Primary Tr1 act


16.6


Microvascular Dermal EC
TNFalpha + IL-1 beta


3.6


Primary Th1 rest


Bronchial epithelium TNFalpha
+ IL-1beta


6.4

3.2


Primary Th2 rest


2.5


Small airway epithelium none


Primary Tr1 rest


5.5 


Small airway epithelium 
TNFalpha + IL-1 beta 


9.7 


CD45RA CD4 lymphocyte 
act 


11.1 


Coronary artery SMC rest


3.0 


CD45RO CD4 lymphocyte 
act 


20.7 


Coronary artery SMC TNFalpha
+ IL-1 beta 


4.9 


CD8 lymphocyte act 


12.8 


Astrocytes rest 


3.2 


Secondary CD8 lymphocyte 
rest 


11.4


Astrocytes TNFalpha + IL-1 beta


3.3


Secondary CD8 lymphocyte
act


8.0 


KU-812 (Basophil) rest 


51.4 


CD4 lymphocyte none 


6.0 


KU-812 (Basophil)
PMA/ionomycin


100.0


2ry Th1/Th2/Tr1 anti-
CD95 CH11


11.6


CCD1106 (Keratinocytes) none


13.6 


LAK cells rest 


10.4 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1 beta 


6.7 


LAK cells IL-2 


14.9 


Liver cirrhosis 


5.6 


LAK cells IL-2+IL-12 


11.8 


NCI-H292 none 


13.9 


LAK cells IL-2+IFN 
gamma 


5.1 


NCI-H292 IL-4 


14.5 


LAK cells IL-2+ IL-18 


6.7 


NCI-H292 IL-9 


15.6 


LAK cells PMA/ionomycin 


10.7 


NCI-H292 IL-13


14.0 


NK Cells IL-2 rest 


12.2 


NCI-H292 IFN gamma 


13.3 


Two Way MLR 3 day 


10.1 


HPAEC none 


3.1 


Two Way MLR 5 day 


8.6 


HPAEC TNF alpha + IL-1 beta


7.4 


Two Way MLR 7 day 


8.2 


Lung fibroblast none 


3.7 


PBMC rest 


6.7 


Lung fibroblast TNF alpha + IL-1 beta


2.8 


PBMC PWM 


10.0 


Lung fibroblast IL-4 


5.1 


PBMC PHA-L 


5.8


Lung fibroblast IL-9 


6.7 




Ramos (B cell) none 


18.3 


Lung fibroblast IL-13




Ramos (B cell) ionomycin 


17.8 


Lung fibroblast IFN gamma 


6.3 


B lymphocytes PWM 


10.7 


Dermal fibroblast CCD1070 rest


7.0


B lymphocytes CD40L and
IL-4


10.4


Dermal fibroblast CCD1070
TNF alpha


12.9


EOL-1 dbcAMP


5.7


Dermal fibroblast CCD1070 IL-1 beta


8.4


EOL-1 dbcAMP
PMA/ionomycin


4.8


Dermal fibroblast IFN gamma 


J ' 8 


Dendritic cells none 


8.0 


Dermal fibroblast IL-4 


7.6 


Dendritic cells LPS 


6.6 


Dermal Fibroblasts rest 


5.2 


Dendritic cells anti-CD40 


11.1 


Neutrophils TNFa+LPS 


4.1 


Monocytes rest 


17.1 


Neutrophils rest 


12.2 


Monocytes LPS 


9.1


Colon 


1.6 


Macrophages rest 


6.9 


Lung 


3.5 


Macrophages LPS 


4.2 


Thymus 


51.1 


HUVEC none

HUVEC starved


3.9


Kidney


14.8 


Table AIE. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4043, Run 268362932 | Tissue Name | Rel. Exp.(%) Ag4043, Run 268362932
Colon cancer 1 | 22.2 | Bladder cancer NAT 2 | 0.4
Colon NAT 1 | 5.1 | Bladder cancer NAT 3 | 0.6
Colon cancer 2 | 29.7 | Bladder cancer NAT 4 | 11.6
Colon cancer NAT 2 | 7.9 | Adenocarcinoma of the prostate 1 | 20.0
Colon cancer 3 | 67.8 | Adenocarcinoma of the prostate 2 | 2.8
Colon cancer NAT 3 | 17.6 | Adenocarcinoma of the prostate 3 | 9.2
Colon malignant cancer 4 | 91.4 | Adenocarcinoma of the prostate 4 | 95.9
Colon normal adjacent tissue 4 | 5.3 | Prostate cancer NAT 5 | 3.3
Lung cancer 1 | 9.7 | Adenocarcinoma of the prostate 6 | 4.2
Lung NAT 1 | 3.8 | Adenocarcinoma of the prostate 7 | 3.8
Lung cancer 2 | 100.0 | Adenocarcinoma of the prostate 8 | 0.6
Lung NAT 2 | 22.5 | Adenocarcinoma of the prostate 9 | 19.1
Squamous cell carcinoma 3 | 42.3 | Prostate cancer NAT 10 | 1.8
Lung NAT 3 | 1.1 | Kidney cancer 1 | 17.1
metastatic melanoma 1 | 12.9 | Kidney NAT 1 | 8.8
Melanoma 2 | 11.7 | Kidney cancer 2 | 74.2
Melanoma 3 | 18.2 | Kidney NAT 2 | 20.0
metastatic melanoma 4 | 47.6 | Kidney cancer 3 | 43.2
metastatic melanoma 5 | 42.9 | Kidney NAT 3 | 8.4
Bladder cancer 1 | 2.6 | Kidney cancer 4 | 14.6
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 10.9
Bladder cancer 2 | 12.2 |  | 





CNS_neurodegeneration_v1.0 Summary: Ag4043 This panel does not show
differential expression of the CG96231-01 gene in Alzheimer's disease. However, this
expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for
discussion of utility of this gene in the central nervous system.

General_screening_panel_v1.4 Summary: Ag4043 Expression of the CG96231-01
gene is highest in fetal liver (CT=25.6). This gene is also expressed at moderate levels
in other metabolic tissues, including pancreas, thyroid, adrenal, pituitary, adipose, and
adult and fetal heart and skeletal muscle. This widespread expression among these tissues
suggests that this gene product may play a role in normal neuroendocrine and metabolic
function and that dysregulated expression of this gene may contribute to neuroendocrine
disorders or metabolic diseases, such as obesity and diabetes.

In addition, this gene is expressed at much higher levels in fetal liver when
compared to expression in the adult counterpart (CT=33.1). Thus, expression of this gene
may be used to differentiate between the fetal and adult source of this tissue. In addition,
the relative overexpression of this gene in fetal liver suggests that the protein product may
act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the protein
encoded by this gene could be useful in treatment of liver disease.
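
The two liver CT values quoted above can be related to the percentage scale used in the expression tables. The sketch below assumes that relative expression is normalized to the highest-expressing sample in a panel (set to 100) and that template doubles every cycle; this normalization and the function name `relative_expression` are assumptions made for illustration, not statements of the method actually used.

```python
def relative_expression(ct_sample: float, ct_highest: float,
                        efficiency: float = 2.0) -> float:
    """Percent expression relative to the highest-expressing sample.

    `ct_highest` is the CT of the sample assigned 100%. Each additional
    cycle needed by `ct_sample` divides its estimate by `efficiency`
    (2.0 is an assumed perfect doubling per cycle).
    """
    return 100.0 * efficiency ** (ct_highest - ct_sample)


fetal_liver_ct = 25.6   # highest-expressing sample on the panel
adult_liver_ct = 33.1

adult_percent = relative_expression(adult_liver_ct, fetal_liver_ct)
print(f"adult liver: ~{adult_percent:.2f}% of fetal liver")
```

Under these assumptions the adult liver value comes out at roughly 0.55%, which is close to the 0.5 listed for Liver in Table AIC.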

This gene is also expressed at moderate levels in the CNS, including the
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex.
Therefore, therapeutic modulation of the expression or function of this gene may be useful
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease,
schizophrenia, multiple sclerosis, stroke and epilepsy.

Expression is also slightly higher in cell lines derived from gastric, breast, ovarian, 
and prostate cancers. Overall, this gene is ubiquitously expressed in this panel suggesting a 
role for this gene product in cell growth and/or proliferation. 

Panel 4.1D Summary: Ag4043 Expression of the CG96231-01 gene is highest in
the KU-812 basophil cell line treated with PMA/ionomycin (CT=27.2), with ubiquitous
expression in this panel. Basophils release histamines and other biological modifiers in
response to allergens and play an important role in the pathology of asthma and
hypersensitivity reactions. Therefore, therapeutics designed against the putative protein
encoded by this gene may reduce or inhibit inflammation by blocking basophil function in
these diseases. In addition, these cells are a reasonable model for the inflammatory cells
that take part in various inflammatory lung and bowel diseases, such as asthma, Crohn's
disease, and ulcerative colitis. Therefore, therapeutics that modulate the function of this
gene product may reduce or eliminate the symptoms of patients suffering from asthma,
Crohn's disease, and ulcerative colitis.

General oncology screening panel_v_2.4 Summary: Ag4043 Highest 
expression of the CG96231-01 gene on this panel is seen in lung cancer (CT=28.9). In 
addition, this gene is overexpressed in lung and colon cancer when compared to 
expression in the normal adjacent tissue. Thus, expression of this gene could be used as a 
marker of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of lung and colon cancer. 
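
The tumor versus normal adjacent tissue (NAT) comparison can be made concrete with the values listed in Table AIE. The sketch below assumes that a tumor sample and a NAT sample carrying the same number come from the same patient; that pairing, the dictionary layout, and the helper name `tumor_to_nat_ratios` are illustrative assumptions. Only clearly legible table entries are used.

```python
# Sample pair -> (tumor Rel. Exp. %, NAT Rel. Exp. %), values from Table AIE
AG4043_PAIRS = {
    "Colon 1": (22.2, 5.1),
    "Colon 2": (29.7, 7.9),
    "Colon 3": (67.8, 17.6),
    "Colon 4": (91.4, 5.3),
    "Lung 1": (9.7, 3.8),
}


def tumor_to_nat_ratios(pairs):
    """Return the tumor/NAT expression ratio for each assumed matched pair."""
    return {name: tumor / nat for name, (tumor, nat) in pairs.items()}


ratios = tumor_to_nat_ratios(AG4043_PAIRS)
for name, ratio in ratios.items():
    print(f"{name}: tumor/NAT = {ratio:.1f}")

# True when every listed tumor sample exceeds its NAT counterpart.
print(all(ratio > 1.0 for ratio in ratios.values()))
```

Every pair used here gives a ratio above 1, in line with the overexpression described in the summary; the ratios are descriptive only and carry no statistical weight with so few samples.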

AJ. CG96260-01: Phospholipase A2 

Expression of gene CG96260-01 was assessed using the primer-probe set Ag4048,
described in Table AJA. Results of the RTQ-PCR runs are shown in Tables AJB, AJC and
AJD.



Table AJA. Probe Name Ag4048

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-atgaaggtcattgccatcct-3' | 20 | 85 | 338
Probe | TET-5'-cacccacagcagtttctggcagtt-3'-TAMRA | 24 | 129 | 339
Reverse | 5'-atgtgtttgaccctcctctga-3' | 21 | 153 | 340

Table AJB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4048, Run 214292262 | Tissue Name | Rel. Exp.(%) Ag4048, Run 214292262
AD 1 Hippo | 0.0 | Control (Path) 3 Temporal Ctx | 30.6
AD 2 Hippo | 30.1 | Control (Path) 4 Temporal Ctx | 61.1
AD 3 Hippo | 0.0 | AD 1 Occipital Ctx | 37.1
AD 4 Hippo | 0.0 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 0.0 | AD 3 Occipital Ctx | 0.0
AD 6 Hippo | 47.6 | AD 4 Occipital Ctx | 0.0
Control 2 Hippo | 0.0 | AD 5 Occipital Ctx | 0.0
Control 4 Hippo | 34.4 | AD 6 Occipital Ctx | 0.0
Control (Path) 3 Hippo | 50.0 | Control 1 Occipital Ctx | 40.3
AD 1 Temporal Ctx | 0.0 | Control 2 Occipital Ctx | 24.8
AD 2 Temporal Ctx | 59.5 | Control 3 Occipital Ctx | 0.0
AD 3 Temporal Ctx | 0.0 | Control 4 Occipital Ctx | 0.0
AD 4 Temporal Ctx | 0.0 | Control (Path) 1 Occipital Ctx | [illegible]
AD 5 Inf Temporal Ctx | 0.0 | Control (Path) 2 Occipital Ctx | 0.0
AD 5 Sup Temporal Ctx | 0.0 | Control (Path) 3 Occipital Ctx | 0.0
AD 6 Inf Temporal Ctx | 0.0 | Control (Path) 4 Occipital Ctx | 0.0
AD 6 Sup Temporal Ctx | 59.9 | Control 1 Parietal Ctx | 84.7
Control 1 Temporal Ctx | 0.0 | Control 2 Parietal Ctx | 28.7
Control 2 Temporal Ctx | 100.0 | Control 3 Parietal Ctx | 0.0
Control 3 Temporal Ctx | 0.0 | Control (Path) 1 Parietal Ctx | 18.7
Control 3 Temporal Ctx | 27.0 | Control (Path) 2 Parietal Ctx | 60.3
Control (Path) 1 Temporal Ctx | 3.0 | Control (Path) 3 Parietal Ctx | 0.0
Control (Path) 2 Temporal Ctx | 15.4 | Control (Path) 4 Parietal Ctx | 0.0


Table AJC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4048, Run 218535022 | Tissue Name | Rel. Exp.(%) Ag4048, Run 218535022
Adipose | 9.3 | Renal ca. TK-10 | 0.0
Melanoma* Hs688(A).T | 0.0 | Bladder | 3.0
Melanoma* Hs688(B).T | 0.0 | Gastric ca. (liver met.) NCI-N87 | 3.0
Melanoma* M14 | 0.0 | Gastric ca. KATO III | [illegible]
Melanoma* LOXIMVI | 0.0 | Colon ca. SW-948 | 3.0
Melanoma* SK-MEL-5 | [illegible] | Colon ca. SW480 | 23.0
Squamous cell carcinoma SCC-4 | 0.0 | Colon ca.* (SW480 met) SW620 | 0.0
Testis Pool | 65.1 | Colon ca. HT29 | 0.0
Prostate ca.* (bone met) PC-3 | 0.0 | Colon ca. HCT-116 | 0.0
Prostate Pool | 52.9 | Colon ca. CaCo-2 | [illegible]
Placenta | 0.0 | Colon cancer tissue | 0.0
Uterus Pool | 0.0 | Colon ca. SW1116 | 0.0
Ovarian ca. OVCAR-3 | 0.0 | Colon ca. Colo-205 | 25.5
Ovarian ca. SK-OV-3 | 0.0 | Colon ca. SW-48 | 0.0
Ovarian ca. OVCAR-4 | 0.0 | Colon Pool | 51.4
Ovarian ca. OVCAR-5 | 0.0 | Small Intestine Pool | 92.0
Ovarian ca. IGROV-1 | 0.0 | Stomach Pool | 19.2
Ovarian ca. OVCAR-8 | 0.0 | Bone Marrow Pool | 12.2
Ovary | 9.9 | Fetal Heart | 0.0
Breast ca. MCF-7 | 0.0 | Heart Pool | 0.0
Breast ca. MDA-MB-231 | 0.0 | Lymph Node Pool | 35.6
Breast ca. BT 549 | 0.0 | Fetal Skeletal Muscle | 0.0
Breast ca. T47D | 0.0 | Skeletal Muscle Pool | 0.0
Breast ca. MDA-N | 0.0 | Spleen Pool | 0.0
Breast Pool | 55.9 | Thymus Pool | 32.3
Trachea | 22.7 | CNS cancer (glio/astro) U87-MG | 0.0
Lung | 0.0 | CNS cancer (glio/astro) U-118-MG | 0.0
Fetal Lung | 27.9 | CNS cancer (neuro;met) SK-N-AS | 0.0
Lung ca. NCI-N417 | 0.0 | CNS cancer (astro) SF-539 | 0.0
Lung ca. LX-1 | 4.0 | CNS cancer (astro) SNB-75 | 0.0
Lung ca. NCI-H146 | 0.0 | CNS cancer (glio) SNB-19 | 0.0
Lung ca. SHP-77 | 0.0 | CNS cancer (glio) SF-295 | 0.0
Lung ca. A549 | 0.0 | Brain (Amygdala) Pool | 0.0
Lung ca. NCI-H526 | 0.0 | Brain (cerebellum) | 15.4
Lung ca. NCI-H23 | 0.0 | Brain (fetal) | 0.0
Lung ca. NCI-H460 | 0.0 | Brain (Hippocampus) Pool | 22.5
Lung ca. HOP-62 | 7.2 | Cerebral Cortex Pool | 0.0
Lung ca. NCI-H522 | 0.0 | Brain (Substantia nigra) Pool | 0.0
Liver | 0.0 | Brain (Thalamus) Pool | 0.0
Fetal Liver | 0.0 | Brain (whole) | 0.0
Liver ca. HepG2 | 0.0 | Spinal Cord Pool | 48.3
Kidney Pool | 62.9 | Adrenal Gland | 0.0
Fetal Kidney | 0.0 | Pituitary gland Pool | 0.0
Renal ca. 786-0 | 0.0 | Salivary Gland | 0.0
Renal ca. A498 | 100.0 | Thyroid (female) | 0.0
Renal ca. ACHN | 0.0 | Pancreatic ca. CAPAN2 | 11.3
Renal ca. UO-31 | 0.0 | Pancreas Pool | 49.3


Table AJD. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4048, Run 268362939 | Tissue Name | Rel. Exp.(%) Ag4048, Run 268362939
Colon cancer 1 | 0.0 | Bladder cancer NAT 2 | 3.9
Colon cancer NAT 1 | 16.2 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 41.8 | Bladder cancer NAT 4 | 0.0
Colon cancer NAT 2 | 39.8 | Adenocarcinoma of the prostate 1 | 76.3
Colon cancer 3 | 4.6 | Adenocarcinoma of the prostate 2 | 20.7
Colon cancer NAT 3 | 20.6 | Adenocarcinoma of the prostate 3 | 0.0
Colon malignant cancer 4 | 0.0 | Adenocarcinoma of the prostate 4 | 0.0
Colon normal adjacent tissue 4 | 5.8 | Prostate cancer NAT 5 | 5.5
Lung cancer 1 | 0.0 | Adenocarcinoma of the prostate 6 | 5.4
Lung NAT 1 |  | Adenocarcinoma of the prostate 7 | 19.9
Lung cancer 2 | 0.0 | Adenocarcinoma of the prostate 8 | 0.0
Lung NAT 2 | 0.0 | Adenocarcinoma of the prostate 9 | 100.0
Squamous cell carcinoma 3 | 0.0 | Prostate cancer NAT 10 | 0.0
Lung NAT 3 | 0.0 | Kidney cancer 1 | 1.9
metastatic melanoma 1 | 0.0 | Kidney NAT 1 | 0.0
Melanoma 2 | 0.0 | Kidney cancer 2 | 21.6
Melanoma 3 | 7.7 | Kidney NAT 2 | 0.0
metastatic melanoma 4 | 6.0 | Kidney cancer 3 | 0.0
metastatic melanoma 5 | 5.6 | Kidney NAT 3 | 0.0
Bladder cancer 1 | 4.4 | Kidney cancer 4 | 0.0
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 0.0
Bladder cancer 2 | 0.0 |  | 







CNS_neurodegeneration_v1.0 Summary: Ag4048 This panel confirms the
expression of the CG96260-01 gene at low levels in the temporal cortex of brain
(CT=34.3). However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. The CG96260-01 gene codes for a homolog of rat phospholipase A2
(PLA(2)). Mice deficient in cytosolic PLA(2), a member of the PLA(2) family, have been
shown to suffer smaller infarcts and fewer neurological deficits after transient occlusion of
the middle cerebral artery and to have less injury after administration of a dopaminergic
selective neurotoxin (Sapirstein A, and Bonventre JV, 2000, Biochim Biophys Acta 2000
Oct 31;1488(1-2):139-48, PMID: 11080683). Thus, PLA(2) encoded by this gene could
also play a role in inflammation and injuries of brain, and pharmacological targeting of this
enzyme may have important therapeutic benefits.

General_screening_panel_v1.4 Summary: Ag4048 Highest expression of the
CG96260-01 gene is detected in the renal cancer A498 cell line (CT=33). Therefore,
therapeutic modulation of this gene product may be beneficial in the treatment of renal
cancer.
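
The CT value quoted above and the Rel. Exp.(%) columns of the tables can be related by a short calculation. The sketch below assumes, as is common for RTQ-PCR data, that one cycle of difference in CT corresponds to a two-fold difference in template and that values are scaled so that the sample with the lowest CT is 100%. This normalization is an assumption made for illustration and is not stated in this section; the function names are hypothetical, and every sample other than Renal ca. A498, together with its CT value, is invented for the example.

```python
import math


def relative_expression(ct_by_sample):
    """Return percent expression relative to the sample with the lowest CT.

    Assumes a two-fold change in template per PCR cycle (illustrative only).
    """
    ct_min = min(ct_by_sample.values())
    return {
        name: 100.0 * 2.0 ** (ct_min - ct)
        for name, ct in ct_by_sample.items()
    }


def delta_ct_from_percent(rel_exp_percent):
    """Invert the relation: cycles above the lowest CT for a given Rel. Exp.(%)."""
    return math.log2(100.0 / rel_exp_percent)


# CT=33 for Renal ca. A498 is taken from the summary; the other two
# samples and their CT values are hypothetical.
example_cts = {
    "Renal ca. A498": 33.0,
    "hypothetical sample B": 34.0,
    "hypothetical sample C": 36.0,
}
example_rel = relative_expression(example_cts)
```

Under these assumptions, a sample reported at 49.3% (the Pancreas Pool entry for Ag4048) would lie roughly one cycle above the lowest CT on the panel.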

In addition, moderate to low expression of this gene is seen in samples derived
from normal tissues including testis, prostate, breast, kidney, lymph node, thymus and
spinal cord. Therefore, therapeutic modulation of this gene product may be useful in
treatment of diseases associated with these tissues.

Among tissues with metabolic or endocrine function, this gene is expressed at
moderate to low levels in pancreas and the gastrointestinal tract. Therefore, therapeutic
modulation of the activity of this gene may prove useful in the treatment of
endocrine/metabolically related diseases, such as obesity and diabetes.

The CG96260-01 gene codes for a homolog of rat phospholipase A2 (PLA(2)).
PLA(2) has been implicated in the pathogenesis and pathophysiology of acute pancreatitis
(Nevalainen et al., 1999, Hepatogastroenterology 46(29):2731-5, PMID: 10576338). In
addition, in blood plasma, PLA(2) modifies the circulating lipoproteins and so induces
formation of small dense LDL particles, which are associated with increased risk for
cardiovascular disease (Hurt-Camejo et al., 2001, Circ Res 89(4):298-304, PMID:
11509445). Furthermore, the cytoplasmic PLA(2) knockout mouse has revealed important
roles of cPLA(2) in normal fertility, generation of eicosanoids from inflammatory cells,
brain injuries and allergic responses (Sapirstein A, and Bonventre JV, 2000, Biochim
Biophys Acta 2000 Oct 31;1488(1-2):139-48, PMID: 11080683). Therefore, therapeutic
modulation of PLA(2) encoded by this gene may be useful in the treatment of pancreatitis,
neurological disorders, allergies and cardiovascular diseases.

Panel 4.1D Summary: Ag4048 Expression of the CG96260-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

Panel CNS1 Summary: Ag4048 Expression of the CG96260-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

General oncology screening panel_v_2.4 Summary: Ag4048 Highest
expression of the CG96260-01 gene is detected in prostate adenocarcinoma samples
(CTs=33). In addition, low expression of this gene is also seen in a single colon cancer and
the adjacent normal tissue sample. The CG96260-01 gene codes for a homolog of rat
phospholipase A2 (PLA(2)). Phospholipase A2 has recently been recognized to be
involved in a wide number of pathophysiological situations, ranging from systemic and
acute inflammatory conditions to cancer (Balsinde et al., 1999, Annu Rev Pharmacol
Toxicol 39:175-89, PMID: 10331081). Therefore, therapeutic modulation of this gene
product through the use of a small molecule drug may be useful in the treatment of colon
and prostate cancers. [mpattu, 29-Mar-02]

AK. CG96364-01: ADP/ATP Translocase 2 Like protein

Expression of gene CG96364-01 was assessed using the primer-probe set Ag4073,
described in Table AKA. Results of the RTQ-PCR runs are shown in Tables AKB, AKC,
AKD, AKE and AKF.
Table AKA. Probe Name Ag4073

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tcctcagatacttccccacc-3' | 20 | 263 | 341
Probe | TET-5'-aacacaagcagatcttcctgggtggt-3'-TAMRA | 26 | 311 | 342
Reverse | 5'-aactgggtcctcttgtccac-3' | 20 | 337 | 343
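
The Length column of Table AKA can be cross-checked directly against the listed sequences. The following is a minimal sketch, not part of the original disclosure: the helper names are hypothetical, GC content is an extra descriptive statistic that the table does not report, and the computed end positions assume that all three start positions are counted on the same strand with each oligo extending rightward from its start.

```python
def gc_percent(seq):
    """Percentage of G and C bases in an oligonucleotide sequence."""
    seq = seq.lower()
    return 100.0 * (seq.count("g") + seq.count("c")) / len(seq)


# Sequences and start positions as listed for Ag4073 in Table AKA.
AG4073 = {
    "Forward": ("tcctcagatacttccccacc", 263),
    "Probe": ("aacacaagcagatcttcctgggtggt", 311),
    "Reverse": ("aactgggtcctcttgtccac", 337),
}


def summarize(oligos):
    """Compute length, end position and GC content for each oligo.

    The end position assumes (hypothetically) that every start position is
    counted on the same strand and that each oligo extends rightward from it.
    """
    summary = {}
    for name, (seq, start) in oligos.items():
        summary[name] = {
            "length": len(seq),
            "start": start,
            "end": start + len(seq) - 1,
            "gc_percent": round(gc_percent(seq), 1),
        }
    return summary


ag4073_summary = summarize(AG4073)
```

With these inputs the probe is 26 bases long and, under the stated coordinate assumption, ends at position 336, immediately before the listed start of the reverse primer.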



Table AKB. AI_comprehensive panel_v1.0

Tissue Name | Rel. Exp.(%) Ag4073, Run 248081917 | Tissue Name | Rel. Exp.(%) Ag4073, Run 248081917
110967 COPD-F | 11.3 | 112427 Match Control Psoriasis-F | 40.6
110980 COPD-F | 8.1 | 112418 Psoriasis-M | 17.6
110968 COPD-M | 12.1 | 112723 Match Control Psoriasis-M | 8.8
110977 COPD-M | 20.9 | 112419 Psoriasis-M | 24.1
110989 Emphysema-F | 39.0 | 112421 Match Control Psoriasis-M | 14.7
110992 Emphysema-F | 16.4 | 112420 Psoriasis-M | 34.9
110993 Emphysema-F | 12.3 | 112425 Match Control Psoriasis-M | 27.0
110994 Emphysema-F | 5.9 | 104689 (MF) OA Bone-Backus | 31.0
110995 Emphysema-F | 39.5 | 104690 (MF) Adj "Normal" Bone-Backus | 8.4
110996 Emphysema-F | 8.5 | 104691 (MF) OA Synovium-Backus | 26.2
110997 Asthma-M | 7.6 | 104692 (BA) OA Cartilage-Backus | 18.9
111001 Asthma-F | 18.7 | 104694 (BA) OA Bone-Backus | 38.7
111002 Asthma-F | 24.7 | 104695 (BA) Adj "Normal" Bone-Backus | 17.6
111003 Atopic Asthma-F | 29.1 | 104696 (BA) OA Synovium-Backus | 32.3
111004 Atopic Asthma-F | 40.6 | 104700 (SS) OA Bone-Backus | 18.2
111005 Atopic Asthma-F | 20.6 | 104701 (SS) Adj "Normal" Bone-Backus | 19.3
111006 Atopic Asthma-F | 6.2 | 104702 (SS) OA Synovium-Backus | 34.2
111417 Allergy-M | 13.0 | 117093 OA Cartilage Rep7 | 11.6
112347 Allergy-M | 0.2 | 112672 OA Bone5 | 27.4
112349 Normal Lung-F | 0.2 | 112673 OA Synovium5 | 12.3
112357 Normal Lung-F | 26.6 | 112674 OA Synovial Fluid cells5 | 12.6
112354 Normal Lung-M | 12.4 | 117100 OA Cartilage Rep14 | 8.1
112374 Crohns-F | 19.3 | 112756 OA Bone9 | 100.0
112389 Match Control Crohns-F | 21.6 | 112757 OA Synovium9 | 3.8
112375 Crohns-F | 15.5 | 112758 OA Synovial Fluid Cells9 |
112732 Match Control Crohns-F | 42.6 | 117125 RA Cartilage Rep2 | 15.3
112725 Crohns-M | 7.5 | 113492 Bone2 RA |
112387 Match Control Crohns-M | 13.3 | 113493 Synovium2 RA | 6.0
112378 Crohns-M | 0.1 | |
112390 Match Control Crohns-M | 23.3 | 113499 Cartilage4 RA | 23.5
112726 Crohns-M | 24.0 | 113500 Bone4 RA | 22.4
112731 Match Control Crohns-M | 11.8 | 113501 Synovium4 RA | 17.2
112380 Ulcer Col-F | 14.4 | 113502 Syn Fluid Cells4 RA | 10.4
112734 Match Control Ulcer Col-F | 71.7 | 113495 Cartilage3 RA | 3.4
112384 Ulcer Col-F | 33.7 | 113496 Bone3 RA | 3.3
112737 Match Control Ulcer Col-F | 9.9 | 113497 Synovium3 RA | 7.1
112386 Ulcer Col-F | 1.6 | 113498 Syn Fluid Cells3 RA | 16.0
112738 Match Control Ulcer Col-F | 38.4 | 117106 Normal Cartilage Rep20 | 5.3
112381 Ulcer Col-M | 0.3 | 113663 Bone3 Normal | 0.6
112735 Match Control Ulcer Col-M | 9.3 | 113664 Synovium3 Normal | 0.0
112382 Ulcer Col-M | 26.4 | 113665 Syn Fluid Cells3 Normal | 0.3
112394 Match Control Ulcer Col-M | 3.6 | 117107 Normal Cartilage Rep22 | 3.0
112383 Ulcer Col-M | 19.5 | 113667 Bone4 Normal | 6.1
112736 Match Control Ulcer Col-M | 14.2 | 113668 Synovium4 Normal | 10.2
112423 Psoriasis-F | 19.3 | 113669 Syn Fluid Cells4 Normal | 13.6



Table AKC. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4073, Run 214294826 | Tissue Name | Rel. Exp.(%) Ag4073, Run 214294826
AD 1 Hippo | 8.4 | Control (Path) 3 Temporal Ctx | 3.8
AD 2 Hippo | 18.3 | Control (Path) 4 Temporal Ctx | 14.2
AD 3 Hippo | 2.1 | AD 1 Occipital Ctx | 8.9
AD 4 Hippo | 3.4 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 100.0 | AD 3 Occipital Ctx | 3.9
AD 6 Hippo | 28.1 | AD 4 Occipital Ctx | 12.4
Control 2 Hippo | 12.9 | AD 5 Occipital Ctx | 25.2
Control 4 Hippo | 6.5 | AD 6 Occipital Ctx | 77.4
Control (Path) 3 Hippo | 3.2 | Control 1 Occipital Ctx | 3.3
AD 1 Temporal Ctx | 3.5 | Control 2 Occipital Ctx | 58.2
AD 2 Temporal Ctx | 17.4 | Control 3 Occipital Ctx | 12.8
AD 3 Temporal Ctx | 2.3 | Control 4 Occipital Ctx | 3.9
AD 4 Temporal Ctx | 11.1 | Control (Path) 1 Occipital Ctx | 49.3
AD 5 Inf Temporal Ctx | 73.7 | Control (Path) 2 Occipital Ctx | 6.5
AD 5 Sup Temporal Ctx | 37.9 | Control (Path) 3 Occipital Ctx | 1.6
AD 6 Inf Temporal Ctx | 27.0 | Control (Path) 4 Occipital Ctx | 8.6
AD 6 Sup Temporal Ctx | 28.9 | Control 1 Parietal Ctx | 3.4
Control 1 Temporal Ctx | 4.1 | Control 2 Parietal Ctx | 26.8
Control 2 Temporal Ctx | 29.5 | Control 3 Parietal Ctx | 14.5
Control 3 Temporal Ctx | t.4 | Control (Path) 1 Parietal Ctx | 56.0
Control 4 Temporal Ctx | 6.3 | Control (Path) 2 Parietal Ctx | 15.7
Control (Path) 1 Temporal Ctx | 26.6 | Control (Path) 3 Parietal Ctx | 3.7
Control (Path) 2 Temporal Ctx | 26.4 | Control (Path) 4 Parietal Ctx | 26.8


Table AKD. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4073, Run 218906344 | Tissue Name | Rel. Exp.(%) Ag4073, Run 218906344
Adipose | 1.4 | Renal ca. TK-10 | 21.0
Melanoma* Hs688(A).T | 5.9 | Bladder | 11.8
Melanoma* Hs688(B).T | 6.7 | Gastric ca. (liver met.) NCI-N87 | 22.5
Melanoma* M14 | 72.7 | Gastric ca. KATO III | 100.0
Melanoma* LOXIMVI | 34.6 | Colon ca. SW-948 |
Melanoma* SK-MEL-5 | 67.8 | Colon ca. SW480 |
Squamous cell carcinoma SCC-4 | 22.7 | Colon ca.* (SW480 met) SW620 | 66.0
Testis Pool | 2.0 | Colon ca. HT29 | 34.6
Prostate ca.* (bone met) PC-3 | 40.9 | Colon ca. HCT-116 | 68.3
Prostate Pool | 1.2 | Colon ca. CaCo-2 | 24.3
Placenta | 3.1 | Colon cancer tissue | 35.1
Uterus Pool | 0.4 | Colon ca. SW1116 | 7.5
Ovarian ca. OVCAR-3 | 36.6 | Colon ca. Colo-205 | 23.8
Ovarian ca. SK-OV-3 | 32.8 | Colon ca. SW-48 | 25.9
Ovarian ca. OVCAR-4 | 37.9 | Colon Pool | 3.2
Ovarian ca. OVCAR-5 | 24.1 | Small Intestine Pool | 1.7
Ovarian ca. IGROV-1 | 18.9 | Stomach Pool | 2.7
Ovarian ca. OVCAR-8 | 14.2 | Bone Marrow Pool | 0.8
Ovary | 2.5 | Fetal Heart | 4.2
Breast ca. MCF-7 | 39.2 | Heart Pool | 1.5
Breast ca. MDA-MB-231 | 39.2 | Lymph Node Pool | 3.4
Breast ca. BT 549 | 52.1 | Fetal Skeletal Muscle | 3.9
Breast ca. T47D | 39.8 | Skeletal Muscle Pool | .2
Breast ca. MDA-N | 52.9 | Spleen Pool | 2.4
Breast Pool | -.7 | Thymus Pool | .6
Trachea | | CNS cancer (glio/astro) U87-MG | 2.1
Lung | | CNS cancer (glio/astro) U-118-MG | 27.2
Fetal Lung | 4.9 | CNS cancer (neuro;met) SK-N-AS | 25.7
Lung ca. NCI-N417 | 17.4 | CNS cancer (astro) SF-539 | 31.4
Lung ca. LX-1 | 48.3 | CNS cancer (astro) SNB-75 | 50.3
Lung ca. NCI-H146 | 11.6 | CNS cancer (glio) SNB-19 | 20.7
Lung ca. SHP-77 | 30.6 | CNS cancer (glio) SF-295 | 16.5
Lung ca. A549 | 52.1 | Brain (Amygdala) Pool | 3.8
Lung ca. NCI-H526 | 12.4 | Brain (cerebellum) | 6.7
Lung ca. NCI-H23 | 31.6 | Brain (fetal) | 3.2
Lung ca. NCI-H460 | 15.3 | Brain (Hippocampus) Pool | 4.2
Lung ca. HOP-62 | 11.8 | Cerebral Cortex Pool | 5.1
Lung ca. NCI-H522 | 17.7 | Brain (Substantia nigra) Pool | 6.6
Liver | 2.0 | Brain (Thalamus) Pool | 6.3
Fetal Liver | 8.0 | Brain (whole) | 4.1
Liver ca. HepG2 | 21.8 | Spinal Cord Pool | 2.6
Kidney Pool | 3.0 | Adrenal Gland | 4.6
Fetal Kidney | 3.3 | Pituitary gland Pool | 0.8
Renal ca. 786-0 | 15.2 | Salivary Gland | 4.4
Renal ca. A498 | 5.7 | Thyroid (female) | 2.1
Renal ca. ACHN | 18.9 | Pancreatic ca. CAPAN2 | 30.4
Renal ca. UO-31 | 12.9 | Pancreas Pool | 4.5


Table AKE. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4073, Run 171808865 | Tissue Name | Rel. Exp.(%) Ag4073, Run 171808865
Secondary Th1 act | 88.3 | HUVEC IL-1beta | 33.7
Secondary Th2 act | 82.9 | HUVEC IFN gamma | 44.1
Secondary Tr1 act | 59.5 | HUVEC TNF alpha + IFN gamma | 24.3
Secondary Th1 rest | 12.8 | HUVEC TNF alpha + IL4 | 32.1
Secondary Th2 rest | 16.6 | HUVEC IL-11 | 17.1
Secondary Tr1 rest | 26.2 | Lung Microvascular EC none | 50.7
Primary Th1 act | 86.5 | Lung Microvascular EC TNFalpha + IL-1beta | 31.6
Primary Th2 act | 95.3 | Microvascular Dermal EC none | 29.3
Primary Tr1 act | 100.0 | Microvascular Dermal EC TNFalpha + IL-1beta | 28.1
Primary Th1 rest | 25.7 | Bronchial epithelium TNFalpha + IL1beta | 44.4
Primary Th2 rest | 13.9 | Small airway epithelium none | 20.3
Primary Tr1 rest | 34.9 | Small airway epithelium TNFalpha + IL-1beta | 51.4
CD45RA CD4 lymphocyte act | 77.4 | Coronary artery SMC rest | 17.7
CD45RO CD4 lymphocyte act | 94.6 | Coronary artery SMC TNFalpha + IL-1beta | 21.3
CD8 lymphocyte act | 90.1 | Astrocytes rest | 14.5
Secondary CD8 lymphocyte rest | 80.1 | Astrocytes TNFalpha + IL-1beta | 12.2
Secondary CD8 lymphocyte act | | KU-812 (Basophil) rest | 24.5
CD4 lymphocyte none | 7.2 | KU-812 (Basophil) PMA/ionomycin | 59.5
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 42.3 | CCD1106 (Keratinocytes) none | 64.6
LAK cells rest | 57.0 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 46.0
LAK cells IL-2 | 42.9 | Liver cirrhosis | 6.8
LAK cells IL-2+IL-12 | 31.0 | NCI-H292 none | 61.6
LAK cells IL-2+IFN gamma | 23.2 | NCI-H292 IL-4 | 79.0
LAK cells IL-2+IL-18 | 28.7 | NCI-H292 IL-9 | 95.3
LAK cells PMA/ionomycin | 55.9 | NCI-H292 IL-13 | 79.6
NK Cells IL-2 rest | 59.0 | NCI-H292 IFN gamma | 65.5
Two Way MLR 3 day | 27.1 | HPAEC none | 24.7
Two Way MLR 5 day | 57.4 | HPAEC TNF alpha + IL-1 beta | 44.4
Two Way MLR 7 day | 38.2 | Lung fibroblast none | 16.7
PBMC rest | 14.2 | Lung fibroblast TNF alpha + IL-1 beta | 14.2
PBMC PWM | 55.9 | Lung fibroblast IL-4 | 23.2
PBMC PHA-L | 72.2 | Lung fibroblast IL-9 | 35.4
Ramos (B cell) none | 65.5 | Lung fibroblast IL-13 | 23.5
Ramos (B cell) ionomycin | 100.0 | Lung fibroblast IFN gamma | 28.3
B lymphocytes PWM | 82.4 | Dermal fibroblast CCD1070 rest | 36.1
B lymphocytes CD40L and IL-4 | 57.8 | Dermal fibroblast CCD1070 TNF alpha | 65.1
EOL-1 dbcAMP | 51.4 | Dermal fibroblast CCD1070 IL-1 beta | 37.6
EOL-1 dbcAMP PMA/ionomycin | 33.2 | Dermal fibroblast IFN gamma | 26.8
Dendritic cells none | 46.0 | Dermal fibroblast IL-4 | 31.2
Dendritic cells LPS | 31.6 | Dermal Fibroblasts rest | 19.6
Dendritic cells anti-CD40 | 48.0 | Neutrophils TNFa+LPS | 3.1
Monocytes rest | 25.3 | Neutrophils rest | 4.2
Monocytes LPS | 18.0 | Colon | 35.4
Macrophages rest | 70.2 | Lung | 17.6
Macrophages LPS | 14.7 | Thymus | 32.8
HUVEC none | 39.2 | Kidney | 47.6
HUVEC starved | 43.8 | |






Table AKF. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4073, Run 268389975 | Tissue Name | Rel. Exp.(%) Ag4073, Run 268389975
Colon cancer 1 | 50.3 | Bladder cancer NAT 2 | 0.1
Colon cancer NAT 1 | 17.7 | Bladder cancer NAT 3 | 0.1
Colon cancer 2 | 17.8 | Bladder cancer NAT 4 | 1.7
Colon cancer NAT 2 | 22.4 | Adenocarcinoma of the prostate 1 | 6.9
Colon cancer 3 | 100.0 | Adenocarcinoma of the prostate 2 | 0.3
Colon cancer NAT 3 | 23.8 | Adenocarcinoma of the prostate 3 | 2.5
Colon malignant cancer 4 | 47.6 | Adenocarcinoma of the prostate 4 | 7.8
Colon normal adjacent tissue 4 | 14.5 | Prostate cancer NAT 5 | 1.4
Lung cancer 1 | 8.2 | Adenocarcinoma of the prostate 6 | 1.1
Lung NAT 1 | 0.5 | Adenocarcinoma of the prostate 7 | 1.3
Lung cancer 2 | 14.8 | Adenocarcinoma of the prostate 8 | 0.4
Lung NAT 2 | 0.7 | Adenocarcinoma of the prostate 9 | 3.7
Squamous cell carcinoma 3 | 23.0 | Prostate cancer NAT 10 | 0.1
Lung NAT 3 | 0.4 | Kidney cancer 1 | 6.3
metastatic melanoma 1 | 3.8 | Kidney NAT 1 | 3.3
Melanoma 2 | 3.3 | Kidney cancer 2 | 21.2
Melanoma 3 | 2.3 | Kidney NAT 2 | 17.7
metastatic melanoma 4 | 8.8 | Kidney cancer 3 | 4.5
metastatic melanoma 5 | 14.7 | Kidney NAT 3 | 6.7
Bladder cancer 1 | 0.6 | Kidney cancer 4 | 11.6
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 17.4
Bladder cancer 2 | 1 | |


AI_comprehensive panel_v1.0 Summary: Ag4073 Highest expression of the
CG96364-01 gene is detected in an osteoarthritic bone sample (CT=27). Low to moderate
levels of expression of this gene are detected in samples derived from osteoarthritic (OA)
bone and adjacent bone as well as OA cartilage, OA synovium and OA synovial fluid
samples. Moderate level expression is also detected in cartilage, bone, synovium and
synovial fluid samples from rheumatoid arthritis patients. Low level expression of this
gene is also detected in normal samples derived from cartilage, synovium, bone or
synovial fluid cells, and in samples of normal lung, COPD lung, emphysema, atopic asthma,
asthma, allergy, Crohn's disease (normal matched control and diseased), ulcerative
colitis (normal matched control and diseased), and psoriasis (normal matched control and
diseased). Therefore, therapeutic modulation of this gene product may ameliorate
symptoms/conditions associated with autoimmune and inflammatory disorders including
psoriasis, allergy, asthma, inflammatory bowel disease, rheumatoid arthritis and
osteoarthritis.

CNS_neurodegeneration_v1.0 Summary: Ag4073 This panel confirms the
expression of the CG96364-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4073 Highest expression of the
CG96364-01 gene is detected in a gastric cancer KATO III cell line (CT=22). High
expression of this gene is seen in clusters of breast, ovarian, colon, gastric, renal, lung,
pancreatic, CNS, hepatic and prostate cancer cell lines and in melanoma cell lines. Thus,
therapeutic modulation of this gene product could be beneficial in the treatment of these
cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

The CG100466-01 gene codes for an adenine nucleotide translocator (ANT2)
homolog. Dysfunction of ANT2 has been shown to induce myopathies in
mouse and in humans (Fiore et al., 2001, Clin Chim Acta 311(2):125-35, PMID:
11566172). ANT has a role in mtDNA maintenance, and mutation in ANT has been
implicated in autosomal dominant progressive external ophthalmoplegia and other
mitochondrial diseases (Kaukonen et al., 2000, Science 289(5480):782-5, PMID:
10926541). Mice deficient in the heart/muscle specific isoform, ANT1, exhibit many
of the hallmarks of human oxidative phosphorylation (OXPHOS) disease, including a
dramatic proliferation of skeletal muscle mitochondria (Murdoch et al., 1999, J Biol Chem
274(20):14429-33, PMID: 10318868). Therefore, therapeutic modulation of the ANT
protein encoded by the CG100466-01 gene through the use of a small molecule drug could
be useful in the treatment of mitochondria-related diseases.

In addition, this gene is expressed at high levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4073 Highest expression of the CG96364-01 gene is
detected in activated primary Tr1 cells and ionomycin treated Ramos B cells (CTs=25.4). This
gene is expressed at high to moderate levels in a wide range of cell types of significance in
the immune response in health and disease. These cells include members of the T-cell,
B-cell, endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell
family, as well as epithelial and fibroblast cell types from lung and skin, and normal
tissues represented by colon, lung, thymus and kidney. This ubiquitous pattern of
expression suggests that this gene product may be involved in homeostatic processes for
these and other cell types and tissues. This pattern is in agreement with the expression
profile in General_screening_panel_v1.4 and also suggests a role for the gene product in
cell survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4073 Highest
expression of the CG96364-01 gene is detected in colon cancer OD06297-04 sample
(CT=24.3). Higher expression of this gene is seen in colon cancer and lung cancer
compared to the normal adjacent tissues. Therapeutic modulation of this gene product
could be beneficial in the treatment of these cancers. Please see Panel 1.4 for additional
discussion of the potential utility of this gene in treatment of cancers.
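
The tumor versus normal adjacent tissue (NAT) comparison described above can be expressed as a simple ratio of Rel. Exp.(%) values. The sketch below is illustrative only. It uses four tumor/NAT pairs read from Table AKF for Ag4073, taking the extraction-damaged entries "1 00.0" and "1 4.5" as 100.0 and 14.5. Pairing each tumor sample with the NAT sample of the same number is an assumption, and the names `TUMOR_NAT_PAIRS` and `fold_over_nat` are hypothetical.

```python
# Rel. Exp.(%) values for Ag4073 read from Table AKF as (tumor, NAT).
# The pairing of each tumor with the NAT sample of the same number is an
# assumption made for this illustration.
TUMOR_NAT_PAIRS = {
    "Colon 3": (100.0, 23.8),
    "Colon 4": (47.6, 14.5),
    "Lung 1": (8.2, 0.5),
    "Lung 2": (14.8, 0.7),
}


def fold_over_nat(pairs):
    """Return the tumor/NAT expression ratio for each matched pair."""
    return {name: tumor / nat for name, (tumor, nat) in pairs.items()}


fold_changes = fold_over_nat(TUMOR_NAT_PAIRS)

# Pairs in which the tumor sample shows higher relative expression than NAT.
higher_in_tumor = [name for name, fold in fold_changes.items() if fold > 1.0]
```

A ratio above 1 indicates higher relative expression in the tumor sample; because Rel. Exp.(%) is already normalized within the run, the ratio says nothing about absolute transcript abundance.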

AL. CG96422-01: ADP/ATP Translocase 3

Expression of gene CG96422-01 was assessed using the primer-probe set Ag4057,
described in Table ALA. Results of the RTQ-PCR runs are shown in Tables ALB and
ALC.

Table ALA. Probe Name Ag4057

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-gccaatgtcatccgctact-3' | 19 | 232 | 344
Probe | TET-5'-agccctcaatttcgccttcaaggata-3'-TAMRA | 26 | 261 | 345
Reverse | 5'-gccaggaagatctgcttgtact-3' | 22 | 287 | 346



Table ALB. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4057, Run 171620020 | Tissue Name | Rel. Exp.(%) Ag4057, Run 171620020
Secondary Th1 act | 53.9 | HUVEC IL-1beta | 51.1
Secondary Th2 act | 32.5 | HUVEC IFN gamma | 56.6
Secondary Tr1 act | 29.9 | HUVEC TNF alpha + IFN gamma | 29.9
Secondary Th1 rest | 14.9 | HUVEC TNF alpha + IL4 | 42.9
Secondary Th2 rest | 21.5 | HUVEC IL-11 | 43.5
Secondary Tr1 rest | 22.7 | Lung Microvascular EC none | 81.8
Primary Th1 act | 33.7 | Lung Microvascular EC TNFalpha + IL-1beta | 60.3
Primary Th2 act | 54.7 | Microvascular Dermal EC none | 53.0
Primary Tr1 act | 42.6 | Microvascular Dermal EC TNFalpha + IL-1beta | 36.3
Primary Th1 rest | 16.4 | Bronchial epithelium TNFalpha + IL1beta | 65.1
Primary Th2 rest | 8.6 | Small airway epithelium none | 21.8
Primary Tr1 rest | 22.2 | Small airway epithelium TNFalpha + IL-1beta | 36.9
CD45RA CD4 lymphocyte act | 33.0 | Coronary artery SMC rest | 46.7
CD45RO CD4 lymphocyte act | 47.0 | Coronary artery SMC TNFalpha + IL-1beta | 39.5
CD8 lymphocyte act | 55.1 | Astrocytes rest | 29.3
Secondary CD8 lymphocyte rest | | Astrocytes TNFalpha + IL-1beta | 26.8
Secondary CD8 lymphocyte act | 25.0 | KU-812 (Basophil) rest | 72.7
CD4 lymphocyte none | 26.2 | KU-812 (Basophil) PMA/ionomycin | 100.0
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 34.6 | CCD1106 (Keratinocytes) none | 45.4
LAK cells rest | 26.8 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 27.4
LAK cells IL-2 | 0.0 | Liver cirrhosis | 8.1
LAK cells IL-2+IL-12 | 29.5 | NCI-H292 none | 90.1
LAK cells IL-2+IFN gamma | 22.1 | NCI-H292 IL-4 | 81.8
LAK cells IL-2+IL-18 | 17.7 | NCI-H292 IL-9 | 92.7
LAK cells PMA/ionomycin | 41.5 | NCI-H292 IL-13 | 63.3
NK Cells IL-2 rest | 28.7 | NCI-H292 IFN gamma | 57.4
Two Way MLR 3 day | 24.0 | HPAEC none | 50.0
Two Way MLR 5 day | 33.0 | HPAEC TNF alpha + IL-1 beta | 65.5
Two Way MLR 7 day | 28.1 | Lung fibroblast none | 49.0
PBMC rest | 32.1 | Lung fibroblast TNF alpha + IL-1 beta | 14.2
PBMC PWM | 33.0 | Lung fibroblast IL-4 | 37.6
PBMC PHA-L | 30.1 | Lung fibroblast IL-9 | 57.4
Ramos (B cell) none | 36.1 | Lung fibroblast IL-13 | 84.7
Ramos (B cell) ionomycin | 51.8 | Lung fibroblast IFN gamma | 51.1
B lymphocytes PWM | 42.9 | Dermal fibroblast CCD1070 rest | 46.7
B lymphocytes CD40L and IL-4 | 52.1 | Dermal fibroblast CCD1070 TNF alpha | 53.2
EOL-1 dbcAMP | 46.7 | Dermal fibroblast CCD1070 IL-1 beta | 34.4
EOL-1 dbcAMP PMA/ionomycin | 53.2 | Dermal fibroblast IFN gamma | 25.5
Dendritic cells none | 57.0 | Dermal fibroblast IL-4 | 55.5
Dendritic cells LPS | 18.2 | Dermal Fibroblasts rest | 21.2
Dendritic cells anti-CD40 | 66.9 | Neutrophils TNFa+LPS | 4.2
Monocytes rest | 51.4 | Neutrophils rest | 6.5
Monocytes LPS | 16.7 | Colon | 14.8
Macrophages rest | 45.7 | Lung | 23.7
Macrophages LPS | 12.7 | Thymus | 40.3
HUVEC none | 38.7 | Kidney | 36.3
HUVEC starved | 57.0 | |






Table ALC. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4057, Run 268390123 | Tissue Name | Rel. Exp.(%) Ag4057, Run 268390123
Colon cancer 1 | 9.2 | Bladder cancer NAT 2 | 0.0
Colon cancer NAT 1 | 6.5 | Bladder cancer NAT 3 | 0.1
Colon cancer 2 | 12.8 | Bladder cancer NAT 4 | 3.4
Colon cancer NAT 2 | 12.7 | Adenocarcinoma of the prostate 1 | 11.6
Colon cancer 3 | 43.8 | Adenocarcinoma of the prostate 2 | 0.9
Colon cancer NAT 3 | 16.2 | Adenocarcinoma of the prostate 3 | 4.4
Colon malignant cancer 4 | 100.0 | Adenocarcinoma of the prostate 4 | 14.4
Colon normal adjacent tissue 4 | 8.7 | Prostate cancer NAT 5 | 1.9
Lung cancer 1 | 8.0 | Adenocarcinoma of the prostate 6 | 3.3
Lung NAT 1 | 0.4 | Adenocarcinoma of the prostate 7 | 4.0
Lung cancer 2 | 19.1 | Adenocarcinoma of the prostate 8 |
Lung NAT 2 | 0.6 | Adenocarcinoma of the prostate 9 | 17.6
Squamous cell carcinoma 3 | 39.0 | Prostate cancer NAT 10 |
Lung NAT 3 | 0.3 | Kidney cancer 1 | 6.5
metastatic melanoma 1 | 16.4 | Kidney NAT 1 | 3.3
Melanoma 2 | 2.7 | Kidney cancer 2 | 42.3
Melanoma 3 | 1.9 | Kidney NAT 2 | 11.3
metastatic melanoma 4 | 44.1 | Kidney cancer 3 | 13.1
metastatic melanoma 5 | 38.7 | Kidney NAT 3 | 4.7
Bladder cancer 1 | 0.2 | Kidney cancer 4 | 16.8
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 15.4
Bladder cancer 2 | 2.5 | |


CNS_neurodegeneration_v1.0 Summary: Ag4057 Results from one experiment
with the CG96422-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.

General_screening_panel_v1.4 Summary: Ag4057 Results from one
experiment with the CG96422-01 gene are not included. The amp plot indicates that there
were experimental difficulties with this run.

Panel 4.1D Summary: Ag4057 Highest expression of the CG96422-01 gene is
detected in PMA/ionomycin treated basophils (CT=27.4). This gene is expressed at high to
moderate levels in a wide range of cell types of significance in the immune response in
health and disease. These cells include members of the T-cell, B-cell, endothelial cell,
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial
and fibroblast cell types from lung and skin, and normal tissues represented by colon,
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene
product may be involved in homeostatic processes for these and other cell types and
tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4057 Highest
expression of the CG96422-01 gene is detected in a colon malignant cancer sample
(CT=27.5). In addition, significant expression of this gene is associated with kidney
cancer, prostate adenocarcinoma, bladder cancer, melanoma, lung and colon cancers.
Therefore, therapeutic modulation of this gene product through the use of a small molecule
drug may be beneficial in the treatment of these cancers.

AM. CG96442-01: Acyl CoA-Domain Protein

Expression of gene CG96442-01 was assessed using the primer-probe set Ag4058,
described in Table AMA. Results of the RTQ-PCR runs are shown in Tables AMB and
AMC.

Table AMA. Probe Name Ag4058

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tcaaaggagattgtgccaatta-3' | 22 | 703 | 347
Probe | TET-5'-tgcacctgtcagtgttcaacaacaga-3'-TAMRA | 26 | 726 | 348
Reverse | 5'-ctgtttgggtttccgaattaat-3' | 22 | 756 | 349



Table AMB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4058, Run 
218902477 


.... .... . . _ . ... . ... ..... 

Tissue Name 


Rel. Exp.(%) 
Ag4058, Run 
218902477 


Adipose 


2.8 


Renal ca. TK-10 


9.9 


Melanoma* Hs688(A).T 


0.0 


Bladder 


33.2 


Melanoma* Hs688(B).T 


5.5 


Gastric ca. (liver met.) NCI-N87 


26.2 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0 0 


Melanoma* LOX1MV1 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK-MEL-5 


17.3 


Colon ca. SW480 


0.0 


Squamous cell carcinoma 

SCC-4 ; ' U 


Colon ca.* (SW480 met) SW620 


1.6 


Testis Pool 


66.4 
0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone met) PC-3 


Colon ca. HCT-1 16 


2 0 




Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


19.9 


Uterus Pool 


2.0 


Colon ca. SW1 1 16 


0.0 


Ovarian ca. OVCAR-3 


61.1 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV-3 


3.2 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


49.7 


Colon Pool 


16.0 


Ovarian ca. OVCAR-5 


1.8 


Small Intestine Pool 


5.2 


Ovarian ca. IGROV-1 


0.0 


Stomach Pool 


19.9 


Ovarian ca. OVCAR-8 


5.4 


Bone Marrow Pool 


6.7 


Ovary 


6.3 


Fetal Heart 


8.2 


Breast ca. MCF-7 


3.8 


Heart Pool 


2.4 


Breast ca. MDA-MB-231 


7.4 


Lymph Node Pool 


15.3 


Breast ca. BT 549 


4.0 


Fetal Skeletal Muscle


10.4 


Breast ca. T47D 


11.3 


Skeletal Muscle Pool 


1.2 


Breast ca. MDA-N 


0.0 


Spleen Pool 


100.0 


Breast Pool 


28.3 


Thymus Pool 


5.8 


Trachea 


1.1 


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U-118-MG


0.0


Fetal Lung


13.5


CNS cancer (neuro;met) SK-N-AS


0.0


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539


0.0 


Lung ca. LX-1 


3.4 


CNS cancer (astro) SNB-75


0.0 


Lung ca. NCI-H146 


12.7 


CNS cancer (glio) SNB-19


0.0 


Lung ca. SHP-77 


1.9 


CNS cancer (glio) SF-295


>/• 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool


0.0 


Lung ca. NCI-H526


0.0 


Brain (cerebellum)


0.0 


Lung ca. NCI-H23


14.7 


Brain (fetal)


5.8 


Lung ca. NCI-H460


0.0 


Brain (Hippocampus) Pool


6.6 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool


8.0 


Lung ca. NCI-H522 


3.2 


Brain (Substantia nigra) Pool


11.7 


Liver 


3.0 


Brain (Thalamus) Pool


9.0 


Fetal Liver 


4.2 


Brain (whole)


0 0 


Liver ca. HepG2 


0.0 


Spinal Cord Pool


0.0 


Kidney Pool 


20.0 


Adrenal Gland


16.4 


Fetal Kidney 


23.8 


Pituitary gland Pool


8.0 


Renal ca. 786-0 


0.0 


Salivary Gland


8.3 


Renal ca. A498 


0.0 


Thyroid (female)
Pancreatic ca. CAPAN2


0.0 


Renal ca. ACHN 




7.4 


Renal ca. UO-31 


21.2 


Pancreas Pool


8.7 


Table AMC. Panel 4.1D


Tissue Name 


Rel. Exp.(%) 
Ag4058, Run 
171620034 


Tissue Name 


Rel. Exp.(%) 
Ag4058, Run 
171620034 


Secondary Thl act 


0.0 


HUVEC IL-1beta


7.7 


Secondary Th2 act 


0.0 


HUVEC IFN gamma


2.9 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN gamma


2.9 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4


2.5 


Secondary Th2 rest 


0.0 


HUVEC IL-11


5.8 


Secondary Trl rest 


3.1 


Lung Microvascular EC none


7.5 


Primary Thl act 


1.7 


Lung Microvascular EC TNFalpha + IL-1beta


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none


0.0 


Primary Trl act 


3.0 


Microvascular Dermal EC TNFalpha + IL-1beta


0.0


Primary Th1 rest


0.0


Bronchial epithelium TNFalpha + IL-1beta


3.7


Primary Th2 rest


0.0


Small airway epithelium none


0.0


Primary Tr1 rest


0.0


Small airway epithelium TNFalpha + IL-1beta


0.0




CD45RA CD4 lymphocyte 
act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 lymphocyte 
act 


0.0 


Coronery artery SMC TNFalpha + 
IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 lymphocyte rest


0.0 


Astrocytes TNFalpha + IL-lbeta 


4.0 
0.0 


Secondary CD8 lymphocyte act


0.0 


KU-812 (Basophil) rest 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) \ Q Q 
PMA/ionomycin 


2ry Th1/Th2/Tr1_anti-CD95 CH11


0.0 


CCD1106 (Keratinocytes) none


0.0 


LAK cells rest 


9.5 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2


1.8 


Liver cirrhosis


9.3


LAK cells IL-2+IL-12


2.0


NCI-H292 none


10.0


LAK cells IL-2+IFN gamma


0.0


NCI-H292 IL-4


0.0


LAK cells IL-2+IL-18


0.0


NCI-H292 IL-9


0.0


LAK cells PMA/ionomycin


0.0


NCI-H292 IL-13


0.0


NK Cells IL-2 rest


0.0


NCI-H292 IFN gamma


0.0


Two Way MLR 3 day 


6.5 


HPAEC none 


5.2 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 beta 


48.3 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1 
beta 


0.0 


PBMC PWM


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13


0.0 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


2.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest 


0.0 


B lymphocytes CD40L and 
IL-4 


0.0 


Dermal fibroblast CCD1070 TNF 
alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


3.5 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


4.0


Monocytes rest


38.4 


Neutrophils rest 


0.0 


Monocytes LPS 


52.5


Colon 


0.0 


Macrophages rest 


9.9 


Lung 


0.0 


Macrophages LPS 


5.9 


Thymus 


6.1 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


2.5 







AI_comprehensive panel_v1.0 Summary: Ag4058 Expression of the CG96442-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not
shown).

CNS_neurodegeneration_v1.0 Summary: Ag4058 Results from one experiment
with the CG96442-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.

General_screening_panel_v1.4 Summary: Ag4058 Highest expression of the
CG96442-01 gene is detected in spleen (CT=31.4). In addition, moderate to low
expression of this gene is seen in pancreas, adrenal gland, gastrointestinal tract, kidney,
and testis. Therefore, therapeutic modulation of this gene can be useful in the treatment of
diseases associated with these tissues, including diabetes, obesity, inflammatory bowel
disease, lupus and glomerulonephritis.

Low expression of this gene is also detected in brain thalamus and substantia nigra
samples. Therefore, therapeutic modulation of this gene product can be beneficial in
neurological disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple
sclerosis, schizophrenia and depression.

Low expression of this gene is associated with a cluster of cancer cell lines, including
renal cancer, gastric cancer, breast cancer, ovarian cancer, lung cancer and melanoma cell
lines, and with colon cancer tissue. Therefore, therapeutic modulation of this gene product
can be beneficial in the treatment of these cancers.

Interestingly, this gene is expressed at much higher levels in fetal (CT=34.5) than
in adult skeletal muscle and lung (CT>38). Thus, expression of this gene can be
used to distinguish fetal from adult skeletal muscle and lung. In addition, the relative
overexpression of this gene in fetal skeletal muscle suggests that the protein product may
enhance growth or development of muscle and lung in the fetus and thus may also act in a
regenerative capacity in the adult. Therefore, therapeutic modulation of the protein
encoded by this gene could be useful in treatment of muscle and lung related diseases.

Panel 4.1D Summary: Ag4058 Highest expression of the CG96442-01 gene is
detected in kidney (CT=31.4). Therefore, expression of this gene can be used to
distinguish this sample from other samples in the panel. In addition, small molecule
therapies designed with the protein encoded by this gene could modulate kidney
function and be important in the treatment of inflammatory or autoimmune diseases that
affect the kidney, including lupus and glomerulonephritis.

Low level expression of this gene is also detected in TNF alpha + IL-1 beta treated
HPAEC, liver cirrhosis, and monocytes. Therefore, therapeutic modulation of this gene
product can be useful in the treatment of inflammatory and autoimmune diseases such as
asthma, IBD and psoriasis, as well as liver cirrhosis.

General oncology screening panel_v_2.4 Summary: Ag4058 Expression of the 
CG96442-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 
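
The CT values quoted in the summaries above can be related to one another with a short calculation. The Python sketch below is a minimal illustration, not the procedure used to generate the tables: it assumes the common convention that each PCR cycle corresponds to a two-fold difference in template and that relative expression is scaled to the sample with the lowest CT. The function names are hypothetical; the CT > 35 cutoff is the one used in the text for low/undetectable expression.

```python
def relative_expression(ct_values):
    """Percent expression relative to the sample with the lowest CT.

    Assumes perfect doubling per cycle (100% amplification efficiency).
    """
    ct_min = min(ct_values.values())
    return {name: 100.0 * 2.0 ** (ct_min - ct) for name, ct in ct_values.items()}


def fold_difference(ct_a, ct_b):
    """Fold by which sample A exceeds sample B under the same doubling assumption."""
    return 2.0 ** (ct_b - ct_a)


def is_detectable(ct, cutoff=35.0):
    """Apply the CT > 35 'low/undetectable' cutoff quoted in the summaries."""
    return ct <= cutoff


if __name__ == "__main__":
    # CT values taken from the General_screening_panel_v1.4 summary for Ag4058.
    cts = {"Spleen": 31.4, "Fetal skeletal muscle/lung": 34.5}
    for tissue, pct in relative_expression(cts).items():
        print(f"{tissue}: {pct:.1f}%")
    # Adult tissue is reported only as CT > 38, so this is a lower bound.
    print(f"Fetal vs adult: at least {fold_difference(34.5, 38.0):.1f}-fold")
```

Because the adult samples are reported only as CT>38, the fetal-to-adult ratio computed this way (2 raised to the power 3.5, about 11-fold) is a lower bound rather than an estimate.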

AN. CG96501-01: Brain Mitochondrial Carrier Protein-1 Like protein

Expression of gene CG96501-01 was assessed using the primer-probe set Ag4059, 
described in Table ANA. Results of the RTQ-PCR runs are shown in Tables ANB, ANC 
and AND. 

Table ANA. Probe Name Ag4059

Primers   Sequences                                     Length   Start Position   SEQ ID No
Forward   5'-ctcagtttcccagtcattgatg-3'                  22       53               350
Probe     TET-5'-cctgagcagcagcaaagtaccactgt-3'-TAMRA    26       77               351
Reverse   5'-ggcgtcatatacaaagggttt-3'                   21       131              352



Table ANB. General_screening_panel_v1.4



Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
218902504 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
218902504 

1.4 
1.2 


Adipose 


0.1 


Renal ca. TK-10 


Melanoma* Hs688(A).T 


0.6 


Bladder 


Melanoma* Hs688(B).T 


0.3 


Gastric ca. (liver met.) NCI-N87 


4.3 


Melanoma* M14 


0.6 


Gastric ca. KATO III 


1.8 


Melanoma* LOXIMVI 


0.4 


Colon ca. SW-948 


0.4 


Melanoma* SK-MEL-5 


0.7 


Colon ca. SW480 


1.7 


Squamous cell carcinoma SCC-4


0.3


Colon ca.* (SW480 met) SW620


2.6








Testis Pool 


1.1 


Colon ca. HT29 


1.3 


Prostate ca.* (bone met) 
PC-3 


1.0 


Colon ca. HCT-1 16 12 6 

I 


Prostate Pool 


0.5 


Colon ca. CaCo-2


2.8


Placenta 


0.7 


Colon cancer tissue 1 1 J 


Uterus Pool


0.3


Colon ca. SW1116


0.9


Ovarian ca. OVCAR-3


0.6


Colon ca. Colo-205 




Ovarian ca. SK-OV-3 


3.6 


Colon ca. SW-48 


0.1 


Ovarian ca. OVCAR-4 


0.5 


Colon Pool 




Ovarian ca. OVCAR-5 


2.1 


Small Intestine Pool 




Ovarian ca. IGROV-1 


0.6 


Stomach Pool 


1.5 


Ovarian ca. OVCAR-8 


0.6 


Bone Marrow Pool 


1.3 


Ovary


2.4 


Fetal Heart 


1.2 


Breast ca. MCF-7 


2.0 


Heart Pool 


1.2 


Breast ca. MDA-MB-231


3.3 


Lymph Node Pool 


3.7 


Breast ca. BT 549 


5.4 


Fetal Skeletal Muscle 


- — 


Breast ca. T47D 
Breast ca. MDA-N 


2.7 
0.6 


Skeletal Muscle Pool 
Spleen Pool 


» - - 


Breast Pool 


3.9 


Thymus Pool 


-L 


Trachea 


1.3 


CNS cancer (glio/astro) U87-MG 


3.4 


Lung 


0.7 


CNS cancer (glio/astro) U-118-MG


2.7 


Fetal Lung 


3.8 


CNS cancer (neuro;met) SK-N-AS




Lung ca. NCI-N417 


0.4 


CNS cancer (astro) SF-539 


1.0 


Lung ca. LX-1 


2.1 


CNS cancer (astro) SNB-75 


3.5 


Lung ca. NCI-H146


0.0 


CNS cancer (glio) SNB-19 


0.7 


Lung ca. SHP-77 


1.1 


CNS cancer (glio) SF-295 


3.3 


Lung ca. A549 


2.6 


Brain (Amygdala) Pool 


0.6 


Lung ca. NCI-H526 


0.2 


Brain (cerebellum) 


0.9 


Lung ca. NCI-H23 


2.9 


Brain (fetal) 


1.2 


Lung ca. NCI-H460


2.3 


Brain (Hippocampus) Pool 


.2 


Lung ca. HOP-62 


2.3 


Cerebral Cortex Pool


3.8 


Lung ca. NCI-H522 


•5 


Brain (Substantia nigra) Pool


L5 


Liver


).l I 


Brain (Thalamus) Pool


.2 


Fetal Liver


•2 [ 


Brain (whole)


.2 


Liver ca. HepG2


• 1 5 


Spinal Cord Pool


.3


Kidney Pool


3.1 


Adrenal Gland 


0.7 


Fetal Kidney 


3.9 


Pituitary gland Pool 


0.5 


Renal ca. 786-0


2.1 


Salivary Gland 


0.4 


Renal ca. A498 


1.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


1.3 


Pancreatic ca. CAPAN2 


2.1 


Renal ca. UO-31 


2.0 


Pancreas Pool 


100.0 


Table ANC. Panel 4.1D


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
171620235 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
171620235 


Secondary Thl act 


14.4 


HUVEC IL-lbeta 


15.7 


Secondary Th2 act 


39.2 


HUVEC IFN gamma


26.1 


Secondary Tr 1 act 


34.4 


HUVEC TNF alpha + IFN 
gamma 


16.8 


Secondary Th 1 rest 


18.8 


HUVEC TNF alpha +IL4 


10.6 


Secondary Th2 rest 


6.1 


HUVEC IL-11




Secondary Trl rest 


36.6 


Lung Microvascular EC none 


19.5 


Primary Thl act 


134 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


14.4 


Primary Th2 act 


31.2 


Microvascular Dermal EC none 


18.8 


Primary Trl act 


22.2 


Microsvasular Dermal EC 
TNFalpha+ IL-lbeta 


12.2 


Primary Thl rest 


6.4 


Bronchial epithelium TNFalpha + 
ILlbeta 


4.6 


Primary Th2 rest 


16.5 


Small airway epithelium none 




Primary Trl rest 


12.1 


Small airway epithelium 
TNFalpha + IL-lbeta 


4.4 


CD45RA CD4 
lymphocyte act 


21.9 


Coronery artery SMC rest 


2.8 


CD45RO CD4 
lymphocyte act 


39.5 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


4.5 


CD8 lymphocyte act 


40.3 


Astrocytes rest 


4.3 


Secondary CDS 
lymphocyte rest 


23.2 


Astrocytes TNFalpha + IL-lbeta 


10.5 


Secondary CD8 
lymphocyte act 


7.2 


KU-8 12 (Basophil) rest 


33.9 


CD4 lymphocyte none 


27.7 


KU-8 12 (Basophil) 
PMA/ionomycin 


34.6 


2ry Th1/Th2/Tr1 anti-CD95 CH11


45.7 


CCD1 106 (Keratinocytes) none 


9.2 


LAK cells rest 


17.9 


CCD1106 (Keratinocytes) TNFalpha + IL-1beta


8.7




LAK cells IL-2


27.7 


Liver cirrhosis


4.3 


LAK cells IL-2+IL-12 


20.2 


NCI-H292 none


11.0 


LAK cells IL-2+IFN 
gamma 


12.0 


NCI-H292 IL-4


11.5 


LAK cells IL-2+IL-18




NCI-H292 IL-9


24.3 


LAK cells 
PMA/ionomycin 


5.6 


NCI-H292 IL-13


12.3 


NK Cells IL-2 rest 


ilA _ . 


NCI-H292 IFN gamma


13.8 


Two Way MLR 3 day 




HPAEC none


1 8.8 


Two Way MLR 5 day


22.8 


HPAEC TNF alpha + IL-1 beta 


20.0 


Two Way MLR 7 day 


30.8 


Lung fibroblast none 


23.8 


PBMC rest 


4.6 


Lung fibroblast TNF alpha + IL-1 beta


9.2 


PBMC PWM 


24.5 


Lung fibroblast IL-4 


10.5 


PBMC PHA-L 


29.1 


Lung fibroblast IL-9 


23.8 


Ramos (B cell) none 


41.8 


Lung fibroblast IL-13


19.1 


Ramos (B cell) 
ionomycin 


37.9 


Lung fibroblast I FN gamma 


20.9 


B lymphocytes PWM


16.4 


Dermal fibroblast CCD 1070 rest 


29.1 


B lymphocytes CD40L 
and IL-4 


4">3 


Dermal fibroblast CCD 1 070 TNF 
alpha 


23.7 


EOL-1 dbcAMP


— 


Dermal fibroblast CCD 1070 IL-1 
beta 


4.4 


EOL-1 dbcAMP
PMA/ionomycin 


15.5 


Dermal fibroblast IFN gamma 




7.3 


Dendritic cells none 


1 1.4 


Dermal fibroblast IL-4 


20.3 


Dendritic cells LPS 






|21.3 


Dendritic cells anti- 
CD40 


— 

4.2 


Neutrophils TNFa+LPS 


I 2 - 8 


Monocytes rest 


6.7 


Neutrophils rest 


j 1 7.4 


Monocytes LPS 


21.9 


Colon 


3.1 


Macrophages rest 


16.2 


Lung 


6.5 


Macrophages LPS 


12.9 


Thymus 


57.0 


HUVEC none 


12.7 


Kidney 


100.0 


HUVEC starved 


33.9 






Table AND. General oncology screening panel_v_2.4


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
268390135 


Tissue Name 


Rel. Exp.(%) 
Ag4059, Run 
268390135


Colon cancer 1


8.7 


Bladder cancer NAT 2


0.0 


Colon NAT 1 


0.6 


Bladder cancer NAT 3 


0.4 


Colon cancer 2 


9.5 


Bladder cancer NAT 4 


8.1 


Colon cancer NAT 2 


4.1 


Adenocarcinoma of the prostate 1 


31.2 


Colon cancer 3 


12.9 


Adenocarcinoma of the prostate 2 


2.4 


Colon cancer NAT 3 


14.7 


Adenocarcinoma of the prostate 3


12.2


Colon malignant cancer 
4 


29.7 


Adenocarcinoma of the prostate 4 


17.0 


Colon normal adjacent tissue 4


2.1 


Prostate cancer NAT 5 


2.7 


Lung cancer 1 





Adenocarcinoma of the prostate 6 


47 


Lung NAT 1 


__ 


Adenocarcinoma of the prostate 7 


0.0 


Lung cancer 2 


23.8 


Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 


0.6 


Adenocarcinoma of the prostate 9 


27.4 


Squamous cell carcinoma 3


11.2 


Prostate cancer NAT 10


0.5 


Lung NAT 3 


0.0 


Kidney cancer 1 


17.0 


metastatic melanoma 1


19.5


Kidney NAT 1


Melanoma 2


1.9


Kidney cancer 2


47.6


Melanoma 3


1.3


Kidney NAT 2


20.7


metastatic melanoma 4


97.9


Kidney cancer 3


44.1


metastatic melanoma 5 


100.0 


Kidney NAT 3 


6.4 


Bladder cancer 1 


0.5 


Kidney cancer 4 


10.4 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


2.1 


Bladder cancer 2 


3.4 







CNS_neurodegeneration_v1.0 Summary: Ag4059 Results from one experiment
with the CG96501-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.

General_screening_panel_v1.4 Summary: Ag4059 Highest expression of the
CG96501-01 gene is detected in pancreas (CT=24.6). In addition, moderate expression of
this gene is seen among the tissues with metabolic or endocrine function such as pancreas,
adipose, adrenal gland, pituitary gland, skeletal muscle, heart, liver and the gastrointestinal
tract. Therefore, therapeutic modulation of the activity of this gene may prove useful in the
treatment of endocrine/metabolically related diseases, such as obesity and diabetes.

The CG96501-01 gene codes for a homolog of mouse mitochondrial uncoupling
protein 5 (UCP5). Proteins belonging to the UCP family play an important role in
thermoregulatory responses, including acute hypothermia usually followed by temperature
recovery (Yu et al., 2000, Am J Physiol Endocrinol Metab 279(2):E433-46). In addition,
mutation in UCP3, another member of the UCP family, causes severe obesity and type II
diabetes (Brown et al., 1999, Hum. Mutat. 13: 508 only). Therefore, therapeutic
modulation of this gene product can be useful in treatment of disorders affecting
thermoregulatory responses, obesity and type II diabetes.

Significant expression of this gene is also seen in a cluster of cancer cell lines
including pancreatic cancer, CNS cancer, colon and gastric cancer, renal cancer, lung
cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell carcinoma and
melanoma cell lines. Therefore, therapeutic modulation of this gene product can be
beneficial in the treatment of these cancers.

Also, this gene is expressed at high levels in all regions of the central nervous
system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4059 Highest expression of the CG96501-01 gene is
detected in kidney (CT=32). This gene is expressed at low to moderate levels in a wide
range of cell types of significance in the immune response in health and disease. These
cells include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney.
This ubiquitous pattern of expression suggests that this gene product may be involved in
homeostatic processes for these and other cell types and tissues. This pattern is in
agreement with the expression profile in General_screening_panel_v1.4 and also suggests
a role for the gene product in cell survival and proliferation. Therefore, modulation of the
gene product with a functional therapeutic may lead to the alteration of functions
associated with these cell types and lead to improvement of the symptoms of patients
suffering from autoimmune and inflammatory diseases such as asthma, allergies,
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and
osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4059 Highest
expression of the CG96501-01 gene is detected in two metastatic melanoma samples
(CTs=31.7). Interestingly, significant expression of this gene is also seen in a cluster of
cancer samples including kidney, prostate adenocarcinoma, metastatic melanoma, lung
and colon cancers. Please see Panel 1.4 for a discussion of the potential utility of this gene
in treatment of these cancers.

AO. CG96557-01: L-ALLO-Threonine Aldolase-Like Protein

Expression of gene CG96557-01 was assessed using the primer-probe set Ag4067, 
described in Table AOA. 

Table AOA. Probe Name Ag4067

Primers   Sequences                                  Length   Start Position   SEQ ID No
Forward   5'-tggagcactgtgactctgtgt-3'                21       614              353
Probe     TET-5'-ctttctgcctctccaagggcctg-3'-TAMRA    23       635              354
Reverse   5'-aggcttcttcaatgaagtcctt-3'               22       691              355



CNS_neurodegeneration_v1.0 Summary: Ag4067 Results from one experiment
with the CG96557-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.

General_screening_panel_v1.4 Summary: Ag4067 Expression of the CG96557-01
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not
shown).

Panel 2.2 Summary: Ag4067 Expression of the CG96557-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

Panel 4.1D Summary: Ag4067 Expression of the CG96557-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).
Results from one experiment (run 171808741) with the CG96557-01 gene are not
included. The amp plot indicates that there were experimental difficulties with this run.

AP. CG96581-01: RP42

Expression of gene CG96581-01 was assessed using the primer-probe set Ag4070,
described in Table APA. Results of the RTQ-PCR runs are shown in Tables APB, APC,
APD and APE.



Table APA. Probe Name Ag4070

Primers   Sequences                                       Length   Start Position   SEQ ID No
Forward   5'-ccagtagtcacaggtggaaaac-3'                    22       8^               356
Probe     TET-5'-cagccttttctaggcagcaagttaagca-3'-TAMRA    28       858              357
Reverse   5'-caatctccttgcaggatacaaa-3'                    22       906              358



Table APB. CNS_neurodegeneration_v1.0



Tissue Name


Rel. Exp.(%)
Ag4070, Run
214294588


Tissue Name


Rel. Exp.(%)
Ag4070, Run
214294588


AD 1 Hippo


10.2


Control (Path) 3 Temporal Ctx


0.1


AD 2 Hippo


14.1


Control (Path) 4 Temporal Ctx


26.6


AD 3 Hippo


6.4


AD 1 Occipital Ctx


16.8


AD 4 Hippo


4.0


AD 2 Occipital Ctx (Missing)


0.0


AD 5 Hippo


100.0


AD 3 Occipital Ctx


AD 6 Hippo


55.1


AD 4 Occipital Ctx


75.6


Control 2 Hippo 


15.3 


AD 5 Occipital Ctx 


12.8 


Control 4 Hippo 


10.1 


AD 6 Occipital Ctx 


28.9 


Control (Path) 3 Hippo 


4.5 


Control 1 Occipital Ctx 


3.5 


AD 1 Temporal Ctx 


13.5 


Control 2 Occipital Ctx 


76.8 


AD 2 Temporal Ctx 


19.8 


Control 3 Occipital Ctx 


19.3 


AD 3 Temporal Ctx 


4.1 


Control 4 Occipital Ctx 


5.2 


AD 4 Temporal Ctx 


16.8 


Control (Path) 1 Occipital Ctx 


82.9 


AD 5 Inf Temporal Ctx 


50.0 


Control (Path) 2 Occipital Ctx 


11.3 


AD 5 SupTemporal Ctx 


20.0 


Control (Path) 3 Occipital Ctx 


1.6 


AD 6 Inf Temporal Ctx 


66.4 


Control (Path) 4 Occipital Ctx 


12.6 


AD 6 Sup Temporal Ctx 


62.4 


Control 1 Parietal Ctx 


3.7 


Control 1 Temporal Ctx 


4.2 


Control 2 Parietal Ctx 


21.5 


Control 2 Temporal Ctx 


32.5 


Control 3 Parietal Ctx 


17.8 


Control 3 Temporal Ctx 


9.9 


Control (Path) 1 Parietal Ctx 


58.6 


Control 4 Temporal Ctx 


4.9 


Control (Path) 2 Parietal Ctx 


21.0 


Control (Path) 1 
Temporal Ctx 


37.1 


Control (Path) 3 Parietal Ctx 


2.4 


Control (Path) 2 Temporal Ctx


32.1


Control (Path) 4 Parietal Ctx


56.9



Table APC . Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
172382436 


Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
172382436 


Secondary Th1 act


42.9 


HUVEC IL-1beta


27.0 


Secondary Th2 act 


58.6 


HUVEC IFN gamma


27.0 


Secondary Trl act 


47.3 


HUVEC TNF alpha + IFN gamma


10.8 


Secondary Thl rest 


16.2 


HUVEC TNF alpha + IL4 


21.3 


Secondary Th2 rest 


14.6 


HUVEC IL-11


15.7 


Secondary Trl rest 


27.7 


Lung Microvascular EC none 


38.2 


Primary Thl act 


46.7 


Lung Microvascular EC 
TNFalpha + 1L- lbeta 


31.0 


Primary Th2 act 


46.7 


Microvascular Dermal EC none 


26.1 


Primary Trl act 


42.9 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


19.8 


Primary Th 1 rest 


17.9 


Bronchial epithelium TNFalpha 
+ 1L lbeta 


33.7 


Primary Th2 rest 


5.6 


Small airway epithelium none 


11.1 


Primary Trl rest 


19.6 


Small airway epithelium 
TNFalpha + IL-1 beta 


22.4 


CD45RA CD4 
lymphocyte act 


42.3 


Coronery artery SMC rest 


17.8 


CD45RO CD4 
lymphocyte act 


48.0 


Coronery artery SMC TNFalpha 
+ 1L- lbeta 


19.5 


CD8 lymphocyte act


57.4 


Astrocytes rest 


14.1 


Secondary CD8 
lymphocyte rest 


26.6 


Astrocytes TNFalpha + IL-1 beta 


8.2 


Secondary CD8 lymphocyte act


14.3 


KU-812 (Basophil) rest 


49.3 


CD4 lymphocyte none 


19.5 


KU-8 12 (Basophil) 
PMA/ionomycin 


74.2 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


35.8 


CCD1 106 (Keratinocytes) none 


39.5 


LAK cells rest 


24.8 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 


17.9 


LAK cells IL-2 


30.8 


Liver cirrhosis 


3.8 


LAK cells IL-2+IL-12


18.6 


NCI-H292 none 


28.3 


LAK cells IL-2+IFN 
gamma 


11.7 


NCI-H292 IL-4 


52.5


LAK cells IL-2+IL-18


14.2 


NCI-H292 IL-9 




LAK cells 

PMA/ionomycin 


13.3 


NCI-H292 IL-13 


43.5 


NK Cells IL-2 rest


50.7 


NCI-H292 I FN gamma 


20.6 


Two Way MLR 3 day 


26.1 


HPAEC none 


12.9 


Two Way MLR 5 day 


22.4 


HPAEC TNF alpha + IL-1 beta


22.1 


Two Way MLR 7 day


23.8 


Lung fibroblast none 


16.8 


PBMC rest 


10.0 


Lung fibroblast TNF alpha + IL- 
1 beta 


12.9 


PBMC PWM 


23.2 


Lung fibroblast IL-4


13.3


PBMC PHA-L 


21.2 


Lung fibroblast IL-9 


13.5 


Ramos (B cell) none 


54.3 


Lung fibroblast IL-13 


14.9 


Ramos (B cell) 
ionomycin 


52.1 


Lung fibroblast IFN gamma


21.0 


B lymphocytes PWM 


26.4 


Dermal fibroblast CCD 1070 rest 


36.6 


B lymphocytes CD40L 
and IL-4 


27.7 


Dermal fibroblast CCD1070 
TNF alpha 


51.8 


EOL-1 dbcAMP


55.1 


Dermal fibroblast CCD 1 070 IL-1 
beta 


23.7 


EOL-1 dbcAMP
PMA/ionomycin 


49.7 


Dermal fibroblast IFN gamma 


16.3 


Dendritic cells none 


49.7 


Dermal fibroblast IL-4 


Dendritic cells LPS 


20.9 


Dermal Fibroblasts rest 


13.8 


Dendritic cells anti- 
CD40 


41.2 


Neutrophils TNFa+LPS 


6.7 


Monocytes rest 


23.5 


Neutrophils rest 


17.1 


Monocytes LPS 


36.1 


Colon 


7.0 


Macrophages rest 


24.8 


Lung 


6.9 


Macrophages LPS 


7.0 


Thymus 


100.0 


HUVEC none 


12.7 


Kidney


44.1 


HUVEC starved 


19.6 






Table APD. Panel CNS 1 


Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
180912025 


Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
180912025 


BA4 Control 


25.2 


BA17PSP 


21.5 


BA4 Control2 


32.5 


BA17PSP2 


6.5 


BA4 Alzheimer's2 


2.9 


Sub Nigra Control 


33.7 


BA4 Parkinson's 


40.1 


Sub Nigra Control2 


23.2


BA4 Parkinson's2


100.0 


Sub Nigra Alzheimer's2 


12.1 


BA4 Huntington's 


29.1


Sub Nigra Parkinson's2 


• 7 


BA4 Huntington's2 


6.1 


Sub Nigra Huntington's 


49.7 


BA4 PSP 


17.0 


Sub Nigra Huntington's2 


28.1 


BA4 PSP2 


26.6 


Sub Nigra PSP2 


8.1 


BA4 Depression 


9.2 


Sub Nigra Depression 


8.2 


BA4 Depression2 


10.4 


Sub Nigra Depression2 


7.2 


BA 7 Control 


37.6 


Glob Palladus Control


11.1 


BA7 Control 2 


32.8 


Glob Palladus Control2 


______ 


BA7 Alzheimer's2 


5.1 


Glob Palladus Alzheimer's 


7.1 


BA7 Parkinson's 


18.0 


Glob Palladus Alzheimer's2 


5.5 


BA7 Parkinson's2 


28.1 


Glob Palladus Parkinson's 


35.4 


BA7 Huntington's 


40.9 


Glob Palladus Parkinson's2 


4.5 


BA7 Huntington's2 


36.1 



Glob Palladus PSP 


2.2 


BA7 PSP 




Glob Palladus PSP2 


10.2 


BA7 PSP2 


24.7 


Glob Palladus Depression 


5.7 


BA7 Depression 


11.7 


Temp Pole Control 


8.1 


BA9 Control


14.2 


Temp Pole Control 




18.8 


BA9 Control2 


39.0 


Temp Pole Alzheimer's 


3.1 


BA9 Alzheimer's 


3.9 


Temp Pole Alzheimer's2 


0.0 


BA9 Alzheimer's2 


11.0 


Temp Pole Parkinson's 


19.1 


BA9 Parkinson's 


16.7 


Temp Pole Parkinson's2 


14.6 


BA9 Parkinson's2 


48.0 


Temp Pole Huntington's 


21.6 


BA9 Huntington's 


31.9 


Temp Pole PSP 


2.8 


BA9 Huntington's2 


21.5 


Temp Pole PSP2 


1.8 


BA9 PSP 


10.2 


Temp Pole Depression2 


5.6 


BA9 PSP2 


7.0 


Cing Gyr Control 


41.8 


BA9 Depression 


6.7 


Cing Gyr Control2


13.2 


BA9 Depression2 


4.6 


Cing Gyr Alzheimer's 


11.4 


BA 17 Control 


50.3 


Cing Gyr Alzheimer's2


6.3 


BA17Control2 


58.2 


Cing Gyr Parkinson's 


8.6 


BA17 Alzheimer's2 


14.6 


Cing Gyr Parkinson's2 


24.7 


BA17 Parkinson's 


38.7 


Cing Gyr Huntington's 


22.5 


BA17 Parkinson's2 


53.2 


Cing Gyr Huntington's2 


12.9 


BA17 Huntington's 


35.1 


Cing Gyr PSP 


12.7 


BA17 Huntington's2 


9.5 


Cing Gyr PSP2 


1.6 


BA17 Depression 


9.3 


Cing Gyr Depression 


5.3 


BA17 Depression2 


26.8 


Cing Gyr Depression2 


3.8


Table APE. General oncology screening panel_v_2.4



Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
268390170 


Tissue Name 


Rel. Exp.(%) 
Ag4070, Run 
268390170 


Colon cancer 1 


14.3 


Bladder cancer NAT 2 


0.0 


Colon cancer NAT 1


3.5 


Bladder cancer NAT 3 


0 0 






Bladder cancer NAT 4 


S.6 


Colon cancer NAT 2 


7.5 


Adenocarcinoma of the prostate 
1 


27.7


Colon cancer 3 


47.3 


Adenocarcinoma of the prostate 

2 


0.9 


Colon cancer NAT 3 


14.2 


Adenocarcinoma of the prostate 
3 


16.6 


Colon malignant cancer 
4 


28.3 


Adenocarcinoma of the prostate 
4 


44.4 


Colon normal adjacent 
tissue 4 


1.3 


Prostate cancer NAT 5 


2.8 


Lung cancer 1


10.2 


Adenocarcinoma of the prostate 
6 


2.9 
3.6 


Lung NAT 1


0.1


Adenocarcinoma of the prostate 
7 


Lung cancer 2 


100.0 


Adenocarcinoma of the prostate 
8 


2.0 


Lung NAT 2 


3.5 


Adenocarcinoma of the prostate 
9 


20.0 


Squamous cell 
carcinoma 3 


17.9 


Prostate cancer NAT 10 


0.6 


Lung NAT 3 


0.3 


Kidney cancer 1

9.1 


metastatic melanoma 1 


15.6 


Kidney NAT 1


8.1 


Melanoma 2 


1.4 


Kidney cancer 2 


45.7 


Melanoma 3 


2.2 


Kidney NAT 2 


17.6 


metastatic melanoma 4 


39.8 


Kidney cancer 3


19.9 


metastatic melanoma 5 


52.0 


Kidney NAT 3


3.1 


Bladder cancer 1 


1.2 


Kidney cancer 4 


7.2 


Bladder cancer NAT 1 


3.0 


Kidney NAT 4 


1.7 


Bladder cancer 2 


?.8 







CNS_neurodegeneration_v1.0 Summary: Ag4070 This panel confirms the
expression of the CG96581-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Furthermore, low expression of this gene in brain suggests that this gene may
play a role in central nervous system disorders such as Parkinson's disease, epilepsy,
multiple sclerosis, schizophrenia and depression.

The CG96581-01 gene codes for an RP42 homolog. The mouse RP42 gene is
expressed in proliferating neuroblasts, and its human ortholog maps to a susceptibility locus
for autism (Mas et al., 2000, Genomics 65: 70-74, PubMed ID: 10777668). Therefore, this
gene may also play a role in autism and therapeutic modulation of this gene may be useful
in the treatment of autism and other neurological disorders.

General_screening_panel_v1.4 Summary: Ag4070 Results from one 
experiment with the CG96581-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Panel 4.1D Summary: Ag4070 Highest expression of the CG96581-01 gene is 
detected in thymus (CT=30.3). This gene is expressed at high to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. These 
cells include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and 
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types 
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney. 
This ubiquitous pattern of expression suggests that this gene product may be involved in 
homeostatic processes for these and other cell types and tissues. This pattern suggests a 
role for the gene product in cell survival and proliferation. Therefore, modulation of the 
gene product with a functional therapeutic may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

Panel CNS_1 Summary: Ag4070 This panel confirms the expression of the 
CG96581-01 gene at low levels in the brain. Please see Panel 
CNS_neurodegeneration_v1.0 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General oncology screening panel_v_2.4 Summary: Ag4070 Highest 
expression of the CG96581-01 gene is detected in a lung cancer sample (CT=30). 
Interestingly, expression of this gene is higher in a number of cancer samples, including 
lung, prostate, colon, melanoma and kidney cancers (CTs=30-33), as compared to control 
normal samples (CTs>35). Therefore, expression of this gene can be used as a diagnostic 
marker for these cancers, and therapeutic modulation of this gene can be beneficial in the 
treatment of these cancers. 

AQ. CG96624-01 and CG96624-02: Putative Seven Pass Transmembrane Protein 

Expression of gene CG96624-01 and variant CG96624-02 was assessed using the 
primer-probe sets Ag1372 and Ag4082, described in Tables AQA and AQB. Results of the 
RTQ-PCR runs are shown in Tables AQC, AQD, AQE and AQF. 

Table AQA. Probe Name Ag1372 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-ctttgttccccaggtcattc-3' | 20 | 792 | 359 
Probe | TET-5'-cgcctggtcagacacattgtaccagt-3'-TAMRA | 26 | 758 | 360 
Reverse | 5'-ggctggacaccttcgattac-3' | 20 | 737 | 361 



Table AQB. Probe Name Ag4082 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-tgcccttctggcttctctac-3' | 20 | 320 | 362 
Probe | TET-5'-cctgcagttcttcaccttgacgctta-3'-TAMRA | 26 | 354 | 363 
Reverse | 5'-ttacctgggcaaagtagaggtt-3' | 22 | 382 | 364 



Table AQC. CNS_neurodegeneration_v1.0 

Tissue Name | Rel. Exp.(%) Ag4082, Run 214294986 | Tissue Name | Rel. Exp.(%) Ag4082, Run 214294986 
AD 1 Hippo | 26.8 | Control (Path) 3 Temporal Ctx | 7.3 
AD 2 Hippo | 34.2 | Control (Path) 4 Temporal Ctx | 29.1 
AD 3 Hippo | 16.6 | AD 1 Occipital Ctx | 15.4 
AD 4 Hippo | 9.2 | AD 2 Occipital Ctx (Missing) | 0.0 
AD 5 Hippo | 95.9 | AD 3 Occipital Ctx | 15.9 
AD 6 Hippo | 52.9 | AD 4 Occipital Ctx | 15.4 
Control 2 Hippo | 27.7 | AD 5 Occipital Ctx | 55.1 
Control 4 Hippo | 15.8 | AD 6 Occipital Ctx | 22.8 
Control (Path) 3 Hippo | 8.5 | Control 1 Occipital Ctx | 4.6 
AD 1 Temporal Ctx | 23.5 | Control 2 Occipital Ctx | 89.5 
AD 2 Temporal Ctx | 33.4 | Control 3 Occipital Ctx | 18.2 
AD 3 Temporal Ctx | 11.6 | Control 4 Occipital Ctx | 9.4 
AD 4 Temporal Ctx | 18.2 | Control (Path) 1 Occipital Ctx | 95.9 
AD 5 Inf Temporal Ctx | 100.0 | Control (Path) 2 Occipital Ctx | 8.5 
AD 5 Sup Temporal Ctx | 51.4 | Control (Path) 3 Occipital Ctx | 6.2 
AD 6 Inf Temporal Ctx | 39.0 | Control (Path) 4 Occipital Ctx | 15.3 
AD 6 Sup Temporal Ctx | 36.3 | Control 1 Parietal Ctx | 8.1 
Control 1 Temporal Ctx | 11.1 | Control 2 Parietal Ctx | 42.9 
Control 2 Temporal Ctx | 66.0 | Control 3 Parietal Ctx | 18.4 
Control 3 Temporal Ctx | 19.1 | Control (Path) 1 Parietal Ctx | 69.3 
Control 3 Temporal Ctx | 15.8 | Control (Path) 2 Parietal Ctx | 18.0 
Control (Path) 1 Temporal Ctx | 58.2 | Control (Path) 3 Parietal Ctx | 9.7 
Control (Path) 2 Temporal Ctx | 30.1 | Control (Path) 4 Parietal Ctx | 36.3 


Table AQD. General_screening_panel_v1.4 

Tissue Name | Rel. Exp.(%) Ag4082, Run 218897864 | Tissue Name | Rel. Exp.(%) Ag4082, Run 218897864 
Adipose | 1.3 | Renal ca. TK-10 | 8.8 
Melanoma* Hs688(A).T | 18.8 | Bladder | 8.2 
Melanoma* Hs688(B).T | [illegible] | Gastric ca. (liver met.) NCI-N87 | 27.5 
Melanoma* M14 | 38.4 | Gastric ca. KATO III | 21.8 
Melanoma* LOXIMVI | 27.0 | Colon ca. SW-948 | 7.2 
Melanoma* SK-MEL-5 | 10.3 | Colon ca. SW480 | 39.8 
Squamous cell carcinoma SCC-4 | 21.0 | Colon ca.* (SW480 met) SW620 | 7.9 
Testis Pool | 40.3 | Colon ca. HT29 | 9.3 
Prostate ca.* (bone met) PC-3 | 51.4 | Colon ca. HCT-116 | 22.2 
Prostate Pool | 3.4 | Colon ca. CaCo-2 | 17.2 
Placenta | 32.3 | Colon cancer tissue | 7.9 
Uterus Pool | 1.2 | Colon ca. SW1116 | 6.2 
Ovarian ca. OVCAR-3 | 12.2 | Colon ca. Colo-205 | 5.3 
Ovarian ca. SK-OV-3 | 26.1 | Colon ca. SW-48 | 5.9 
Ovarian ca. OVCAR-4 | 14.2 | Colon Pool | 9.0 
Ovarian ca. OVCAR-5 | 31.4 | Small Intestine Pool | 6.2 
Ovarian ca. IGROV-1 | 17.6 | Stomach Pool | 2.8 
Ovarian ca. OVCAR-8 | [illegible] | Bone Marrow Pool | 2.4 
Ovary | 7.2 | Fetal Heart | 3.6 
Breast ca. MCF-7 | 17.4 | Heart Pool | 5.1 
Breast ca. MDA-MB-231 | 80.1 | Lymph Node Pool | 10.1 
Breast ca. BT 549 | 38.2 | Fetal Skeletal Muscle | 2.0 
Breast ca. T47D | 100.0 | Skeletal Muscle Pool | 2.1 
Breast ca. MDA-N | 37.6 | Spleen Pool | 3.9 
Breast Pool | 11.9 | Thymus Pool | 6.4 
Trachea | 6.6 | CNS cancer (glio/astro) U87-MG | 55.9 
Lung | 0.4 | CNS cancer (glio/astro) U-118-MG | 38.4 
Fetal Lung | 12.0 | CNS cancer (neuro;met) SK-N-AS | 13.8 
Lung ca. NCI-N417 | 7.5 | CNS cancer (astro) SF-539 | [illegible] 
Lung ca. LX-1 | 12.0 | CNS cancer (astro) SNB-75 | [illegible] 
Lung ca. NCI-H146 | 17.2 | CNS cancer (glio) SNB-19 | [illegible] 
Lung ca. SHP-77 | 26.6 | CNS cancer (glio) SF-295 | [illegible] 
Lung ca. A549 | [illegible] | Brain (Amygdala) Pool | [illegible] 
Lung ca. NCI-H526 | 20.6 | Brain (cerebellum) | [illegible] 
Lung ca. NCI-H23 | 20.9 | Brain (fetal) | [illegible] 
Lung ca. NCI-H460 | 7.6 | Brain (Hippocampus) Pool | 10.5 
Lung ca. HOP-62 | 6.4 | Cerebral Cortex Pool | 15.2 
Lung ca. NCI-H522 | 47.3 | Brain (Substantia nigra) Pool | 19.8 
Liver | 2.4 | Brain (Thalamus) Pool | 16.3 
Fetal Liver | [illegible] | Brain (whole) | 17.9 
Liver ca. HepG2 | 3.8 | Spinal Cord Pool | 9.7 
Kidney Pool | 9.5 | Adrenal Gland | 11.2 
Fetal Kidney | 5.3 | Pituitary gland Pool | 2.5 
Renal ca. 786-0 | 15.3 | Salivary Gland | 6.7 
Renal ca. A498 | [illegible] | Thyroid (female) | 8.8 
Renal ca. ACHN | 7.3 | Pancreatic ca. CAPAN2 | 21.9 
Renal ca. UO-31 | 17.0 | Pancreas Pool | [illegible] 


Table AQE. Panel 1.2 

Tissue Name | Rel. Exp.(%) Ag1372, Run 134887810 | Tissue Name | Rel. Exp.(%) Ag1372, Run 134887810 
Endothelial cells | 13.1 | Renal ca. 786-0 | 2.6 
Heart (Fetal) | 47.6 | Renal ca. A498 | 10.0 
Pancreas | 0.5 | Renal ca. RXF 393 | 0.7 
Pancreatic ca. CAPAN 2 | 1.3 | Renal ca. ACHN | 7.5 
Adrenal Gland | 7.0 | Renal ca. UO-31 | 6.4 
Thyroid | 2.5 | Renal ca. TK-10 | 5.8 
Salivary gland | [illegible] | Liver | 8.5 
Pituitary gland | 0.2 | Liver (fetal) | 3.5 
Brain (fetal) | 1.6 | Liver ca. (hepatoblast) HepG2 | 6.8 
Brain (whole) | 8.0 | Lung | 1.4 
Brain (amygdala) | 0.2 | Lung (fetal) | 2.1 
Brain (cerebellum) | 5.3 | Lung ca. (small cell) LX-1 | 7.7 
Brain (hippocampus) | 32.3 | Lung ca. (small cell) NCI-H69 | 7.6 
Brain (thalamus) | 20.3 | Lung ca. (s.cell var.) SHP-77 | 1.1 
Cerebral Cortex | 100.0 | Lung ca. (large cell) NCI-H460 | 13.9 
Spinal cord | 3.7 | Lung ca. (non-sm. cell) A549 | 9.5 
glio/astro U87-MG | 15.4 | Lung ca. (non-s.cell) NCI-H23 | 9.7 
glio/astro U-118-MG | 10.1 | Lung ca. (non-s.cell) HOP-62 | 8.7 
astrocytoma SW1783 | 3.8 | Lung ca. (non-s.cl) NCI-H522 | 71.7 
neuro*; met SK-N-AS | 14.8 | Lung ca. (squam.) SW 900 | 7.3 
astrocytoma SF-539 | 2.8 | Lung ca. (squam.) NCI-H596 | [illegible] 
astrocytoma SNB-75 | 2.5 | Mammary gland | 3.4 
glioma SNB-19 | 12.6 | Breast ca.* (pl.ef) MCF-7 | 2.7 
glioma U251 | 0.0 | Breast ca.* (pl.ef) MDA-MB-231 | 6.4 
glioma SF-295 | 19.5 | Breast ca.* (pl.ef) T47D | 11.6 
Heart | 15.0 | Breast ca. BT-549 | 3.1 
Skeletal Muscle | 4.5 | Breast ca. MDA-N | 50.7 
Bone marrow | 2.5 | Ovary | 22.8 
Thymus | 1.7 | Ovarian ca. OVCAR-3 | 0.0 
Spleen | 3.6 | Ovarian ca. OVCAR-4 | 19.6 
Lymph node | 1.7 | Ovarian ca. OVCAR-5 | 25.2 
Colorectal Tissue | 2.1 | Ovarian ca. OVCAR-8 | 23.7 
Stomach | 9.7 | Ovarian ca. IGROV-1 | 6.9 
Small intestine | 14.5 | Ovarian ca. (ascites) SK-OV-3 | 10.4 
Colon ca. SW480 | 2.6 | Uterus | 3.7 
Colon ca.* SW620 (SW480 met) | 1.5 | Placenta | 32.8 
Colon ca. HT29 | 1.2 | Prostate | 23.0 
Colon ca. HCT-116 | 1.0 | Prostate ca.* (bone met) PC-3 | 52.5 
Colon ca. CaCo-2 | 3.7 | Testis | 10.4 
Colon ca. Tissue (OD03866) | 1.0 | Melanoma Hs688(A).T | 3.8 
Colon ca. HCC-2998 | 7.1 | Melanoma* (met) Hs688(B).T | 2.5 
Gastric ca.* (liver met) NCI-N87 | 7.5 | Melanoma UACC-62 | 29.3 
Bladder | 9.5 | Melanoma M14 | 9.3 
Trachea | 1.3 | Melanoma LOX IMVI | [illegible] 
Kidney | 50.3 | Melanoma* (met) SK-MEL-5 | 4.4 
Kidney (fetal) | 3.1 | | 


Table AQF. Panel 4.1D 

Tissue Name | Rel. Exp.(%) Ag4082, Run 171809988 | Tissue Name | Rel. Exp.(%) Ag4082, Run 171809988 
Secondary Th1 act | 44.1 | HUVEC IL-1beta | 29.3 
Secondary Th2 act | 55.5 | HUVEC IFN gamma | 24.7 
Secondary Tr1 act | 45.1 | HUVEC TNF alpha + IFN gamma | 29.5 
Secondary Th1 rest | 9.2 | HUVEC TNF alpha + IL4 | 47.0 
Secondary Th2 rest | 10.8 | HUVEC IL-11 | 13.9 
Secondary Tr1 rest | 10.8 | Lung Microvascular EC none | 58.2 
Primary Th1 act | 29.9 | Lung Microvascular EC TNFalpha + IL-1beta | 66.4 
Primary Th2 act | 47.6 | Microvascular Dermal EC none | 20.2 
Primary Tr1 act | 45.7 | Microvascular Dermal EC TNFalpha + IL-1beta | 37.1 
Primary Th1 rest | 9.2 | Bronchial epithelium TNFalpha + IL1beta | 31.2 
Primary Th2 rest | 5.8 | Small airway epithelium none | 9.2 
Primary Tr1 rest | 14.7 | Small airway epithelium TNFalpha + IL-1beta | 17.9 
CD45RA CD4 lymphocyte act | 40.6 | Coronary artery SMC rest | 16.0 
CD45RO CD4 lymphocyte act | 45.1 | Coronary artery SMC TNFalpha + IL-1beta | 18.3 
CD8 lymphocyte act | 51.8 | Astrocytes rest | 25.3 
Secondary CD8 lymphocyte rest | 60.7 | Astrocytes TNFalpha + IL-1beta | 27.0 
Secondary CD8 lymphocyte act | 29.5 | KU-812 (Basophil) rest | 13.2 
CD4 lymphocyte none | 6.9 | KU-812 (Basophil) PMA/ionomycin | 37.4 
2ry Th1/Th2/Tr1 anti-CD95 CH11 | 19.3 | CCD1106 (Keratinocytes) none | 31.2 
LAK cells rest | 31.4 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 23.0 
LAK cells IL-2 | 28.9 | Liver cirrhosis | 4.5 
LAK cells IL-2+IL-12 | 28.1 | NCI-H292 none | 40.3 
LAK cells IL-2+IFN gamma | 14.7 | NCI-H292 IL-4 | 0.2 
LAK cells IL-2+IL-18 | 25.9 | NCI-H292 IL-9 | 100.0 
LAK cells PMA/ionomycin | 27.4 | NCI-H292 IL-13 | 83.5 
NK Cells IL-2 rest | 55.9 | NCI-H292 IFN gamma | 99.3 
Two Way MLR 3 day | 45.1 | HPAEC none | 16.5 
Two Way MLR 5 day | 36.9 | HPAEC TNF alpha + IL-1 beta | 68.3 
Two Way MLR 7 day | 17.9 | Lung fibroblast none | 27.0 
PBMC rest | 9.6 | Lung fibroblast TNF alpha + IL-1 beta | 29.7 
PBMC PWM | 49.3 | Lung fibroblast IL-4 | 29.9 
PBMC PHA-L | 25.5 | Lung fibroblast IL-9 | 28.3 
Ramos (B cell) none | 35.4 | Lung fibroblast IL-13 | 26.4 
Ramos (B cell) ionomycin | 31.6 | Lung fibroblast IFN gamma | 32.8 
B lymphocytes PWM | 34.4 | Dermal fibroblast CCD1070 rest | 23.2 
B lymphocytes CD40L and IL-4 | 41.2 | Dermal fibroblast CCD1070 TNF alpha | 40.3 
EOL-1 dbcAMP | 12.8 | Dermal fibroblast CCD1070 IL-1 beta | 19.6 
EOL-1 dbcAMP PMA/ionomycin | 15.8 | Dermal fibroblast IFN gamma | 28.1 
Dendritic cells none | 24.5 | Dermal fibroblast IL-4 | 30.8 
Dendritic cells LPS | 37.6 | Dermal Fibroblasts rest | 24.8 
Dendritic cells anti-CD40 | 32.1 | Neutrophils TNFa+LPS | 3.5 
Monocytes rest | 11.7 | Neutrophils rest | 8.0 
Monocytes LPS | 33.7 | Colon | 9.0 
Macrophages rest | 72.2 | Lung | 14.9 
Macrophages LPS | 25.2 | Thymus | 11.3 
HUVEC none | 15.7 | Kidney | 24.0 
HUVEC starved | 23.3 | | 







CNS_neurodegeneration_v1.0 Summary: Ag4082 This panel does not show 
differential expression of the CG96624-01 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1.4 for 
discussion of utility of this gene in the central nervous system. 

General_screening_panel_v1.4 Summary: Ag4082 Highest expression of the 
CG96624-01 gene is seen in a breast cancer cell line (CT=25.8). Significant levels of 
expression are also seen in a cluster of samples derived from breast, brain, colon, lung, 
ovarian, prostate and melanoma cancer cell lines. 

In addition, this gene is expressed at much higher levels in fetal lung (CT=28.9) 
when compared to expression in the adult counterpart (CT=36.7). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 

The ubiquitous expression of this gene in this panel and the higher levels of 
expression in cancer cell lines and some fetal tissues suggest a role for this gene product in 
cell survival and proliferation. 

Among tissues with metabolic function, this gene is expressed at moderate levels 
in pituitary, adrenal gland, pancreas, thyroid, and adult and fetal skeletal muscle, heart, 
and liver, and at low but significant levels in adipose. This widespread expression among 
these tissues suggests that this gene product may play a role in normal neuroendocrine and 
metabolic function and that dysregulated expression of this gene may contribute to 
neuroendocrine disorders or metabolic diseases, such as obesity and diabetes. 

This gene is also expressed at high to moderate levels in the CNS, including the 
hippocampus, thalamus, substantia nigra, amygdala, cerebellum and cerebral cortex. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Panel 1.2 Summary: Ag4082 Highest expression of the CG96624-01 gene is seen 
in the cerebral cortex (CT=23.8). Expression of this gene is ubiquitous in this panel, with 
prominent levels of expression in lung and breast cancer cell lines and normal kidney 
(CT=24.8). Thus, expression of this gene could be used as a marker of lung and breast 
cancer and to differentiate kidney from fetal kidney (CT=28.8). Please see Panel 1.4 for 
further discussion of utility of this gene. 
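
By way of illustration only, the CT values quoted in the Panel 1.2 summary can be related to the relative expression percentages in the corresponding table. This section does not state the conversion, so the sketch below assumes the common convention that the amount of template doubles with each PCR cycle and that the sample with the lowest CT is set to 100%. The function name `relative_expression` is hypothetical and is introduced only for this example.

```python
def relative_expression(ct: float, ct_min: float) -> float:
    """Percent expression relative to the sample with the lowest CT.

    Assumption (not stated in this section): template doubles every cycle,
    so a difference of one CT corresponds to a two-fold difference.
    """
    return 100.0 * 2.0 ** (ct_min - ct)


# CT values quoted in the Panel 1.2 summary
ct_values = {"cerebral cortex": 23.8, "kidney": 24.8, "fetal kidney": 28.8}
ct_min = min(ct_values.values())

rel = {
    tissue: round(relative_expression(ct, ct_min), 1)
    for tissue, ct in ct_values.items()
}
print(rel)
```

Under this assumption the computed percentages for kidney and fetal kidney come out close to the 50.3 and 3.1 listed for those tissues in the Panel 1.2 table; small differences would be expected because the quoted CT values are rounded.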

Panel 4.1D Summary: Ag4082 Highest expression of the CG96624-01 gene is 
seen in IL-9 treated NCI-H292 cells (CT=29.6). In addition, this gene is expressed at 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_v1.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

AR. CG96747-01: Voltage-Dependent Calcium Channel Gamma-3 Subunit - Like 
Protein 

Expression of gene CG96747-01 was assessed using the primer-probe set Ag4076, 
described in Table ARA. Results of the RTQ-PCR runs are shown in Tables ARB and 
ARC. 



Table ARA. Probe Name Ag4076 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-agcgtgtctctgctgcttt-3' | 19 | 421 | 365 
Probe | TET-5'-tcaccggctgctacttcctgctg-3'-TAMRA | 23 | 440 | 366 
Reverse | 5'-aggtgcgagtagctgatgtaga-3' | 22 | 494 | 367 
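
As a simple consistency check on the primer-probe set Ag4076 listed in Table ARA, the listed lengths can be compared with the number of bases in each sequence. The following is a minimal sketch under stated assumptions: the helper `oligo_length` and the dictionary layout are hypothetical and serve only to illustrate the check; the sequences and lengths are those given in the table.

```python
import re


def oligo_length(annotated: str) -> int:
    """Count bases in an oligo string, ignoring dye labels and 5'/3' end marks."""
    without_labels = re.sub(r"TET|TAMRA", "", annotated)
    return sum(1 for ch in without_labels if ch in "acgtACGT")


# (sequence as printed, listed length) for primer-probe set Ag4076
ag4076 = {
    "Forward": ("5'-agcgtgtctctgctgcttt-3'", 19),
    "Probe": ("TET-5'-tcaccggctgctacttcctgctg-3'-TAMRA", 23),
    "Reverse": ("5'-aggtgcgagtagctgatgtaga-3'", 22),
}

# Names of any oligos whose counted length differs from the listed length
mismatches = [
    name for name, (seq, listed) in ag4076.items() if oligo_length(seq) != listed
]
print(mismatches)
```

An empty list indicates that every listed length agrees with its sequence; the same check could be applied to the other primer-probe tables.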



Table ARB. CNS_neurodegeneration_v1.0 

Tissue Name | Rel. Exp.(%) Ag4076, Run 214294983 | Tissue Name | Rel. Exp.(%) Ag4076, Run 214294983 
AD 1 Hippo | 15.1 | Control (Path) 3 Temporal Ctx | 0.0 
AD 2 Hippo | 71.2 | Control (Path) 4 Temporal Ctx | 13.3 
AD 3 Hippo | 6.4 | AD 1 Occipital Ctx | 9.1 
AD 4 Hippo | 7.1 | AD 2 Occipital Ctx (Missing) | 0.0 
AD 5 Hippo | 53.2 | AD 3 Occipital Ctx | 2.4 
AD 6 Hippo | 15.2 | AD 4 Occipital Ctx | 23.2 
Control 2 Hippo | 54.2 | AD 5 Occipital Ctx | 14.0 
Control 4 Hippo | 29.3 | AD 6 Occipital Ctx | [illegible] 
Control (Path) 3 Hippo | 3.5 | Control 1 Occipital Ctx | 11.7 
AD 1 Temporal Ctx | 17.0 | Control 2 Occipital Ctx | 74.7 
AD 2 Temporal Ctx | 43.8 | Control 3 Occipital Ctx | 9.0 
AD 3 Temporal Ctx | 0.0 | Control 4 Occipital Ctx | 11.3 
AD 4 Temporal Ctx | 13.6 | Control (Path) 1 Occipital Ctx | 97.9 
AD 5 Inf Temporal Ctx | 100.0 | Control (Path) 2 Occipital Ctx | 8.8 
AD 5 Sup Temporal Ctx | 48.0 | Control (Path) 3 Occipital Ctx | 0.9 
AD 6 Inf Temporal Ctx | 27.9 | Control (Path) 4 Occipital Ctx | 4.2 
AD 6 Sup Temporal Ctx | 17.9 | Control 1 Parietal Ctx | 32.8 
Control 1 Temporal Ctx | 16.7 | Control 2 Parietal Ctx | 48.3 
Control 2 Temporal Ctx | 50.3 | Control 3 Parietal Ctx | 14.0 
Control 3 Temporal Ctx | 8.8 | Control (Path) 1 Parietal Ctx | 42.9 
Control 4 Temporal Ctx | 25.9 | Control (Path) 2 Parietal Ctx | [illegible] 
Control (Path) 1 Temporal Ctx | 27.0 | Control (Path) 3 Parietal Ctx | [illegible] 
Control (Path) 2 Temporal Ctx | 7.0 | Control (Path) 4 Parietal Ctx | 5.8 


Table ARC. General_screening_panel_v1.4 

Tissue Name | Rel. Exp.(%) Ag4076, Run 218903895 | Tissue Name | Rel. Exp.(%) Ag4076, Run 218903895 
Adipose | 0.0 | Renal ca. TK-10 | 0.0 
Melanoma* Hs688(A).T | 0.0 | Bladder | 0.0 
Melanoma* Hs688(B).T | 0.0 | Gastric ca. (liver met.) NCI-N87 | 0.0 
Melanoma* M14 | 0.0 | Gastric ca. KATO III | 0.0 
Melanoma* LOXIMVI | 0.0 | Colon ca. SW-948 | 0.0 
Melanoma* SK-MEL-5 | 0.0 | Colon ca. SW480 | 0.0 
Squamous cell carcinoma SCC-4 | 0.0 | Colon ca.* (SW480 met) SW620 | 0.0 
Testis Pool | 0.0 | Colon ca. HT29 | 0.0 
Prostate ca.* (bone met) PC-3 | 0.0 | Colon ca. HCT-116 | 0.0 
Prostate Pool | 0.0 | Colon ca. CaCo-2 | 0.0 
Placenta | 0.0 | Colon cancer tissue | 0.0 
Uterus Pool | 0.0 | Colon ca. SW1116 | 0.0 
Ovarian ca. OVCAR-3 | 0.0 | Colon ca. Colo-205 | 0.0 
Ovarian ca. SK-OV-3 | 0.0 | Colon ca. SW-48 | 0.0 
Ovarian ca. OVCAR-4 | 0.0 | Colon Pool | 0.0 
Ovarian ca. OVCAR-5 | 0.0 | Small Intestine Pool | 0.0 
Ovarian ca. IGROV-1 | 0.0 | Stomach Pool | 0.0 
Ovarian ca. OVCAR-8 | 0.0 | Bone Marrow Pool | 0.0 
Ovary | 0.0 | Fetal Heart | 0.0 
Breast ca. MCF-7 | 0.0 | Heart Pool | 0.0 
Breast ca. MDA-MB-231 | 0.0 | Lymph Node Pool | 0.0 
Breast ca. BT 549 | 0.0 | Fetal Skeletal Muscle | 0.0 
Breast ca. T47D | 0.0 | Skeletal Muscle Pool | 0.0 
Breast ca. MDA-N | 0.0 | Spleen Pool | 0.0 
Breast Pool | 0.0 | Thymus Pool | 0.0 
Trachea | 0.0 | CNS cancer (glio/astro) U87-MG | 0.0 
Lung | 0.0 | CNS cancer (glio/astro) U-118-MG | 0.0 
Fetal Lung | 1.1 | CNS cancer (neuro;met) SK-N-AS | 0.0 
Lung ca. NCI-N417 | 0.0 | CNS cancer (astro) SF-539 | 0.0 
Lung ca. LX-1 | 0.0 | CNS cancer (astro) SNB-75 | 0.0 
Lung ca. NCI-H146 | 0.0 | CNS cancer (glio) SNB-19 | 0.0 
Lung ca. SHP-77 | 0.0 | CNS cancer (glio) SF-295 | 0.0 
Lung ca. A549 | 0.0 | Brain (Amygdala) Pool | 54.7 
Lung ca. NCI-H526 | 0.0 | Brain (cerebellum) | 37.1 
Lung ca. NCI-H23 | 0.0 | Brain (fetal) | 0.0 
Lung ca. NCI-H460 | 0.0 | Brain (Hippocampus) Pool | 41.2 
Lung ca. HOP-62 | 0.0 | Cerebral Cortex Pool | 38.2 
Lung ca. NCI-H522 | 0.0 | Brain (Substantia nigra) Pool | 100.0 
Liver | 0.0 | Brain (Thalamus) Pool | 45.7 
Fetal Liver | 0.0 | Brain (whole) | 18.2 
Liver ca. HepG2 | 0.0 | Spinal Cord Pool | 67.8 
Kidney Pool | 0.0 | Adrenal Gland | 0.0 
Fetal Kidney | 0.0 | Pituitary gland Pool | 0.0 
Renal ca. 786-0 | 0.0 | Salivary Gland | 0.0 
Renal ca. A498 | 0.0 | Thyroid (female) | 0.0 
Renal ca. ACHN | 0.0 | Pancreatic ca. CAPAN2 | 0.0 
Renal ca. UO-31 | 0.0 | Pancreas Pool | 0.0 



CNS_neurodegeneration_v1.0 Summary: Ag4076 This panel confirms the 
expression of the CG96747-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_v1.4 Summary: Ag4076 Highest expression of the 
CG96747-01 gene is detected in substantia nigra of brain (CT=30.1). Interestingly, 
expression of this gene is exclusive to the brain regions examined. Therefore, expression of 
this gene can be used to distinguish brain from the other samples used in this panel. 
Furthermore, therapeutic modulation of this gene through the use of a small molecule target 
may be useful in the treatment of neurological disorders such as Alzheimer's disease, 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

The CG96747-01 gene codes for a protein belonging to the PMP-22/EMP/MP20/Claudin 
family. Proteins belonging to this family are small, evolutionarily related integral 
membrane glycoproteins, including eye lens specific membrane protein 20 (MP20 or MP19), 
epithelial membrane protein-1 (EMP-1/TMP), epithelial membrane protein-2 (EMP-2), and 
peripheral myelin protein 22 (PMP-22). PMP-22 plays a role both in myelinization and in 
cell proliferation. Mutations affecting PMP-22 are associated with hereditary motor and 
sensory neuropathies such as Charcot-Marie-Tooth disease type 1A (CMT-1A) in human or 
the trembler phenotype in mice (Jetten AM, Suter U, 2000, Prog Nucleic Acid Res Mol Biol 
64:97-129, PMID: 10697408; PFAM: IPR004031). Thus, the CG96747-01 gene product may also 
play a role in hereditary motor and sensory neuropathies such as CMT-1A, and therapeutic 
modulation of this gene product may be useful in the treatment of this disease. 

Panel 4.1D Summary: Ag4076 Expression of the CG96747-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4076 Expression of the 
CG96747-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



AS. CG97462-01: Prohibitin 

Expression of gene CG97462-01 was assessed using the primer-probe set Ag4111, 
described in Table ASA. 

Table ASA. Probe Name Ag4111 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-ggtctccagggagcaattac-3' | 20 | 432 | 368 
Probe | TET-5'-ttacagagcgagcagccacctttg-3'-TAMRA | 24 | 452 | 369 
Reverse | 5'-atgtgtcaaggacacgtcatc-3' | 21 | 487 | 370 



CNS_neurodegeneration_v1.0 Summary: Ag4111 Expression of the CG97462-01 
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

General_screening_panel_v1.4 Summary: Ag4111 Expression of the CG97462-01 
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

Panel 4.1D Summary: Ag4111 Expression of the CG97462-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4111 Expression of the 
CG97462-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 



AT. CG97472-01: Glucose Transporter 

Expression of gene CG97472-01 was assessed using the primer-probe set Ag4113, 
described in Table ATA. Results of the RTQ-PCR runs are shown in Tables ATB, ATC, 
ATD and ATE. 

Table ATA. Probe Name Ag4113 

Primers | Sequences | Length | Start Position | SEQ ID No 
Forward | 5'-agctccttttctgttggactgt-3' | 22 | 235 | 371 
Probe | TET-5'-tcaacagctttgacaggcgtaattca-3'-TAMRA | 26 | 260 | 372 
Reverse | 5'-cagccaacaggttgacagtaag-3' | 22 | 289 | 373 



Table ATB. General_screening_panel_v1.4 

Tissue Name | Rel. Exp.(%) Ag4113, Run 219543072 | Tissue Name | Rel. Exp.(%) Ag4113, Run 219543072 
Adipose | 3.4 | Renal ca. TK-10 | 16.5 
Melanoma* Hs688(A).T | [illegible] | Bladder | 11.4 
Melanoma* Hs688(B).T | 1.0 | Gastric ca. (liver met.) NCI-N87 | 23.3 
Melanoma* M14 | 2.1 | Gastric ca. KATO III | 2.0 
Melanoma* LOXIMVI | 0.8 | Colon ca. SW-948 | 0.2 
Melanoma* SK-MEL-5 | 2.3 | Colon ca. SW480 | 5.0 
Squamous cell carcinoma SCC-4 | 0.0 | Colon ca.* (SW480 met) SW620 | 3.9 
Testis Pool | 5.5 | Colon ca. HT29 | 1.6 
Prostate ca.* (bone met) PC-3 | 0.0 | Colon ca. HCT-116 | 1.3 
Prostate Pool | [illegible] | Colon ca. CaCo-2 | [illegible] 
Placenta | [illegible] | Colon cancer tissue | [illegible] 
Uterus Pool | [illegible] | Colon ca. SW1116 | [illegible] 
Ovarian ca. OVCAR-3 | 4.3 | Colon ca. Colo-205 | [illegible] 
Ovarian ca. SK-OV-3 | [illegible] | Colon ca. SW-48 | [illegible] 
Ovarian ca. OVCAR-4 | [illegible] | Colon Pool | 37.1 
Ovarian ca. OVCAR-5 | [illegible] | Small Intestine Pool | 71.2 
Ovarian ca. IGROV-1 | [illegible] | Stomach Pool | 25.3 
Ovarian ca. OVCAR-8 | [illegible] | Bone Marrow Pool | [illegible] 
Ovary | [illegible] | Fetal Heart | [illegible] 
Breast ca. MCF-7 | [illegible] | Heart Pool | [illegible] 
Breast ca. MDA-MB-231 | 6.0 | Lymph Node Pool | 36.1 
Breast ca. BT-549 | 6.3 | Fetal Skeletal Muscle | 3.9 
Breast ca. T47D | 25.7 | Skeletal Muscle Pool | 2.8 
Breast ca. MDA-N | 3.1 | Spleen Pool | 5.5 
Breast Pool | 35.1 | Thymus Pool | 15.6 
Trachea | 8.7 | CNS cancer (glio/astro) U87-MG | 8.8 
Lung | 24.8 | CNS cancer (glio/astro) U-118-MG | 1.7 
Fetal Lung | 16.5 | CNS cancer (neuro;met) SK-N-AS | 5.1 
Lung ca. NCI-N417 | [illegible] | CNS cancer (astro) SF-539 | [illegible] 
Lung ca. LX-1 | [illegible] | CNS cancer (astro) SNB-75 | [illegible] 
Lung ca. NCI-H146 | 0.2 | CNS cancer (glio) SNB-19 | [illegible] 
Lung ca. SHP-77 | 1.7 | CNS cancer (glio) SF-295 | 7.6 
Lung ca. A549 | 4.4 | Brain (Amygdala) Pool | 4.1 
Lung ca. NCI-H526 | 1.9 | Brain (cerebellum) | 3.6 
Lung ca. NCI-H23 | 9.9 | Brain (fetal) | 8.7 
Lung ca. NCI-H460 | 5.9 | Brain (Hippocampus) Pool | 7.7 
Lung ca. HOP-62 | 5.3 | Cerebral Cortex Pool | 8.7 
Lung ca. NCI-H522 | 3.8 | Brain (Substantia nigra) Pool | 7.9 
Liver | 0.4 | Brain (Thalamus) Pool | [illegible] 
Fetal Liver | 1.7 | Brain (whole) | 6.0 
Liver ca. HepG2 | 0.0 | Spinal Cord Pool | [illegible] 
Kidney Pool | 100.0 | Adrenal Gland | 11.0 
Fetal Kidney | 12.9 | Pituitary gland Pool | 6.3 
Renal ca. 786-0 | 3.6 | Salivary Gland | 2.4 
Renal ca. A498 | 0.0 | Thyroid (female) | 2.6 
Renal ca. ACHN | 2.7 | Pancreatic ca. CAPAN2 | 9.8 
Renal ca. UO-31 | 0.5 | Pancreas Pool | 30.4 


Table ATC. Panel 4.1D 




Tissue Name 


Rel. Exp.(%) 
Ag4113, Run 
172792545 


Tissue Name 


Exp.(%) 
Ag4113, 
Run 

J 12. /yZJHJ 




0.0 


HUVEC IL-ibeta 


2.8 


jsecondary Th2 act 


3.8 


HUVEC 1FN gamma 


41.2 | 


jSecondary Trl act 


1.8 IHUVEC TNF alpha + 1FN gamma 


9.7 


Secondary Th I rest 


12.0 


HUN * rNFalph - IL4 


1.4 


Secondary Th2 rest 


21.8 


HUVEC IL-ll 


6.1 


Secondary Trl rest 


24.7 


Lung Microvascular EC none 


21.8 


Primary Thl act 


5.3 


Lung Microvascular EC TNFalpha 
+ IL-lbeta 


31.6 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


4.0 


Primary Trl act 


0.3 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


14.2 


Primary Thl rest 


8.7 


Bronchial epithelium TNFalpha + 
ILlbeta 


5.7 


Primary Th2 rest 


0.1 


Small airway epithelium none 


0.0 


Primary Trl rest 


9.6 


Small airway epithelium 
TNFalpha + I L-l beta 


0.0 


CD45RA CD4 lymphocyte 
act 


10.3 


Coronery artery SMC rest 


0.0 


CD45RO CD4 lymphocyte 
act 


24.5 


Coronery artery SMC TNFalpha + 
IL-lbeta 


3.3 


CD8 lymphocyte act 


4.4 


Astrocytes rest 


1.3 


Secondary CD8 
lymphocyte rest 


0.6 


Astrocytes TNFalpha + IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


23.8 
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CD4 lymphocyte none 


5.3 


KU-8 12 (Basophil) 
PMA/ionomycin 


23.0 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 




CCD1 106 (Keratinocytes) none 


0.0 


LAK cells rest 


12.9 


CCD1106(Keratinocytes) 
TNFalpha + IL-lbeta 


0.2 


LAK cells IL-2 


.... _. 


Liver cirrhosis 


3.6 


LAK cells IL-2+IL-12 


Jir 


NCI-H292 none 


12.9 


LAK cells IL-2+IFN 
gamma 


5.5 


NCI-H292 IL-4 


4.5 


LAK cells IL-2+IL-18 


10.4 


NCI-H292 IL-9 


17.3 


LAK cells PMA/ionomycin 


0.0 


NCI-H292 IL-13 


12.3 


NK Cells IL-2 rest 


17.6 


NCI-H292 I FN gamma 


21.9 


Two Way MLR 3 day 


20.3 


HPAEC none 


14.8 


Two Way MLR 5 day 


10.9 


HPAEC TNF alpha + IL-1 beta 


17.2 


Two Way MLR 7 day 


0.9 


Lung Fibroblast none 


1.5 


PBMC rest 


2.5 


Lung fibroblast TNF alpha + IL-1 
beta 


5.4 


PBMC PWM 


7.3 


Lung fibroblast IL-4 


4.0


PBMC PHA-L 


10.7 


Lung fibroblast IL-9 


3.5


Ramos (B cell) none 


5.7


Lung fibroblast IL-13 


10.2 


Ramos (B cell) ionomycin


1.5


Lung fibroblast IFN gamma 


4.8 


B lymphocytes PWM 


1.5 


Dermal fibroblast CCD1070 rest 


12.2 


B lymphocytes CD40L and 
IL-4 


0.9 


Dermal fibroblast CCD1070 TNF 
alpha 


12.8 


EOL-1 dbcAMP 


6.0 


Dermal fibroblast CCD1070 IL-1
beta 


0.4 


EOL-1 dbcAMP 
PMA/ionomycin 


3.1 


Dermal fibroblast IFN gamma 


2.3 


Dendritic cells none 


9.6 


Dermal fibroblast IL-4 


2.5 


Dendritic cells LPS 


21.5 


Dermal Fibroblasts rest 


11.5


Dendritic cells anti-CD40 


5.3 


Neutrophils TNFa+LPS 


11.1 


Monocytes rest 


14.0 


Neutrophils rest 


92.7 


Monocytes LPS 


88.9 


Colon 


24.7 


Macrophages rest 


8.7 


Lung 


29.7 


Macrophages LPS 


6.5 


Thymus 


56.6 


HUVEC none


3.7 


Kidney


100.0 


HUVEC starved 


3.1 






Table ATD. Panel 5D

Tissue Name | Rel. Exp.(%) Ag4113, Run 172784076 | Tissue Name | Rel. Exp.(%) Ag4113, Run 172784076
97457_Patient-02go_adipose | 71.2 | 94709_Donor 2 AM - A_adipose | 0.0
97476_Patient-07sk_skeletal muscle | 13.2 | 94710_Donor 2 AM - B_adipose | 0.0
97477_Patient-07ut_uterus | 35.4 | 94711_Donor 2 AM - C_adipose | 0.0
97478_Patient-07pl_placenta | 0.0 | 94712_Donor 2 AD - A_adipose | 36.3
97481_Patient-08sk_skeletal muscle | 40.6 | 94713_Donor 2 AD - B_adipose | 12.4
97482_Patient-08ut_uterus | 9.4 | 94714_Donor 2 AD - C_adipose | 29.3
97483_Patient-08pl_placenta | 22.5 | 94742_Donor 3 U - A_Mesenchymal Stem Cells | 0.0
97486_Patient-09sk_skeletal muscle | 0.0 | 94743_Donor 3 U - B_Mesenchymal Stem Cells | 6.8
97487_Patient-09ut_uterus | 30.8 | 94730_Donor 3 AM - A_adipose | 0.0
97488_Patient-09pl_placenta | 0.0 | 94731_Donor 3 AM - B_adipose | 0.0
97492_Patient-10ut_uterus | 32.5 | 94732_Donor 3 AM - C_adipose |
97493_Patient-10pl_placenta | 29.3 | 94733_Donor 3 AD - A_adipose | 10.6
97495_Patient-11go_adipose | 100.0 | 94734_Donor 3 AD - B_adipose | 11.6
97496_Patient-11sk_skeletal muscle | 23.2 | 94735_Donor 3 AD - C_adipose | 0.0
97497_Patient-11ut_uterus | 26.4 | 77138_Liver_HepG2untreated | 0.0
97498_Patient-11pl_placenta | 12.9 | 73556_Heart_Cardiac stromal cells (primary) | 15.7
97500_Patient-12go_adipose | 97.9 | 81735_Small Intestine | 44.1
97501_Patient-12sk_skeletal muscle | 82.9 | 72409_Kidney_Proximal Convoluted Tubule | 0.0
97502_Patient-12ut_uterus | 15.7 | 82685_Small intestine_Duodenum | 21.6
97503_Patient-12pl_placenta | 25.2 | 90650_Adrenal_Adrenocortical adenoma | 0.0
94721_Donor 2 U - A_Mesenchymal Stem Cells | 0.0 | 72410_Kidney_HRCE | 37.1
94722_Donor 2 U - B_Mesenchymal Stem Cells | 0.0 | 72411_Kidney_HRE | 6.9
94723_Donor 2 U - C_Mesenchymal Stem Cells | 14.7 | 73139_Uterus_Uterine smooth muscle cells | 0.0



Table ATE. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4113, Run 268389993 | Tissue Name | Rel. Exp.(%) Ag4113, Run 268389993
Colon cancer 1 | | Bladder cancer NAT 2 | 1.0
Colon cancer NAT 1 | 1.8 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 2.6 | Bladder cancer NAT 4 | 3.1
Colon cancer NAT 2 | | Adenocarcinoma of the prostate 1 | 70.2
Colon cancer 3 | [illegible] | Adenocarcinoma of the prostate 2 | 3.7
Colon cancer NAT 3 | [illegible] | Adenocarcinoma of the prostate 3 | 11.0
Colon malignant cancer 4 | 7.7 | Adenocarcinoma of the prostate 4 | 16.8
Colon normal adjacent tissue 4 | 2.6 | Prostate cancer NAT 5 | 1.6
Lung cancer 1 | 6.1 | Adenocarcinoma of the prostate 6 | 3.1
Lung NAT 1 | 1.5 | Adenocarcinoma of the prostate 7 | 7.3
Lung cancer 2 | 9.1 | Adenocarcinoma of the prostate 8 | 2.9
Lung NAT 2 | 4.9 | Adenocarcinoma of the prostate 9 | 37.4
Squamous cell carcinoma 3 | 10.2 | Prostate cancer NAT 10 | 3.3
Lung NAT 3 | 0.6 | Kidney cancer 1 | 35.6
metastatic melanoma 1 | 46.3 | Kidney NAT 1 | 8.9
Melanoma 2 | 0.5 | Kidney cancer 2 | 46.3
Melanoma 3 | 0.9 | Kidney NAT 2 | 9.2
metastatic melanoma 4 | 90.8 | Kidney cancer 3 | 29.1
metastatic melanoma 5 | 100.0 | Kidney NAT 3 | 5.6
Bladder cancer 1 | 2.2 | Kidney cancer 4 | 5.8
Bladder cancer NAT 1 | [illegible] | Kidney NAT 4 | 5.3
Bladder cancer 2 | 6.0 | |







CNS_neurodegeneration_v1.0 Summary: Ag4113 Results from one experiment
with the CG97472-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.



General_screening_panel_v1.4 Summary: Ag4113 Highest expression of the
CG97472-01 gene is detected in kidney (CT=29.5). Therefore, therapeutic modulation of
this gene may be useful in the treatment of kidney-related diseases, including lupus and
glomerulonephritis.

Among tissues with metabolic or endocrine function, this gene is expressed at low
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

In addition, this gene is expressed at moderate to low levels in all regions of the
central nervous system examined, including amygdala, hippocampus, substantia nigra,
thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a
role in central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Low-level expression of this gene is also seen in a cluster of ovarian, breast,
lung, renal, colon and CNS cancers. Therefore, therapeutic modulation of this gene may be
useful in the treatment of these cancers.

Panel 4.1D Summary: Ag4113 Highest expression of the CG97472-01 gene is
detected in kidney (CT=31). This gene is expressed at low to moderate levels in a wide
range of cell types of significance in the immune response in health and disease. These
cells include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and
peripheral blood mononuclear cell families, as well as epithelial and fibroblast cell types
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney.
This ubiquitous pattern of expression suggests that this gene product may be involved in
homeostatic processes for these and other cell types and tissues. This pattern is in
agreement with the expression profile in General_screening_panel_v1.4 and also suggests
a role for the gene product in cell survival and proliferation. Therefore, modulation of the
gene product with a functional therapeutic may lead to the alteration of functions
associated with these cell types and lead to improvement of the symptoms of patients
suffering from autoimmune and inflammatory diseases such as asthma, allergies,
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and
osteoarthritis.

Panel 5D Summary: Ag4113 Highest expression of the CG97472-01 gene is
detected in adipose (CT=34.4). Low-level expression of this gene is seen exclusively in
adipose and skeletal muscle samples. Therefore, expression of this gene can be used to
distinguish these samples from other samples used in this panel. Please see Panel 1.4 for a
discussion of the potential utility of this gene.

General oncology screening panel_v_2.4 Summary: Ag4113 Highest
expression of the CG97472-01 gene is detected in a metastatic melanoma sample (CT=31).
Interestingly, significant expression of this gene is seen in a number of cancer samples,
including kidney cancer, metastatic melanoma, prostate adenocarcinoma and lung cancer.
Therefore, expression of this gene may be used as a diagnostic marker for these cancers, and
therapeutic modulation of this gene may be useful in the treatment of these cancers.

AU. CG97528-01: Guanylate Binding Protein

Expression of gene CG97528-01 was assessed using the primer-probe set Ag4107, 
described in Table AUA. Results of the RTQ-PCR runs are shown in Tables AUB and 
AUC. 

Table AUA. Probe Name Ag4107

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-aacaggcccgagtactaaagg-3' | 21 | 1814 | 374
Probe | TET-5'-tgccaaggtgaaagtacccaacttca-3'-TAMRA | 26 | 1840 | 375
Reverse | 5'-tcagggtcttctgtagcttttg-3' | 22 | 1876 | 376
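As a quick consistency check on Table AUA, the listed lengths can be compared against the transcribed sequences. The Python sketch below is illustrative only: the helper names are hypothetical, and it assumes that each Start Position is the 1-based coordinate of the first base the oligo covers on the target sequence, which this section does not state.

```python
# Rows transcribed from Table AUA: (name, sequence, listed length, listed start position).
PRIMER_SET_AG4107 = [
    ("Forward", "aacaggcccgagtactaaagg", 21, 1814),
    ("Probe", "tgccaaggtgaaagtacccaacttca", 26, 1840),
    ("Reverse", "tcagggtcttctgtagcttttg", 22, 1876),
]


def length_mismatches(rows):
    """Return the names of oligos whose sequence length differs from the listed length."""
    return [name for name, seq, listed, _start in rows if len(seq) != listed]


def covered_span(rows):
    """Return (first, last) coordinates covered by the set.

    Assumption (not stated in the source): each oligo occupies
    start .. start + length - 1 on the target sequence.
    """
    first = min(start for _name, _seq, _listed, start in rows)
    last = max(start + listed - 1 for _name, _seq, listed, start in rows)
    return first, last


mismatches = length_mismatches(PRIMER_SET_AG4107)
span = covered_span(PRIMER_SET_AG4107)
```

Under that coordinate assumption, the three oligos sit within a single short stretch of the target, which is what one would expect of a probe flanked by its two primers.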



Table AUB. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4107, Run 219447017 | Tissue Name | Rel. Exp.(%) Ag4107, Run 219447017
Adipose | 5.5 | Renal ca. TK-10 | 3.3
Melanoma* Hs688(A).T | 12.5 | Bladder | 19.5
Melanoma* Hs688(B).T | 11.0 | Gastric ca. (liver met.) NCI-N87 | 100.0
Melanoma* M14 | 10.8 | Gastric ca. KATO III | 24.0
Melanoma* LOXIMVI | 3.4 | Colon ca. SW-948 | 4.0
Melanoma* SK-MEL-5 | 0.0 | Colon ca. SW480 | 6.1
Squamous cell carcinoma SCC-4 | 4.0 | Colon ca.* (SW480 met) SW620 | 0.0
Testis Pool | 1.3 | Colon ca. HT29 | 4.0
Prostate ca.* (bone met) PC-3 | 11.6 | Colon ca. HCT-116 | 0.6
Prostate Pool | 4.9 | Colon ca. CaCo-2 | 0.4
Placenta | 0.7 | Colon cancer tissue | 11.8
Uterus Pool | 3.0 | Colon ca. SW1116 | 0.5
Ovarian ca. OVCAR-3 | 3.8 | Colon ca. Colo-205 | 0.6
Ovarian ca. SK-OV-3 | 6.9 | Colon ca. SW-48 | 0.7
Ovarian ca. OVCAR-4 | 2.8 | Colon Pool | 7.2
Ovarian ca. OVCAR-5 | 22.8 | Small Intestine Pool | 5.3
Ovarian ca. IGROV-1 | 1.9 | Stomach Pool | 2.8
Ovarian ca. OVCAR-8 | 1.4 | Bone Marrow Pool | 3.3
 | 4.3 | Fetal Heart | 1.1
Breast ca. MCF-7 | 1.2 | Heart Pool | 2.5
Breast ca. MDA-MB-231 | 40.9 | Lymph Node Pool | 6.8
Breast ca. BT 549 | 5.8 | Fetal Skeletal Muscle | 0.7
Breast ca. T47D | 29.3 | Skeletal Muscle Pool | 2.7
Breast ca. MDA-N | 1.1 | Spleen Pool | 12.2
Breast Pool | 5.6 | Thymus Pool | 5.2
Trachea | 5.2 | CNS cancer (glio/astro) U87-MG | 0.7
Lung | 0.8 | CNS cancer (glio/astro) U-118-MG | 40.6
Fetal Lung | 14.4 | CNS cancer (neuro;met) SK-N-AS | 2.6
Lung ca. NCI-N417 | 0.0 | CNS cancer (astro) SF-539 | 8.2
Lung ca. LX-1 | 0.9 | CNS cancer (astro) SNB-75 | 17.9
Lung ca. NCI-H146 | 0.0 | CNS cancer (glio) SNB-19 | 1.5
Lung ca. SHP-77 | 0.0 | CNS cancer (glio) SF-295 | 40.1
Lung ca. A549 | 0.5 | Brain (Amygdala) Pool | 0.4
Lung ca. NCI-H526 | 0.0 | Brain (cerebellum) | 0.1
Lung ca. NCI-H23 | 0.2 | Brain (fetal) | 0.3
Lung ca. NCI-H460 | 0.2 | Brain (Hippocampus) Pool | 0.7
Lung ca. HOP-62 | 4.0 | Cerebral Cortex Pool | 0.8
Lung ca. NCI-H522 | 0.2 | Brain (Substantia nigra) Pool | 0.3
 | 0.2 | Brain (Thalamus) Pool | 1.1
Fetal Liver | 0.7 | Brain (whole) | 0.3
Liver ca. HepG2 | 0.1 | Spinal Cord Pool | 2.1
Kidney Pool | 9.6 | Adrenal Gland | 2.1
Fetal Kidney | 4.7 | Pituitary gland Pool | 0.8
Renal ca. 786-0 | 13.0 | Salivary Gland | 0.3
Renal ca. A498 | 3.8 | Thyroid (female) | 3.2
Renal ca. ACHN | 2.8 | Pancreatic ca. CAPAN2 | 1.9
Renal ca. UO-31 | 6.0 | Pancreas Pool | 8.7

Table AUC. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4107, Run 268623659 | Tissue Name | Rel. Exp.(%) Ag4107, Run 268623659
Colon cancer 1 | 22.4 | Bladder cancer NAT 2 | 0.8
Colon NAT 1 | 13.9 | Bladder cancer NAT 3 | 1.3
Colon cancer 2 | 36.1 | Bladder cancer NAT 4 | 2.4
Colon cancer NAT 2 | 19.5 | Adenocarcinoma of the prostate 1 | 69.7
Colon cancer 3 | 76.3 | Adenocarcinoma of the prostate 2 | 6.5
Colon cancer NAT 3 | 66.4 | Adenocarcinoma of the prostate 3 | 19.8
Colon malignant cancer 4 | 64.6 | Adenocarcinoma of the prostate 4 | 51.1
Colon normal adjacent tissue 4 | | Prostate cancer NAT 5 |
Lung cancer 1 | 23.3 | Adenocarcinoma of the prostate 6 | 4.8
Lung NAT 1 | 5.8 | Adenocarcinoma of the prostate 7 | 6.4
Lung cancer 2 | 11.3 | Adenocarcinoma of the prostate 8 | 1.6
Lung NAT 2 | 5.4 | Adenocarcinoma of the prostate 9 | 28.5
Squamous cell carcinoma 3 | 21.9 | Prostate cancer NAT 10 | 1.7
Lung NAT 3 | 2.7 | Kidney cancer 1 |
metastatic melanoma 1 | | Kidney NAT 1 | 18.7
Melanoma 2 | 0.8 | Kidney cancer 2 | 100.0
Melanoma 3 | 2.4 | Kidney NAT 2 | 13.2
metastatic melanoma 4 | 50.7 | Kidney cancer 3 | 94.0
metastatic melanoma 5 | 47.6 | Kidney NAT 3 | 11.9
Bladder cancer 1 | 3.3 | Kidney cancer 4 | 16.5
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 3.1
Bladder cancer 2 | 9.2 | |







CNS_neurodegeneration_v1.0 Summary: Ag4107 Results from one experiment
with the CG97528-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.



General_screening_panel_v1.4 Summary: Ag4107 Highest expression of the
CG97528-01 gene is detected in a gastric cancer cell line (CT=25). High expression of this
gene is also seen in a cluster of cancer cell lines, including CNS, colon, renal, breast,
ovarian, prostate, squamous cell carcinoma and melanoma. Therefore, therapeutic
modulation of this gene may be useful in the treatment of these cancers.

Significant expression is also detected in fetal and adult lung. Interestingly, this
gene is expressed at much higher levels in fetal lung (CT=27.8) than in adult lung
(CT=32). This observation suggests that expression of this gene can be used to distinguish
fetal from adult lung. In addition, the relative overexpression of this gene in fetal lung
suggests that the protein product may enhance lung growth or development in the fetus
and thus may also act in a regenerative capacity in the adult. Therefore, therapeutic
modulation of the protein encoded by this gene could be useful in the treatment of lung-related
diseases.
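The CT values quoted above can be related to the Rel. Exp.(%) column with a short calculation. The Python sketch below is an illustration under stated assumptions, not the procedure used to generate the tables: it assumes perfect amplification efficiency (one doubling per cycle) and normalization to the highest-expressing sample on the panel, here the gastric cancer cell line at CT=25. The function name is hypothetical, and because the quoted CT values are rounded, agreement with the tabulated percentages can only be approximate.

```python
def relative_expression(ct, ct_reference):
    """Percent expression relative to a reference sample.

    Assumes one doubling of product per PCR cycle, so each extra cycle
    needed to reach threshold halves the inferred starting amount.
    """
    return 100.0 * 2.0 ** (ct_reference - ct)


CT_GASTRIC = 25.0     # highest-expressing sample quoted in the summary
CT_FETAL_LUNG = 27.8  # quoted in the summary
CT_ADULT_LUNG = 32.0  # quoted in the summary

fetal_pct = relative_expression(CT_FETAL_LUNG, CT_GASTRIC)
adult_pct = relative_expression(CT_ADULT_LUNG, CT_GASTRIC)

# Fold difference between fetal and adult lung under the same assumption.
fold_fetal_over_adult = 2.0 ** (CT_ADULT_LUNG - CT_FETAL_LUNG)

print(f"fetal lung: {fetal_pct:.1f}%  adult lung: {adult_pct:.1f}%")
print(f"fetal/adult fold difference: {fold_fetal_over_adult:.1f}")
```

Under these assumptions the computed percentages round to 14.4 and 0.8, which are the values listed for Fetal Lung and Lung in Table AUB.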

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

In addition, this gene is expressed at moderate to low levels in all regions of the
central nervous system examined, including amygdala, hippocampus, substantia nigra,
thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a
role in central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4107 Results from one experiment with the CG97528-01
gene are not included. The amp plot indicates that there were experimental difficulties
with this run.

General oncology screening panel_v_2.4 Summary: Ag4107 Highest
expression of the CG97528-01 gene is detected in kidney cancer (CT=27). Interestingly,
expression of this gene is higher in a number of cancer samples, including kidney cancer,
adenocarcinoma of the prostate, melanoma, lung cancer and colon cancer. Therefore,
therapeutic modulation of this gene may be useful in the treatment of these cancers.
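The tumor-versus-margin comparison behind this summary can be made explicit with a small calculation on values from Table AUC. The Python sketch below is illustrative: the function name is hypothetical, and it assumes that a cancer sample and the NAT sample carrying the same number form a matched pair, which this section does not state.

```python
# (tumor Rel. Exp. %, NAT Rel. Exp. %) taken from Table AUC.
# Pairing by sample number is an assumption made for illustration.
PAIRS = {
    "Kidney 2": (100.0, 13.2),
    "Colon 2": (36.1, 19.5),
}


def tumor_to_nat_ratio(tumor_pct, nat_pct):
    """Return the tumor/NAT expression ratio, or None when NAT is not detected."""
    if nat_pct <= 0.0:
        return None
    return tumor_pct / nat_pct


ratios = {name: tumor_to_nat_ratio(t, n) for name, (t, n) in PAIRS.items()}

for name, ratio in ratios.items():
    print(f"{name}: tumor/NAT = {ratio:.1f}")
```

Returning None for an undetected margin sample avoids a division by zero and keeps "not detected" distinct from a numeric ratio.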



AV. CG97629-01: Cell Division Protein Kinase 7 

Expression of gene CG97629-01 was assessed using the primer-probe set Ag4122,
described in Table AVA.

Table AVA. Probe Name Ag4122

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-gggacattttgctacagcctat-3' | 22 | 142 | 377
Probe | TET-5'-agaacaccaaccaaatcgtcaccatt-3'-TAMRA | 26 | 177 | 378
Reverse | 5'-agcttctgacctgtgtccaa-3' | 20 | 216 | 379



CNS_neurodegeneration_v1.0 Summary: Ag4122 Expression of the CG97629-01
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not
shown).



General_screening_panel_v1.4 Summary: Ag4122 Expression of the CG97629-01
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not
shown).

Panel 4.1D Summary: Ag4122 Expression of the CG97629-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel CNS_1 Summary: Ag4122 Expression of the CG97629-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

General oncology screening panel_v_2.4 Summary: Ag4122 Expression of the 
CG97629-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(data not shown). 

AW. CG97648-01: G Protein-Coupled Receptor Kinase GRK7

Expression of gene CG97648-01 was assessed using the primer-probe sets Ag3046
and Ag4125, described in Tables AWA and AWB. Results of the RTQ-PCR runs are
shown in Tables AWC, AWD, AWE and AWF.

Table AWA. Probe Name Ag3046

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-gaagcaaagaactctgcaagac-3' | 22 | 1281 | 380
Probe | TET-5'-ttccagcatgataacttcacagagga-3'-TAMRA | 26 | 1312 | 381
Reverse | 5'-gagcctgcaaatatcttttgct-3' | 22 | 1338 | 382



Table AWB. Probe Name Ag4125

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-gaagcaaagaactctgcaagac-3' | 22 | 1281 | 383
Probe | TET-5'-ttccagcatgataacttcacagagga-3'-TAMRA | 26 | 1312 | 384
Reverse | 5'-gagcctgcaaatatcttttgct-3' | 22 | 1338 | 385



Table AWC. Panel 1.3D

Tissue Name


Rel. 

Exp.(%) 
Ag3046, 
Run 

165533236 


Rel. 

Exp.(%) 
Ag3046, 
Run 

165724481 




Rel. 

Exp.(%) 
Ag3046, 
Run 

165533236 


Rel. Exp.(%) 
Ag3046, Run 
165724481 


Liver 

adenocarcinoma 


27.5 


0.0 


Kidney (fetal) 


0.0 


0.0 


Pancreas 


0.0 


72.2 


Renal ca. 786-0 


0.0 


0.0 


Pancreatic ca.
CAPAN 2


0.0 


Renal ca. A498 


43.2 


0.0 


Adrenal gland


0.0


0.0 


Renal ca. RXF 393 


0.0 


0.0 


Thyroid


0.0


0.0 


Renal ca. ACHN 


0.0 


0.0 


Salivary gland


0.0


25.5 


Renal ca. UO-31


0.0 


0.0 


Pituitary gland


0.0


0.0 


Renal ca. TK-10


0.0


0.0 


Brain (fetal)


38.4


0.0 


Liver 


30.6


0.0 


Brain (whole)


31.4


0.0


Liver (fetal)


0.0 


0.0 


Brain (amygdala)


41.8


0.0


Liver ca.
(hepatoblast)
HepG2


0.0 


0.0 


Brain (cerebellum)


0.0


0.0


Lung


0.0 


0.0 


Brain (hippocampus)


37.1


0.0


Lung (fetal)


0.0


0.0


Brain (substantia
nigra)


0.0


22.8


Lung ca. (small
cell) LX-1


0.0


0.0


Brain (thalamus) 


73.2 


12.8 


Lung ca. (small
cell) NCI-H69


0.0 


0.0 


Cerebral Cortex 


0.0 


0.0 


Lung ca. (s.cell 
var.) SHP-77 


0.0 


0.0 


Spinal cord 


0.0 


0.0 


Lung ca. (large 
cell)NCI-H460 


0.0 


0.0 


glio/astro U87-MG 


0.0 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


0.0 


glio/astro U-118-MG


0.0 


0.0 


Lung ca. (non- 
s.cell)NCI-H23 


33.7 


0.0 


astrocytoma SW1783 


0.0 


0.0 


Lung ca. (non- 
s.cell) HOP-62 


0.0 


0.0 


neuro*; met SK-N-
AS


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


0.0 


astrocytoma SF-539 


0.0 


0.0 


Lung ca. (squam.) 
SW900 


0.0 


35.4 


astrocytoma SNB-75 


0.0 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


0.0 


glioma SNB-19


0.0 


0.0 


Mammary gland 


0.0 


0.0 


glioma U251 


0.0 


0.0 


Breast ca.* (pl.ef) 


0.0 


100.0 










MCF-7


glioma SF-295 


0.0


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231


j 

).0 0.0 


Heart (fetal) 


3.0 


24.7 


Breast ca.* (pl.ef)
T47D


3.0 0.0 


Heart 


3.0 


0.0 


Breast ca. BT-549 


3.0 | 


3 4.6 


Skeletal muscle 
(fetal) 


3.0 


0.0 


Breast ca. MDA-N 


3.0 | 


3.0 


Skeletal muscle 0.0 






Ovary 


3.0 j 




Bone marrow 


0.0 


- jOvarian ca. 
f 95 jOVCAR-3 


37.9 


3.0 


Thymus 


0.0 


_ iOvarian ca. 
i 52 ' 5 joVCAR-4 


0.0 


0.0 


Spleen 


0.0 


j" Ovarian ca. 

OVCAR-5


0.0 


0.0 


Lymph node 


100.0 


'94.6 

i 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


46.7 


'82.9 


Ovarian ca. 
IGROV-1


0.0 


0.0 


Stomach 


0.0 


,0.0 


Ovarian ca.* 
(ascites) SK-OV-3 


0.0 


39.8 


Small intestine 


0.0 


10. 0 


Uterus 


0.0 


0.0 


Colon ca. SW480 


0.0 


0.0 


Placenta 


41.2 


12.6 


Colon ca.* 
SW620(SW480 met) 


0.0 


0.0 


Prostate 


0.0 


33.2 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


0.0 


Colon ca. HCT-116 


0.0 


0.0 


Testis 






Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


p - 
0.0 


Colon ca. 
tissue(OD03866) 


0.0 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


0.0 


Colon ca. HCC-2998 


0.0 


0.0 


Melanoma UACC- 
62 


0.0 


0.0


Gastric ca.* (liver 
met)NCI-N87 


0.0 


0.0 


Melanoma M14 


0.0 


0.0 


Bladder 


45.1 


0.0 


Melanoma LOX 
IMVI 


0.0 


0.0 


Trachea 


64.2 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 


0.0 


Kidney 


40.9 


0.0 


Adipose 


0.0 


28.7 



Table AWD. Panel 2D

Tissue Name | Rel. Exp.(%) Ag3046, Run 162559104 | Tissue Name | Rel. Exp.(%) Ag3046, Run 162559104
Normal Colon | 0.0 | Kidney Margin 8120608 | 0.0
CC Well to Mod Diff (OD03866) | 0.0 | Kidney Cancer 8120613 | 0.0
CC Margin (OD03866) | 0.0 | Kidney Margin 8120614 | 0.0
CC Gr.2 rectosigmoid (OD03868) | 0.0 | Kidney Cancer 9010320 | 0.0
CC Margin (OD03868) | 0.0 | Kidney Margin 9010321 | 0.0
CC Mod Diff (ODO3920) | 0.0 | Normal Uterus | 0.0
CC Margin (ODO3920) | 0.0 | Uterus Cancer 064011 | 0.0
CC Gr.2 ascend colon (OD03921) | 0.0 | Normal Thyroid | 0.0
CC Margin (OD03921) | 0.0 | Thyroid Cancer 064010 | 0.0
CC from Partial Hepatectomy (ODO4309) Mets | 0.0 | Thyroid Cancer A302152 | 0.0
Liver Margin (ODO4309) | 0.0 | Thyroid Margin A302153 | 0.0
Colon mets to lung (OD04451-01) | [illegible] | Normal Breast | 0.0
Lung Margin (OD04451-02) | 0.0 | Breast Cancer (OD04566) | 100.0
Normal Prostate 6546-1 | 0.1 | Breast Cancer (OD04590-01) | 0.0
Prostate Cancer (OD04410) | 0.0 | Breast Cancer Mets (OD04590-03) | 0.0
Prostate Margin (OD04410) | 0.0 | Breast Cancer Metastasis (OD04655-05) | 0.0
Prostate Cancer (OD04720-01) | 0.1 | Breast Cancer 064006 | 0.0
Prostate Margin (OD04720-02) | 0.0 | Breast Cancer 1024 | 0.0
Normal Lung 061010 | 0.0 | Breast Cancer 9100266 | 5.0
Lung Met to Muscle (OD04286) | 0.0 | Breast Margin 9100265 | 0.0
Muscle Margin (OD04286) | 0.0 | Breast Cancer A209073 |
Lung Malignant Cancer (OD03126) | 0.0 | Breast Margin A209073 | 0.0
Lung Margin (OD03126) | 0.0 | Normal Liver | 0.0
Lung Cancer (OD04404) | 0.0 | Liver Cancer 064003 | 0.0
Lung Margin (OD04404) | 0.0 | Liver Cancer 1025 | 0.0
Lung Cancer (OD04565) | 0.0 | Liver Cancer 1026 | 0.0
Lung Margin (OD04565) | 0.0 | Liver Cancer 6004-T | 0.0
Lung Cancer (OD04237-01) | 0.0 | Liver Tissue 6004-N | 0.0
Lung Margin (OD04237-02) | 0.0 | Liver Cancer 6005-T | 0.0
Ocular Mel Met to Liver (ODO4310) | 0.0 | Liver Tissue 6005-N | 0.0
Liver Margin (ODO4310) | 0.0 | Normal Bladder | 0.0
Melanoma Mets to Lung (OD04321) | 0.0 | Bladder Cancer 1023 | 0.0
Lung Margin (OD04321) | 0.0 | Bladder Cancer A302173 | 0.0
Normal Kidney | 0.0 | Bladder Cancer (OD04718-01) | 0.0
Kidney Ca, Nuclear grade 2 (OD04338) | 0.0 | Bladder Normal Adjacent (OD04718-03) | 0.0
Kidney Margin (OD04338) | 0.0 | Normal Ovary | 0.0
Kidney Ca Nuclear grade 1/2 (OD04339) | 0.0 | Ovarian Cancer 064008 | 0.0
Kidney Margin (OD04339) | 0.0 | Ovarian Cancer (OD04768-07) | 0.0
Kidney Ca, Clear cell type (OD04340) | 0.0 | Ovary Margin (OD04768-08) | 0.0
Kidney Margin (OD04340) | | Normal Stomach | 0.0
Kidney Ca, Nuclear grade 3 (OD04348) | 0.0 | Gastric Cancer 9060358 | 0.0
Kidney Margin (OD04348) | 0.0 | Stomach Margin 9060359 | 0.0
Kidney Cancer (OD04622-01) | 0.0 | Gastric Cancer 9060395 | 0.0
Kidney Margin (OD04622-03) | 0.0 | Stomach Margin 9060394 | 0.0
Kidney Cancer (OD04450-01) | 0.0 | Gastric Cancer 9060397 | 0.0
Kidney Margin (OD04450-03) | 0.0 | Stomach Margin 9060396 | 0.0
Kidney Cancer 8120607 | 0.0 | Gastric Cancer 064005 | 0.0



Table AWE. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4125, Run 172859315 | Tissue Name | Rel. Exp.(%) Ag4125, Run 172859315
Secondary Th1 act | 3.0 | HUVEC IL-1beta | 0.0
Secondary Th2 act | 4.2 | HUVEC IFN gamma | 0.0
Secondary Tr1 act | 0.0 | HUVEC TNF alpha + IFN gamma | 0.0
Secondary Th1 rest | 0.0 | HUVEC TNF alpha + IL4 | 0.0
Secondary Th2 rest | 0.0 | HUVEC IL-11 | 0.0
Secondary Tr1 rest | 2.7 | Lung Microvascular EC none | 0.0
Primary Th1 act | 0.0 | Lung Microvascular EC TNFalpha + IL-1beta | 0.0
Primary Th2 act | 3.0 | Microvascular Dermal EC none | 0.0
Primary Tr1 act | 11.9 | Microvascular Dermal EC TNFalpha + IL-1beta | 7.4
Primary Th1 rest | 7.2 | Bronchial epithelium TNFalpha + IL-1beta | 7.5
Primary Th2 rest | 3.1 | Small airway epithelium none | 0.0
Primary Tr1 rest | 0.0 | Small airway epithelium TNFalpha + IL-1beta |
CD45RA CD4 lymphocyte act | 0.0 | Coronary artery SMC rest | 0.0
CD45RO CD4 lymphocyte act | 0.0 | Coronary artery SMC TNFalpha + IL-1beta | 0.0
CD8 lymphocyte act | 0.0 | Astrocytes rest | 0.0
Secondary CD8 lymphocyte rest | 0.0 | Astrocytes TNFalpha + IL-1beta | 0.0
Secondary CD8 lymphocyte act | 0.0 | KU-812 (Basophil) rest | 0.0
CD4 lymphocyte none | 0.0 | KU-812 (Basophil) PMA/ionomycin | 0.0
2ry Th1/Th2/Tr1_anti-CD95 | 0.0 | CCD1106 (Keratinocytes) none | 0.0
LAK cells rest | 3.3 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 0.0
LAK cells IL-2 | 0.0 | Liver cirrhosis | 0.0
LAK cells IL-2+IL-12 | 5.3 | NCI-H292 none | 0.0
LAK cells IL-2+IFN gamma | 1.8 | NCI-H292 IL-4 | 5.8
LAK cells IL-2+IL-18 | 3.6 | NCI-H292 IL-9 | 3.5
LAK cells PMA/ionomycin | | NCI-H292 IL-13 | 1.3
NK Cells IL-2 rest | [illegible] | NCI-H292 IFN gamma | 0.8
Two Way MLR 3 day | [illegible] | HPAEC none | 0.0
Two Way MLR 5 day | [illegible] | HPAEC TNF alpha + IL-1 beta | 0.0
Two Way MLR 7 day | [illegible] | Lung fibroblast none | 0.0
PBMC rest | 0.0 | Lung fibroblast TNF alpha + IL-1 beta | 2.3
PBMC PWM | | Lung fibroblast IL-4 | 0.0
PBMC PHA-L | 1.6 | Lung fibroblast IL-9 | 0.0
Ramos (B cell) none | 0.0 | Lung fibroblast IL-13 | 0.0
Ramos (B cell) ionomycin | 0.0 | Lung fibroblast IFN gamma | [illegible]
B lymphocytes PWM | 0.0 | Dermal fibroblast CCD1070 rest | 1.0
B lymphocytes CD40L and IL-4 | 0.0 | Dermal fibroblast CCD1070 TNF alpha | [illegible]
EOL-1 dbcAMP | 2.1 | Dermal fibroblast CCD1070 IL-1 beta | 3.0
EOL-1 dbcAMP PMA/ionomycin | 3.3 | Dermal fibroblast IFN gamma | 3.0
Dendritic cells none | 0.0 | Dermal fibroblast IL-4 | 0.0
Dendritic cells LPS | 0.0 | Dermal Fibroblasts rest | 0.0
Dendritic cells anti-CD40 | 0.0 | Neutrophils TNFa+LPS | 0.0
Monocytes rest | 0.0 | Neutrophils rest | 0.0
Monocytes LPS | 13.5 | Colon | 0.0
Macrophages rest | 0.0 | Lung | 5.4
Macrophages LPS | 0.0 | Thymus | 18.4
HUVEC none | 0.0 | Kidney | 100.0
HUVEC starved | 0.0 | |



Table AWF. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag3046, Run 267908989 | Rel. Exp.(%) Ag4125, Run 268390025 | Tissue Name | Rel. Exp.(%) Ag3046, Run 267908989 | Rel. Exp.(%) Ag4125, Run 268390025
Colon cancer 1 | 0.0 | 0.0 | Bladder cancer NAT 2 | 0.0 | 0.0
Colon NAT 1 | 0.0 | 0.0 | Bladder cancer NAT 3 | 0.0 | 3.6
Colon cancer 2 | 0.0 | 0.0 | Bladder cancer NAT 4 | 0.0 | 7.6
Colon cancer NAT 2 | 0.0 | 0.0 | Adenocarcinoma of the prostate 1 | 26.8 | 27.5
Colon cancer 3 | 12.1 | 14.1 | Adenocarcinoma of the prostate 2 | 0.0 | 0.0
Colon cancer NAT 3 | 0.0 | 8.8 | Adenocarcinoma of the prostate 3 | 0.0 | 18.4
Colon malignant cancer 4 | 11.7 | 23.8 | Adenocarcinoma of the prostate 4 | 0.0 | 0.0
Colon normal adjacent tissue 4 | 3.5 | 0.0 | Prostate cancer NAT 5 | 0.0 | 0.0
Lung cancer 1 | 0.0 | 14.2 | Adenocarcinoma of the prostate 6 | 13.0 | 8.4
Lung NAT 1 | 7.3 | 0.0 | Adenocarcinoma of the prostate 7 | 9.6 | 0.0
Lung cancer 2 | 8.7 | 0.0 | Adenocarcinoma of the prostate 8 | 10.7 | 0.0
Lung NAT 2 | 0.0 | 0.0 | Adenocarcinoma of the prostate 9 | 0.0 | 11.7
Squamous cell carcinoma 3 | 16.7 | 0.0 | Prostate cancer NAT 10 | 0.0 | 0.0
Lung NAT 3 | 0.0 | 0.0 | Kidney cancer 1 | 10.2 | 7.7
metastatic melanoma 1 | 8.8 | 14.8 | Kidney NAT 1 | 0.0 | 0.0
Melanoma 2 | 9.1 | 0.0 | Kidney cancer 2 | 100.0 | 100.0
Melanoma 3 | 16.2 | 8.2 | Kidney NAT 2 | 0.0 | 7.7
metastatic melanoma 4 | 0.0 | 19.8 | Kidney cancer 3 | 18.6 | 13.3
metastatic melanoma 5 | 58.2 | 33.7 | Kidney NAT 3 | 0.0 | 8.1
Bladder cancer 1 | 0.0 | 0.0 | Kidney cancer 4 | 17.8 | 0.0
Bladder cancer NAT 1 | 0.0 | 0.0 | Kidney NAT 4 | 0.0 | [illegible]
Bladder cancer 2 | 0.0 | 0.0 | | |







CNS_neurodegeneration_v1.0 Summary: Ag3046 Expression of this gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

Ag4125 Results from one experiment with this gene are not included. The amp
plot indicates that there were experimental difficulties with this run (data not shown).

General_screening_panel_v1.4 Summary: Ag4125 Expression of this gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

Panel 1.3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs
> 35) across all of the samples on this panel (data not shown).

Panel 2D Summary: Ag3046 Significant expression of this gene is seen
exclusively in a breast cancer sample (CT = 25.2). Therefore, expression of this gene may
be used to distinguish breast cancers from the other samples on this panel. Furthermore,
therapeutic modulation of the activity of the GPCR encoded by this gene may be
beneficial in the treatment of breast cancer.

Panel 3D Summary: Ag3046 Expression of this gene is low/undetectable (CTs >
35) across all of the samples on this panel (data not shown).

Panel 4.1D Summary: Ag4125 This gene is only expressed at detectable levels in
the kidney (CT = 32.6). The putative GPCR encoded by this gene could allow cells
within the kidney to respond to specific microenvironmental signals (for example, ref. 1).
Therefore, antibody or small molecule therapies designed with the protein encoded by
this gene could modulate kidney function and be important in the treatment of
inflammatory or autoimmune diseases that affect the kidney, including lupus and
glomerulonephritis.

References: 
1. Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of
recombinant P/Q-type calcium channels by regulators of G protein signalling proteins. J.
Physiol. 528 Pt 1:65-77.

1. Fast synaptic transmission is triggered by the activation of presynaptic Ca2+
channels which can be inhibited by Gbetagamma subunits via G protein-coupled receptors
(GPCR). Regulators of G protein signalling (RGS) proteins are GTPase-accelerating
proteins (GAPs), which are responsible for > 100-fold increases in the GTPase activity of
G proteins and might be involved in the regulation of presynaptic Ca2+ channels. In this
study we investigated the effects of RGS2 on G protein modulation of recombinant
P/Q-type channels expressed in a human embryonic kidney (HEK293) cell line using
whole-cell recordings. 2. RGS2 markedly accelerates transmitter-mediated inhibition and
recovery from inhibition of Ba2+ currents (IBa) through P/Q-type channels heterologously
expressed with the muscarinic acetylcholine receptor M2 (mAChR M2). 3. Both RGS2
and RGS4 modulate the prepulse facilitation properties of P/Q-type Ca2+ channels. G
protein reinhibition is accelerated, while release from inhibition is slowed. These kinetics
depend on the availability of G protein alpha and betagamma subunits which is altered by
RGS proteins. 4. RGS proteins unmask the Ca2+ channel beta subunit modulation of Ca2+
channel G protein inhibition. In the presence of RGS2, P/Q-type channels containing the
beta2a and beta3 subunits reveal significantly altered kinetics of G protein modulation and
increased facilitation compared to Ca2+ channels coexpressed with the beta1b or beta4
subunit.

PMID: 11018106 

Panel 4D Summary: Ag3046 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag3046/Ag4125 Two
experiments with the same probe and primer set are in excellent agreement. Significant
expression of this gene is seen exclusively in a kidney cancer sample (CT=34.6).
Therefore, expression of this gene may be used to distinguish kidney cancers from the
other samples on this panel. Furthermore, therapeutic modulation of the activity of the
GPCR encoded by this gene may be beneficial in the treatment of kidney cancer.
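
The summaries above describe samples with CT values greater than 35 as low/undetectable. The short Python sketch below illustrates that call for the CT values quoted in the Panel 2D, Panel 4.1D and General oncology summaries. The function name and the use of a single fixed cutoff are assumptions made for this illustration and are not taken from the specification.

```python
# Illustrative sketch only: a single CT cutoff of 35, as quoted in the summaries.
# The function name and the two category labels are assumptions for this example.

def call_expression(ct, undetectable_above=35.0):
    """Label a sample by its CT value; CTs above the cutoff count as low/undetectable."""
    if ct > undetectable_above:
        return "low/undetectable"
    return "detectable"


# CT values quoted in the summaries for this gene.
panel_cts = {
    "Panel 2D breast cancer sample": 25.2,
    "Panel 4.1D kidney": 32.6,
    "General oncology kidney cancer sample": 34.6,
}

calls = {name: call_expression(ct) for name, ct in panel_cts.items()}

for name, call in calls.items():
    print(f"{name}: CT={panel_cts[name]} -> {call}")
```

Under this rule the kidney cancer sample (CT=34.6) falls only just below the cutoff, so its call is sensitive to small run-to-run variation in CT.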

AX. CG97658-01: Human Protein Tyrosine Phosphatase

Expression of gene CG97658-01 was assessed using the primer-probe set Ag4128,
described in Table AXA. Results of the RTQ-PCR runs are shown in Tables AXB, AXC,
AXD and AXE.

Table AXA. Probe Name Ag4128

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-aagcagaagagcttcatgaaaa-3' | 22 | 784 | 386
Probe | TET-5'-cctttcctgctgcaggcggaatt-3'-TAMRA | 23 | 816 | 387
Reverse | 5'-aaagttcatggggatttcaaag-3' | 22 | 839 | 388


Table AXB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4128, Run 214956370 | Tissue Name | Rel. Exp.(%) Ag4128, Run 214956370
AD 1 Hippo | 10.7 | Control (Path) 3 Temporal Ctx | 2.9
AD 2 Hippo | 24.1 | Control (Path) 4 Temporal Ctx | 44.4
AD 3 Hippo | 3.2 | AD 1 Occipital Ctx | 12.5
AD 4 Hippo | 7.7 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 100.0 | AD 3 Occipital Ctx | 3.4
AD 6 Hippo | 32.5 | AD 4 Occipital Ctx | 39.5
Control 2 Hippo | 31.0 | AD 5 Occipital Ctx | 54.0
Control 4 Hippo | 3.2 | AD 6 Occipital Ctx | 20.0
Control (Path) 3 Hippo | | Control 1 Occipital Ctx | 1.8
AD 1 Temporal Ctx | 6.0 | Control 2 Occipital Ctx | 83.5
AD 2 Temporal Ctx | 25.3 | Control 3 Occipital Ctx | 26.1
AD 3 Temporal Ctx | 5.1 | Control 4 Occipital Ctx | 3.7
AD 4 Temporal Ctx | 24.5 | Control (Path) 1 Occipital Ctx | 78.5
AD 5 Inf Temporal Ctx | 92.0 | Control (Path) 2 Occipital Ctx | 23.8
AD 5 Sup Temporal Ctx | 25.2 | Control (Path) 3 Occipital Ctx | 1.4
AD 6 Inf Temporal Ctx | 33.7 | Control (Path) 4 Occipital Ctx | 23.0
AD 6 Sup Temporal Ctx | 41.8 | Control 1 Parietal Ctx | 3.9
Control 1 Temporal Ctx | 4.2 | Control 2 Parietal Ctx | 32.5
Control 2 Temporal Ctx | 52.5 | Control 3 Parietal Ctx | 19.9
Control 3 Temporal Ctx | 15.3 | Control (Path) 1 Parietal Ctx | 76.8
Control 3 Temporal Ctx | 13.3 | Control (Path) 2 Parietal Ctx | 22.7
Control (Path) 1 Temporal Ctx | 71.2 | Control (Path) 3 Parietal Ctx | 2.8
Control (Path) 2 Temporal Ctx | 41.2 | Control (Path) 4 Parietal Ctx |





Table AXC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4128, Run 220364381 | Tissue Name | Rel. Exp.(%) Ag4128, Run 220364381
Adipose | 0.7 | Renal ca. TK-10 | 0.0
Melanoma* Hs688(A).T | 0.0 | Bladder | 0.9
Melanoma* Hs688(B).T | 0.0 | Gastric ca. (liver met.) NCI-N87 | 0.0
Melanoma* M14 | 0.0 | Gastric ca. KATO III | 0.0
Melanoma* LOXIMVI | 0.0 | Colon ca. SW-948 | 0.0
Melanoma* SK-MEL-5 | 0.3 | Colon ca. SW480 | 0.0
Squamous cell carcinoma SCC-4 | 0.0 | Colon ca.* (SW480 met) SW620 | 0.0
Testis Pool | 2.2 | Colon ca. HT29 | 0.0
Prostate ca.* (bone met) PC-3 | 0.7 | Colon ca. HCT-116 | 0.0
Prostate Pool | 0.3 | Colon ca. CaCo-2 | 0.0
Placenta | 0.0 | Colon cancer tissue | 0.5
Uterus Pool | 0.2 | Colon ca. SW1116 | 0.0
Ovarian ca. OVCAR-3 | 0.0 | Colon ca. Colo-205 | 0.0
Ovarian ca. SK-OV-3 | 0.4 | Colon ca. SW-48 | 0.0
Ovarian ca. OVCAR-4 | 0.0 | Colon Pool | 1.6
Ovarian ca. OVCAR-5 | 0.0 | Small Intestine Pool | 0.3
Ovarian ca. IGROV-1 | 0.0 | Stomach Pool | 0.4
Ovarian ca. OVCAR-8 | 0.0 | Bone Marrow Pool | 0.1
Ovary | 6.0 | Fetal Heart | 2.2
Breast ca. MCF-7 | 0.0 | Heart Pool | 1.2
Breast ca. MDA-MB-231 | 0.0 | Lymph Node Pool | 1.6
Breast ca. BT 549 | 0.0 | Fetal Skeletal Muscle | 0.2
Breast ca. T47D | 0.0 | Skeletal Muscle Pool | 0.2
Breast ca. MDA-N | 0.0 | Spleen Pool | 0.2
Breast Pool | 0.6 | Thymus Pool | 0.3
Trachea | 0.9 | CNS cancer (glio/astro) U87-MG | 0.0
Lung | 0.3 | CNS cancer (glio/astro) U-118-MG | 0.4
Fetal Lung | 1.3 | CNS cancer (neuro;met) SK-N-AS | 0.1
Lung ca. NCI-N417 | 0.0 | CNS cancer (astro) SF-539 | 0.0
Lung ca. LX-1 | 0.0 | CNS cancer (astro) SNB-75 | 0.0
Lung ca. NCI-H146 | 0.0 | CNS cancer (glio) SNB-19 | 0.0
Lung ca. SHP-77 | 0.0 | CNS cancer (glio) SF-295 | 0.0
Lung ca. A549 | 0.0 | Brain (Amygdala) Pool | 63.7
Lung ca. NCI-H526 | 0.0 | Brain (cerebellum) |
Lung ca. NCI-H23 | 0.1 | Brain (fetal) | 81.8
Lung ca. NCI-H460 | 0.0 | Brain (Hippocampus) Pool | 40.7
Lung ca. HOP-62 | 0.1 | Cerebral Cortex Pool | 77.9
Lung ca. NCI-H522 | 1.8 | Brain (Substantia nigra) Pool | 65.1
Liver | 0.0 | Brain (Thalamus) Pool | 100.0
Fetal Liver | 0.1 | Brain (whole) | 95.9
Liver ca. HepG2 | 0.0 | Spinal Cord Pool | 11.3
Kidney Pool | 0.9 | Adrenal Gland | 0.6
Fetal Kidney | 0.5 | Pituitary gland Pool | 0.0
Renal ca. 786-0 | 0.0 | Salivary Gland | 0.0
Renal ca. A498 | 0.0 | Thyroid (female) | 0.3
Renal ca. ACHN | 0.0 | Pancreatic ca. CAPAN2 | 0.0
Renal ca. UO-31 | 0.0 | Pancreas Pool | 0.8



Table AXD. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4128, Run 172859562 | Tissue Name | Rel. Exp.(%) Ag4128, Run 172859562
Secondary Th1 act | 0.0 | HUVEC IL-1beta | 0.0
Secondary Th2 act | 0.0 | HUVEC IFN gamma | 0.0
Secondary Tr1 act | 0.0 | HUVEC TNF alpha + IFN gamma | 0.0
Secondary Th1 rest | 0.0 | HUVEC TNF alpha + IL4 | 0.0
Secondary Th2 rest | 0.0 | HUVEC IL-11 | 0.0
Secondary Tr1 rest | 0.0 | Lung Microvascular EC none | 0.0
Primary Th1 act | 0.0 | Lung Microvascular EC TNFalpha + IL-1beta | 0.0
Primary Th2 act | 0.0 | Microvascular Dermal EC none | 0.0
Primary Tr1 act | 0.0 | Microvascular Dermal EC TNFalpha + IL-1beta | 0.0
Primary Th1 rest | 0.0 | Bronchial epithelium TNFalpha + IL-1beta | 0.0
Primary Th2 rest | 0.0 | Small airway epithelium none | 0.0
Primary Tr1 rest | 0.0 | Small airway epithelium TNFalpha + IL-1beta | 0.0
CD45RA CD4 lymphocyte act | 0.0 | Coronary artery SMC rest | 35.1
CD45RO CD4 lymphocyte act | 0.0 | Coronary artery SMC TNFalpha + IL-1beta | 100.0
CD8 lymphocyte act | 0.0 | Astrocytes rest | 0.0
Secondary CD8 lymphocyte rest | 0.0 | Astrocytes TNFalpha + IL-1beta | 0.0
Secondary CD8 lymphocyte act | 0.0 | KU-812 (Basophil) rest | 0.0
CD4 lymphocyte none | 0.0 | KU-812 (Basophil) PMA/ionomycin | 0.0
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 0.0 | CCD1106 (Keratinocytes) none | 0.0
LAK cells rest | 0.0 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 0.0
LAK cells IL-2 | 0.0 | Liver cirrhosis | 0.0
LAK cells IL-2+IL-12 | 0.0 | NCI-H292 none | 0.0
LAK cells IL-2+IFN gamma | 0.0 | NCI-H292 IL-4 | 0.0
LAK cells IL-2+IL-18 | 0.0 | NCI-H292 IL-9 | 0.0
LAK cells PMA/ionomycin | 0.0 | NCI-H292 IL-13 | 0.0
NK Cells IL-2 rest | | NCI-H292 IFN gamma | 0.0
Two Way MLR 3 day | 0.0 | HPAEC none | 0.0
Two Way MLR 5 day | 0.0 | HPAEC TNF alpha + IL-1beta | 0.0
Two Way MLR 7 day | 0.0 | Lung fibroblast none | 0.0
PBMC rest | 0.0 | Lung fibroblast TNF alpha + IL-1beta | 0.0
PBMC PWM | 0.0 | Lung fibroblast IL-4 | 0.0
PBMC PHA-L | 0.0 | Lung fibroblast IL-9 | 0.0
Ramos (B cell) none | 0.0 | Lung fibroblast IL-13 | 0.0
Ramos (B cell) ionomycin | 0.0 | Lung fibroblast IFN gamma | 0.0
B lymphocytes PWM | | Dermal fibroblast CCD1070 rest | 0.0
B lymphocytes CD40L and IL-4 | 0.0 | Dermal fibroblast CCD1070 TNF alpha | 0.0
EOL-1 dbcAMP | 0.0 | Dermal fibroblast CCD1070 IL-1beta | 0.0
EOL-1 dbcAMP PMA/ionomycin | 0.0 | Dermal fibroblast IFN gamma | 0.0
Dendritic cells none | 0.0 | Dermal fibroblast IL-4 | 10.4
Dendritic cells LPS | 0.0 | Dermal Fibroblasts rest | 0.0
Dendritic cells anti-CD40 | 0.0 | Neutrophils TNFa+LPS | 0.0
Monocytes rest | 0.0 | Neutrophils rest | 0.0
Monocytes LPS | 0.0 | Colon | 10.4
Macrophages rest | 0.0 | Lung | 56.6
Macrophages LPS | 0.0 | Thymus | 12.9
HUVEC none | 0.0 | Kidney | 0.0
HUVEC starved | 0.0 | |







Table AXE. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4128, Run 268390026 | Tissue Name | Rel. Exp.(%) Ag4128, Run 268390026
Colon cancer 1 | 4.1 | Bladder cancer NAT 2 | 3.4
Colon NAT 1 | 4.3 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 3.8 | Bladder cancer NAT 4 | 10.7
Colon cancer NAT 2 | 4.2 | Adenocarcinoma of the prostate 1 | 4.2
Colon cancer 3 | 14.6 | Adenocarcinoma of the prostate 2 | 0.0
Colon cancer NAT 3 | 14.7 | Adenocarcinoma of the prostate 3 | 2.1
Colon malignant cancer 4 | 9.3 | Adenocarcinoma of the prostate 4 | 15.4
Colon normal adjacent tissue 4 | 1.7 | Prostate cancer NAT 5 | 1.3
Lung cancer 1 | 0.0 | Adenocarcinoma of the prostate 6 | 3.3
Lung NAT 1 | 0.0 | Adenocarcinoma of the prostate 7 | 3.3
Lung cancer 2 | 43.8 | Adenocarcinoma of the prostate 8 | 0.0
Lung NAT 2 | 1.6 | Adenocarcinoma of the prostate 9 | 2.6
Squamous cell carcinoma 3 | 3.8 | Prostate cancer NAT 10 | 1.7
Lung NAT 3 | 0.0 | Kidney cancer 1 | 6.4
metastatic melanoma 1 | 42.9 | Kidney NAT 1 | 9.7
Melanoma 2 | 3.7 | Kidney cancer 2 | 20.1
Melanoma 3 | 2.1 | Kidney NAT 2 | 9.7
metastatic melanoma 4 | 90.8 | Kidney cancer 3 | 5.2
metastatic melanoma 5 | 100.0 | Kidney NAT 3 | 2.0
Bladder cancer 1 | 0.0 | Kidney cancer 4 | 18.3
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 6.1
Bladder cancer 2 | 0.8 | |







CNS_neurodegeneration_v1.0 Summary: Ag4128 This panel confirms the
expression of the CG97658-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.
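
One way to look at the statement that no differential expression was detected is to compare the temporal cortex values listed in Table AXB for the AD and control groups. The Python sketch below runs an exact permutation test on the difference in group means. This is an illustrative choice of test and is not the analysis described in the specification. It also assumes that the (Path) controls can be pooled with the other controls and that the OCR-damaged row labels are read correctly from context.

```python
# Illustrative sketch only: exact permutation test on Table AXB temporal cortex values.
# Pooling "Control" and "Control (Path)" samples is an assumption for this example.
from itertools import combinations
from statistics import mean

# Rel. Exp.(%) values for temporal cortex samples, as listed in Table AXB.
ad_temporal = [6.0, 25.3, 5.1, 24.5, 92.0, 25.2, 33.7, 41.8]
control_temporal = [4.2, 52.5, 15.3, 13.3, 71.2, 41.2, 2.9, 44.4]


def permutation_p_value(group_a, group_b):
    """Two-sided exact permutation p-value for the difference in group means."""
    pooled = group_a + group_b
    n_a, n_b = len(group_a), len(group_b)
    total = sum(pooled)
    observed = abs(mean(group_a) - mean(group_b))
    at_least_as_extreme = 0
    n_splits = 0
    # Enumerate every way of assigning n_a of the pooled values to group A.
    for indices in combinations(range(len(pooled)), n_a):
        sum_a = sum(pooled[i] for i in indices)
        diff = abs(sum_a / n_a - (total - sum_a) / n_b)
        if diff >= observed - 1e-12:  # tolerance for floating-point ties
            at_least_as_extreme += 1
        n_splits += 1
    return at_least_as_extreme / n_splits


observed_diff = mean(ad_temporal) - mean(control_temporal)
p_value = permutation_p_value(ad_temporal, control_temporal)

print(f"AD mean = {mean(ad_temporal):.2f}")
print(f"Control mean = {mean(control_temporal):.2f}")
print(f"difference = {observed_diff:.3f}, permutation p = {p_value:.3f}")
```

The two group means differ by only about one percentage point on a scale where individual samples range from under 5 to over 90, which is in line with the summary's statement.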
General_screening_panel_v1.4 Summary: Ag4128 Highest expression of the
CG97658-01 gene is seen in thalamus (CT=25.7). High expression of this gene is detected
in all regions of the central nervous system (CNS) examined, including amygdala,
hippocampus, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord.
Therefore, expression of this gene can be used to distinguish CNS samples from other
samples in this panel. Furthermore, therapeutic modulation of this gene may be useful in
the treatment of central nervous system disorders such as Alzheimer's disease, Parkinson's
disease, epilepsy, multiple sclerosis, schizophrenia and depression.
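
The relationship between the CT values quoted in the summaries and the Rel. Exp.(%) columns in the tables can be sketched as follows. The sketch assumes that the sample with the lowest CT on a panel is set to 100% and that every additional cycle halves the relative expression, i.e. Rel. Exp.(%) = 100 x 2^(CTmin - CT). This scaling is not stated in this section, and the function names are hypothetical.

```python
# Illustrative sketch only. Assumption: Rel. Exp.(%) = 100 * 2 ** (ct_min - ct),
# with the lowest-CT sample on the panel taken as 100%.
import math


def relative_expression_pct(ct, ct_min):
    """Relative expression (%) of a sample, given the panel's lowest CT."""
    return 100.0 * 2.0 ** (ct_min - ct)


def ct_from_relative_pct(rel_pct, ct_min):
    """Invert the scaling: estimate a sample's CT from its relative expression (%)."""
    if rel_pct <= 0:
        raise ValueError("a CT cannot be estimated for a relative expression of 0")
    return ct_min + math.log2(100.0 / rel_pct)


thalamus_ct = 25.7          # highest-expressing sample, quoted in the summary
spinal_cord_rel_pct = 11.3  # Spinal Cord Pool, Table AXC

spinal_cord_ct = ct_from_relative_pct(spinal_cord_rel_pct, thalamus_ct)
print(f"estimated Spinal Cord Pool CT: {spinal_cord_ct:.1f}")
print(f"one cycle above the minimum: {relative_expression_pct(26.7, 25.7):.1f}%")
```

Under this assumption a value of 0.0% only means that the sample lies many cycles above the panel minimum; it does not by itself give the sample's CT.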

Among tissues with metabolic or endocrine function, this gene is expressed at low
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

Panel 4.1D Summary: Ag4128 Highest expression of the CG97658-01 gene is
detected in TNFalpha + IL-1beta treated coronary artery SMC cells. Therefore, expression
of this gene can be used to distinguish this sample from other samples in this panel.
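
How strongly one sample stands out from the rest of a panel can be expressed as a single number. The Python sketch below uses the tau index, a commonly used tissue-specificity measure that ranges from 0 (uniform expression) to 1 (expression in a single sample). The specification does not use this measure; it is introduced here only as an illustration, applied to a small subset of the Table AXD values.

```python
# Illustrative sketch only: tau specificity index, tau = sum(1 - x_i / x_max) / (n - 1).
# The specification does not use this measure; it is swapped in for illustration.

def tau_index(values):
    """Return the tau index of a list of non-negative expression values."""
    values = list(values)
    if len(values) < 2:
        raise ValueError("tau needs at least two samples")
    peak = max(values)
    if peak <= 0:
        raise ValueError("tau is undefined when no sample is expressed")
    return sum(1.0 - v / peak for v in values) / (len(values) - 1)


# Subset of Rel. Exp.(%) values from Table AXD (not the full panel).
panel_subset = {
    "Coronary artery SMC TNFalpha + IL-1beta": 100.0,
    "Lung": 56.6,
    "Coronary artery SMC rest": 35.1,
    "Thymus": 12.9,
    "Kidney": 0.0,
    "HUVEC none": 0.0,
}

tau = tau_index(panel_subset.values())
top_sample = max(panel_subset, key=panel_subset.get)
print(f"top sample: {top_sample}, tau = {tau:.3f}")
```

On the full panel, where most samples read 0.0, the index would be closer to 1 than it is for this six-sample subset.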

In addition, low expression of this gene is seen in lung. Therefore, therapeutic
modulation of this gene product may be beneficial in the treatment of lung related
disorders such as chronic obstructive pulmonary disease, asthma, allergy and emphysema.

General oncology screening panel_v_2.4 Summary: Ag4128 Highest
expression of the CG97658-01 gene is detected in a metastatic melanoma sample (CT=31.5).
Significant expression of this gene is associated with kidney cancer, melanoma and lung
cancer. Therefore, therapeutic modulation of this gene may be useful in the treatment of
these cancers.

AY. CG97842-01: Protein Kinase-Formin-Like Protein

Expression of gene CG97842-01 was assessed using the primer-probe set Ag4130,
described in Table AYA. Results of the RTQ-PCR runs are shown in Tables AYB, AYC
and AYD.

Table AYA. Probe Name Ag4130

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-attcaacatgcccgttaataca-3' | 22 | 2409 | 389
Probe | TET-5'-ccagaacttctacagtagtccaagcaca-3'-TAMRA | 28 | 2433 | 390
Reverse | 5'-agagtcatcttggtcactccaa-3' | 22 | 2462 |





Table AYB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4130, Run 214959899 | Tissue Name | Rel. Exp.(%) Ag4130, Run 214959899
AD 1 Hippo | 16.4 | Control (Path) 3 Temporal Ctx | [illegible]
AD 2 Hippo | 34.6 | Control (Path) 4 Temporal Ctx | 50.0
AD 3 Hippo | 9.6 | AD 1 Occipital Ctx | 21.5
AD 4 Hippo | 16.3 | AD 2 Occipital Ctx (Missing) |
AD 5 Hippo | 90.8 | AD 3 Occipital Ctx |
AD 6 Hippo | 80.1 | AD 4 Occipital Ctx | 20.[illegible]
Control 2 Hippo | 33.0 | AD 5 Occipital Ctx | 37.1
Control 4 Hippo | 13.8 | AD 6 Occipital Ctx | 55.9
Control (Path) 3 Hippo | 17.0 | Control 1 Occipital Ctx | 4.4
AD 1 Temporal Ctx | 20.7 | Control 2 Occipital Ctx | 36.3
AD 2 Temporal Ctx | 42.3 | Control 3 Occipital Ctx | 25.0
AD 3 Temporal Ctx | 10.2 | Control 4 Occipital Ctx | 10.7
AD 4 Temporal Ctx | 40.1 | Control (Path) 1 Occipital Ctx | 85.3
AD 5 Inf Temporal Ctx | 100.0 | Control (Path) 2 Occipital Ctx | 24.8
AD 5 Sup Temporal Ctx | 58.0 | Control (Path) 3 Occipital Ctx | 3.5
AD 6 Inf Temporal Ctx | 69.3 | Control (Path) 4 Occipital Ctx | 29.5
AD 6 Sup Temporal Ctx | 0.1 | Control 1 Parietal Ctx | 5.7
Control 1 Temporal Ctx | 8.6 | Control 2 Parietal Ctx | 43.5
Control 2 Temporal Ctx | 29.7 | Control 3 Parietal Ctx | 24.7
Control 3 Temporal Ctx | 19.8 | Control (Path) 1 Parietal Ctx | 67.4
Control 3 Temporal Ctx | 12.9 | Control (Path) 2 Parietal Ctx | 31.0
Control (Path) 1 Temporal Ctx | 50.7 | Control (Path) 3 Parietal Ctx | 5.3
Control (Path) 2 Temporal Ctx | 35.1 | Control (Path) 4 Parietal Ctx | 57.8

Table AYC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4130, Run 220424645 | Tissue Name | Rel. Exp.(%) Ag4130, Run 220424645
Adipose | 12.0 | Renal ca. TK-10 | 34.2
Melanoma* Hs688(A).T | 27.0 | Bladder | 28.5
Melanoma* Hs688(B).T | 24.5 | Gastric ca. (liver met.) NCI-N87 | 71.7
Melanoma* M14 | 41.5 | Gastric ca. KATO III | 47.6
Melanoma* LOXIMVI | 25.2 | Colon ca. SW-948 | 10.9
Melanoma* SK-MEL-5 | 33.9 | Colon ca. SW480 | 65.5
Squamous cell carcinoma SCC-4 | 17.4 | Colon ca.* (SW480 met) SW620 | 38.2
Testis Pool | 12.9 | Colon ca. HT29 | 27.7
Prostate ca.* (bone met) PC-3 | 95.9 | Colon ca. HCT-116 | 43.8
Prostate Pool | 8.7 | Colon ca. CaCo-2 | 30.8
Placenta | 3.5 | Colon cancer tissue | 37.4
Uterus Pool | 4.2 | Colon ca. SW1116 | 12.5
Ovarian ca. OVCAR-3 | 35.4 | Colon ca. Colo-205 | [illegible]
Ovarian ca. SK-OV-3 | 61.6 | Colon ca. SW-48 | [illegible]
Ovarian ca. OVCAR-4 | 27.0 | Colon Pool | 18.6
Ovarian ca. OVCAR-5 | 37.1 | Small Intestine Pool | 15.6
Ovarian ca. IGROV-1 | 18.4 | Stomach Pool | [illegible]
Ovarian ca. OVCAR-8 | 14.9 | Bone Marrow Pool | [illegible]
Ovary | 11.3 | Fetal Heart |
Breast ca. MCF-7 | 59.9 | Heart Pool | 9.3
Breast ca. MDA-MB-231 | 46.7 | Lymph Node Pool | 17.0
Breast ca. BT 549 | 54.7 | Fetal Skeletal Muscle | 6.6
Breast ca. T47D | 100.0 | Skeletal Muscle Pool | 13.8
Breast ca. MDA-N | 18.3 | Spleen Pool | 12.2
Breast Pool | 16.8 | Thymus Pool | 15.3
Trachea | 9.5 | CNS cancer (glio/astro) U87-MG | [illegible]
Lung | 3.5 | CNS cancer (glio/astro) U-118-MG | 90.1
Fetal Lung | 50.0 | CNS cancer (neuro;met) SK-N-AS | 57.8
Lung ca. NCI-N417 | 6.1 | CNS cancer (astro) SF-539 | 21.2
Lung ca. LX-1 | 50.[illegible] | CNS cancer (astro) SNB-75 | 51.8
Lung ca. NCI-H146 | 13.3 | CNS cancer (glio) SNB-19 | 16.4
Lung ca. SHP-77 | 30.4 | CNS cancer (glio) SF-295 | 73.7
Lung ca. A549 | 37.4 | Brain (Amygdala) Pool | 6.1
Lung ca. NCI-H526 | 7.5 | Brain (cerebellum) | 8.2
Lung ca. NCI-H23 | 38.2 | Brain (fetal) | 23.5
Lung ca. NCI-H460 | 24.0 | Brain (Hippocampus) Pool | 9.1
Lung ca. HOP-62 | 13.0 | Cerebral Cortex Pool | 14.2
Lung ca. NCI-H522 | 46.3 | Brain (Substantia nigra) Pool | 6.2
Liver | 1.1 | Brain (Thalamus) Pool | 15.6
Fetal Liver | 28.1 | Brain (whole) | 5.3
Liver ca. HepG2 | 23.2 | Spinal Cord Pool | 8.1
Kidney Pool | 21.9 | Adrenal Gland | 7.6
Fetal Kidney | 28.3 | Pituitary gland Pool | 8.0
Renal ca. 786-0 | 43.8 | Salivary Gland | 1.9
Renal ca. A498 | 9.2 | Thyroid (female) | 4.0
Renal ca. ACHN | 22.1 | Pancreatic ca. CAPAN2 | 16.8
Renal ca. UO-31 | 32.3 | Pancreas Pool | 20.0


Table AYD. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4130, Run 172859566 | Tissue Name | Rel. Exp.(%) Ag4130, Run 172859566
Secondary Th1 act | 39.8 | HUVEC IL-1beta | 43.8
Secondary Th2 act | 46.3 | HUVEC IFN gamma | 48.3
Secondary Tr1 act | 36.3 | HUVEC TNF alpha + IFN gamma | 34.2
Secondary Th1 rest | 17.3 | HUVEC TNF alpha + IL4 | 32.8
Secondary Th2 rest | 15.8 | HUVEC IL-11 | 30.8
Secondary Tr1 rest | [illegible] | Lung Microvascular EC none | 57.0
Primary Th1 act | 41.2 | Lung Microvascular EC TNFalpha + IL-1beta | 55.9
Primary Th2 act | 34.2 | Microvascular Dermal EC none | 37.1
Primary Tr1 act | 33.7 | Microvascular Dermal EC TNFalpha + IL-1beta | 37.1
Primary Th1 rest | 28.5 | Bronchial epithelium TNFalpha + IL-1beta | 100.0
Primary Th2 rest | 14.5 | Small airway epithelium none | 12.4
Primary Tr1 rest | 22.8 | Small airway epithelium TNFalpha + IL-1beta | 40.9
CD45RA CD4 lymphocyte act | [illegible] | Coronary artery SMC rest | 21.3
CD45RO CD4 lymphocyte act | 37.6 | Coronary artery SMC TNFalpha + IL-1beta | 23.8
CD8 lymphocyte act | 37.1 | Astrocytes rest | 34.2
Secondary CD8 lymphocyte rest | 26.4 | Astrocytes TNFalpha + IL-1beta | 22.8
Secondary CD8 lymphocyte act | 16.2 | KU-812 (Basophil) rest | 36.6
CD4 lymphocyte none | 12.2 | KU-812 (Basophil) PMA/ionomycin | [illegible]
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 31.9 | CCD1106 (Keratinocytes) none | 30.1
LAK cells rest | 36.9 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 26.1
LAK cells IL-2 | 30.1 | Liver cirrhosis | 16.2
LAK cells IL-2+IL-12 | 21.8 | NCI-H292 none | 24.5
LAK cells IL-2+IFN gamma | 33.9 | NCI-H292 IL-4 | 46.3
LAK cells IL-2+IL-18 | 29.5 | NCI-H292 IL-9 | 48.0
LAK cells PMA/ionomycin | 81.2 | NCI-H292 IL-13 | 41.8
NK Cells IL-2 rest | 36.1 | NCI-H292 IFN gamma | 39.2
Two Way MLR 3 day | 35.4 | HPAEC none | 30.4
Two Way MLR 5 day | [illegible] | HPAEC TNF alpha + IL-1beta | 51.4
Two Way MLR 7 day | 21.8 | Lung fibroblast none | 39.8
PBMC rest | 14.6 | Lung fibroblast TNF alpha + IL-1beta | 27.0
PBMC PWM | 27.9 | Lung fibroblast IL-4 | 27.5
PBMC PHA-L | 23.7 | Lung fibroblast IL-9 | 36.6
Ramos (B cell) none | 30.[illegible] | Lung fibroblast IL-13 | 51.4
Ramos (B cell) ionomycin | 35.4 | Lung fibroblast IFN gamma | 46.0
B lymphocytes PWM | 23.3 | Dermal fibroblast CCD1070 rest | 45.7
B lymphocytes CD40L and IL-4 | 23.2 | Dermal fibroblast CCD1070 TNF alpha | 61.6
EOL-1 dbcAMP | 29.5 | Dermal fibroblast CCD1070 IL-1beta | 30.4
EOL-1 dbcAMP PMA/ionomycin | 61.1 | Dermal fibroblast IFN gamma | 24.8
Dendritic cells none | 57.0 | Dermal fibroblast IL-4 | 36.9
Dendritic cells LPS | 38.2 | Dermal Fibroblasts rest | 17.1
Dendritic cells anti-CD40 | 39.8 | Neutrophils TNFa+LPS | 11.3
Monocytes rest | 38.4 | Neutrophils rest | 30.1
Monocytes LPS | 72.2 | Colon | 10.0
Macrophages rest | 36.9 | Lung | 17.9
Macrophages LPS | 36.1 | Thymus | 47.3
HUVEC none | 31.0 | Kidney | 44.8
HUVEC starved | 38.4 | |







CNS_neurodegeneration_v1.0 Summary: Ag4130 This panel confirms the
expression of the CG97842-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4130 Highest expression of the
CG97842-01 gene is detected in a breast cancer cell line (CT=25.6). High expression of
this gene is seen in a cluster of cancer cell lines including CNS, colon, gastric, renal, lung,
breast, ovarian, prostate, melanoma, squamous cell carcinoma and pancreatic cancer cell
lines. Therefore, therapeutic modulation of this gene product may be beneficial in the
treatment of these cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

Interestingly, this gene is expressed at much higher levels in fetal liver and lung (CT=27)
than in adult liver and lung (CT=30-32). This observation suggests that expression of
this gene can be used to distinguish fetal from adult liver and lung, respectively. In
addition, the relative overexpression of this gene in fetal tissue suggests that the protein
product may enhance growth or development of lung and liver in the fetus and thus may
also act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the
protein encoded by this gene could be useful in treatment of liver and lung related
diseases.
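
The size of the fetal versus adult difference implied by these CT values can be estimated from the CT gap. The Python sketch below assumes that the amount of product multiplies by (1 + efficiency) in each cycle, with efficiency = 1.0 corresponding to perfect doubling. The efficiency of these runs is not given in this section, and the function name is hypothetical.

```python
# Illustrative sketch only. Assumption: product multiplies by (1 + efficiency)
# per cycle; efficiency = 1.0 is perfect doubling. Run efficiencies are not
# stated in this section.

def fold_change(ct_reference, ct_target, efficiency=1.0):
    """Fold excess of the target over the reference sample, from their CT values."""
    return (1.0 + efficiency) ** (ct_reference - ct_target)


fetal_ct = 27.0
adult_ct_low, adult_ct_high = 30.0, 32.0  # adult range quoted in the summary

fold_low = fold_change(adult_ct_low, fetal_ct)
fold_high = fold_change(adult_ct_high, fetal_ct)
print(f"fetal vs adult: {fold_low:.0f}-fold to {fold_high:.0f}-fold")

# A lower assumed efficiency shrinks the estimate for the same CT gap.
print(f"3-cycle gap at 90% efficiency: {fold_change(30.0, 27.0, efficiency=0.9):.1f}-fold")
```

With perfect doubling, a gap of 3 to 5 cycles corresponds to roughly 8-fold to 32-fold higher expression in the fetal tissue.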

In addition, this gene is expressed at high levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4130 Highest expression of the CG97842-01 gene is
detected in TNFalpha + IL-1beta treated bronchial epithelium (CT=28.7). This gene is
expressed at high to moderate levels in a wide range of cell types of significance in the
immune response in health and disease. These cells include members of the T-cell, B-cell,
endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell family, as
well as epithelial and fibroblast cell types from lung and skin, and normal tissues
represented by colon, lung, thymus and kidney. This ubiquitous pattern of expression
suggests that this gene product may be involved in homeostatic processes for these and
other cell types and tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.



AZ. CG98021-01: Synaptotagmin III 

Expression of gene CG98021-01 was assessed using the primer-probe set Ag4138,
described in Table AZA. Results of the RTQ-PCR runs are shown in Tables AZB, AZC, 
AZD and AZE. 

Table AZA. Probe Name Ag4138

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-tcattcattcaacaaacatcca-3' | 22 | 117 | 392
Probe | TET-5'-agcaccaactacgccctacgtgct-3'-TAMRA | 24 | 90 | 393
Reverse | 5'-gaactccaggaccttgttctct-3' | 22 | 45 | 394



Table AZB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4138, Run 214961618 | Tissue Name | Rel. Exp.(%) Ag4138, Run 214961618
AD 1 Hippo | 8.0 | Control (Path) 3 Temporal Ctx | 1.7
AD 2 Hippo | 17.8 | Control (Path) 4 Temporal Ctx | 19.5
AD 3 Hippo | 1.9 | AD 1 Occipital Ctx | 6.4
AD 4 Hippo | 2.9 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 100.0 | AD 3 Occipital Ctx | 1.3
AD 6 Hippo | 29.3 | AD 4 Occipital Ctx | 11.7
Control 2 Hippo | 18.7 | AD 5 Occipital Ctx | 18.8
Control 4 Hippo | 2.0 | AD 6 Occipital Ctx | 60.7
Control (Path) 3 Hippo | 1.4 | Control 1 Occipital Ctx | 0.5
AD 1 Temporal Ctx | 4.6 | Control 2 Occipital Ctx | 76.8
AD 2 Temporal Ctx | 17.2 | Control 3 Occipital Ctx | 5.3
AD 3 Temporal Ctx | 2.4 | Control 4 Occipital Ctx | 0.9
AD 4 Temporal Ctx | 15.3 | Control (Path) 1 Occipital Ctx | 79.6
AD 5 Inf Temporal Ctx | 81.8 | Control (Path) 2 Occipital Ctx | 8.6
AD 5 Sup Temporal Ctx | 24.1 | Control (Path) 3 Occipital Ctx | 0.4
AD 6 Inf Temporal Ctx | 25.0 | Control (Path) 4 Occipital Ctx | 10.8
AD 6 Sup Temporal Ctx | | Control 1 Parietal Ctx | 1.1
Control 1 Temporal Ctx | 0.3 | Control 2 Parietal Ctx |
Control 2 Temporal Ctx | 38.7 | Control 3 Parietal Ctx | 8.2
Control 3 Temporal Ctx | 9.2 | Control (Path) 1 Parietal Ctx | 78.5
Control 4 Temporal Ctx | 1.7 | Control (Path) 2 Parietal Ctx | 29.7
Control (Path) 1 Temporal Ctx | 57.8 | Control (Path) 3 Parietal Ctx | 0.7
Control (Path) 2 Temporal Ctx | 36.1 | Control (Path) 4 Parietal Ctx | 38.4


Table AZC. General screeningjpanel v1.4 


Tissue Name 


Rel. Exp.(%) 
Ag4138,Run 
220967147 


Tissue Name 


Rel. 

Exp.(%) 
Ag4138, 
Run 

220967147 


Adipose 


0.0 


Renal ca. TK-10 


0.8 


Melanoma* Hs688(A).T 


0.0 


Bladder 


0.0 


Melan .ma* Hs688(B).T 


).] 


Gastric ca. (liver met.) NCI-N87 


0.1 


Melanoma* M14 


0.1 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


).() 


Colon ca. SW-948 


0.0 


Melanoma* SK-MEL-5 


2.1 


Colon ca. SW480 


0.6 


Squamous cell carcinoma 
SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.1 


Testis Pool 


0.8 


Colon ca. HT29 


0.0 
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Prostate ca.* (bone met) 
PC-3 


0.5 


Colon ca.HCT-1 16 


6.1 


Prostate Pool 


0.4 


Colon ca. CaCo-2 


0.7 


Placenta 


0.7 


Colon cancer tissue 


0.0 


Uterus Pool 


jo.o 




Colon ca. SWI1 16 


0.0 


.Ovarian ca. OVCAR-3 


0.1 


Colon ca. Colo-205 


0.0 


< >\ irian c SK (>\ i 


5.4 


Colon ca. SW-48 


0.0 


jOvarian ca. OVCAR-4 


0.5 


Colon Pool 


0.2 


iOvarian ca. OVCAR-5 






Small Intestine Pool 




0.0 


< ■ i in ca. IGROV-1 


0.8 


Stomach Pool 


0.0 




Ovarian ca. OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.3 


Fetal Heart 


0.1 


Breast ca. MCF-7 


0.3 


Heart Pool 


0.0 


Breast ca. MDA-MB-23 1 


0.0 


Lymph Node Pool 


0.6 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 
;Breast ca. MDA-N 


1.1 

6.7 




Skeletal Muscle Pool 
Spleen Pool 


0.0 

oT 




jBreast Pool 


0.2 


Thymus Pool 


10.0 




'Trachea 


0.0 


CNS cancer (glio/astro) U87-MG Jo.O 


t " 

.Lung 


0.0 


ICNS cancer (elio/astro) U-118- 
jMG 


0.1 




Fetal Lung 


0.9 


jCNS cancer (neuro;met) SK-N-ASjl.l 




Lung ca. NCI-N417 


0.2 


jCNS cancer (astro) SF-539 


jo.o 




Lung ca. LX-1 


0.0 


jCNS cancer (astro) SNB-75 


jo.o 




Lungca. NCI-HI 46 


0.0 


JCNS cancer (glio) SNB-19 


j0.3 




Lung ca. SHP-77 


0.0 


|CNS cancer (glio) SF-295 


10.7 




Lung ca. A549 |0.7 


jBrain (Amygdala) Pool 


jll.7 




Lung ca. NC1-H526 jo.O 


jBrain (cerebellum) 


jc 


8.6 




Lung ca. NCI-H23 


3.7 


jBrain (fetal) 


j 


00.0 




Lung ca. NCI-H460 


0.5 


jBrain (Hippocampus) Pool 


|l2.3 




Lung ca. HOP-62 


0.1 


jCerebral Cortex Pool 


J24.5 




Lung ca. NCI-H522 


3.2 


jBrain (Substantia nigra) Pool 


]l7.7 




Liver 


0.O 


jBrain (Thalamus) Pool 


129.3 




Fetal Liver 


0.0 


jBrain (whole) 


h 


8.3 




Liver ca. HepG2 


0.0


Spinal Cord Pool


3.0




Kidney Pool


3.1 


Adrenal Gland


|,.o 




Fetal Kidney 


.2 


Pituitary gland Pool


1.9




Renal ca. 786-0 


0.0


Salivary Gland


0.1








Renal ca. A498 


1.1 


Thyroid (female) 


0.7


Renal ca. ACHN 


0.2


Pancreatic ca. CAPAN2 


0.0 


Renal ca. UO-31


0.3 


Pancreas Pool 


0.7


Table AZD. Panel 4.1D


Tissue Name 


Rel. Exp.(%)
Ag4138, Run
173118876


Tissue Name


Rel. Exp.(%)
Ag4138, Run
173118876


Secondary Th1 act


0.3


HUVEC IL-1beta


0.5 


Secondary Th2 act 


0.0 


HUVEC IFN gamma


1.4 


Secondary Tr1 act


0.0 


HUVEC TNF alpha + IFN gamma


0.0 


Secondary Th1 rest


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11


0.7 


Secondary Tr1 rest


0.0 


Lung Microvascular EC none 


0.0 


Primary Th1 act


0.0 


Lung Microvascular EC TNFalpha 
+ IL-1 beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none


0.0 


Primary Tr1 act


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


0.0 


Primary Th1 rest


0.0 


Bronchial epithelium TNFalpha + 
IL1beta


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Tr1 rest


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 lymphocyte 
act 


0.8 


Coronery artery SMC rest 


0.0 


CD45RO CD4 lymphocyte 
act 


0.3 


Coronery artery SMC TNFalpha + 
IL-1 beta 


0.7 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.4 


Secondary CD8 lymphocyte 
rest 


0.5 


Astrocytes TNFalpha + IL-1 beta


0.0 


Secondary CD8 lymphocyte 
act 


0.0 


KU-812 (Basophil) rest


0.8 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil)
PMA/ionomycin


4.5 


2ry Th1/Th2/Tr1 anti-CD95
CH11


0.0 


CCD1106 (Keratinocytes) none


0.0


LAK cells rest 


0.0 


CCD1106 (Keratinocytes)
TNFalpha + IL-1 beta 


10 


LAK cells IL-2 


0.7 


Liver cirrhosis


0.1


LAK cells IL-2+IL-12 


0.0


NCI-H292 none


0.0


LAK cells IL-2+IFN gamma


0.0 


NCI-H292 IL-4 


0.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-9 


0.0 


LAK cells PMA/ionomycin


— — 


NCI-H292 IL-13


0.0 


NK Cells IL-2 rest


— - 


NCI-H292 IFN gamma


0.0 


Two Way MLR 3 day


0.0 


HPAEC none 


0.0 


Two Way MLR 5 day


0.0


HPAEC TNF alpha + IL-1 beta


0.0 


Two Way MLR 7 day


— - — 


Lung fibroblast none 


0.9 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1 
beta 


0.0 


PBMC PWM




Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


2.0 


Lung fibroblast IL-9 


0.9 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13


0.0


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest


0.0 


B lymphocytes CD40L and 
IL-4 


0.4


Dermal fibroblast CCD1070 TNF
alpha


0.4


EOL-1 dbcAMP


2.7 


Dermal fibroblast CCD1070 IL-1
beta 


0.0 


EOL-1 dbcAMP
PMA/ionomycin


0.5 


Dermal fibroblast IFN gamma 


1.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.5 


Dendritic cells LPS 


1.0 


Dermal Fibroblasts rest 


2.0 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.8 


Monocytes LPS 


0.0 


Colon 


0.8 


Macrophages rest 


0.0 


Lung 


0.9 


Macrophages LPS 


0.0 


Thymus 


13.4 


HUVEC none 


0.2 


Kidney 


100.0 


HUVEC starved 


0.5 






Table AZE. General oncology screening panel v 2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4138,Run 
268390088 


Tissue Name 


Rel. 

Exp.(%) 
Ag4138, 
Run 

268390088 


Colon cancer 1 


3.1 


Bladder cancer NAT 2 


0.0 


Colon NAT 1 


0.0 


Bladder cancer NAT 3 


0.0 


Colon cancer 2 


11.8 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 2 


7.6 


Adenocarcinoma of the prostate 1 


0.0


Colon cancer 3


0.0 


Adenocarcinoma of the prostate 2 


0.0 


Colon cancer NAT 3 


0.0 


Adenocarcinoma of the prostate 3 


14.5 


Colon malignant cancer 4 


32.1 


Adenocarcinoma of the prostate 4 


5.4 


Colon normal adjacent
tissue 4


0.0 


Prostate cancer NAT 5 


10.4 


Lung cancer 1 


6.0 


Adenocarcinoma of the prostate 6 


29.7 


Lung NAT 1


0.0 


Adenocarcinoma of the prostate 7 


0.0 




Lung cancer 2


74.7


Adenocarcinoma of the prostate 8




Lung NAT 2 


3.7 


Adenocarcinoma of the prostate 9 


6.5 




Squamous cell carcinoma 3


21.8


Prostate cancer NAT 10


0.0 


Lung NAT 3


0.0 




Kidney cancer 1


0.0


metastatic melanoma 1


6.2


Kidney NAT 1


4.2 


Melanoma 2 


0.0 


Kidney cancer 2


24.3 


Melanoma 3 


0.0 


Kidney NAT 2 


100.0 


metastatic melanoma 4 


8.3 


Kidney cancer 3 


26.4 


metastatic melanoma 5 


11.0 


Kidney NAT 3 


7.5 


Bladder cancer 1 


0.0 


Kidney cancer 4 


11.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4


8.5 


Bladder cancer 2 


4- ! 



CNS_neurodegeneration_v1.0 Summary: Ag4138 This panel confirms the
expression of the CG98021-01 gene at low levels in the brains of an independent group of
individuals. However, no differential expression of this gene was detected between
Alzheimer's diseased postmortem brains and those of non-demented controls in this
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in
treatment of central nervous system disorders.

General_screening_panel_v1.4 Summary: Ag4138 Highest expression of the
CG98021-01 gene is detected in cerebellum and fetal brain (CT=28). In addition, high
expression of this gene is detected exclusively in the regions of the central nervous system
examined. Therefore, expression of this gene can be used to distinguish CNS samples
from other samples used in this panel. In addition, therapeutic modulation of this gene
may be useful in the treatment of CNS disorders such as Alzheimer's disease, Parkinson's
disease, epilepsy, multiple sclerosis, schizophrenia and depression.

Low expression of this gene is also seen in a few colon cancer, renal cancer,
lung cancer, breast cancer, ovarian cancer and melanoma cell lines. Therefore, therapeutic
modulation of this gene may be beneficial in the treatment of these cancers.

Low expression of this gene is also detected in fetal lung. Interestingly, this gene is
expressed at much higher levels in fetal lung (CT=34.8) than in adult lung (CT=40).
This observation suggests that expression of this gene can be used to distinguish fetal from
adult lung. In addition, the relative overexpression of this gene in fetal lung suggests that
the protein product may enhance lung growth or development in the fetus and thus may
also act in a regenerative capacity in the adult. Therefore, therapeutic modulation of the
protein encoded by this gene could be useful in treatment of lung related diseases.
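
The CT values quoted in these summaries can be related to one another with a short calculation. The following is a minimal sketch, not the procedure used to generate the tables: it assumes ideal amplification (the template doubles every cycle) and equal RNA input per well. The function names are illustrative only, and CT=28 is taken from the summary above as the approximate value of the highest-expressing sample.

```python
def fold_difference(ct_low: float, ct_high: float, efficiency: float = 2.0) -> float:
    """Fold excess of the lower-CT sample over the higher-CT sample.

    Assumes the template multiplies by `efficiency` each cycle
    (2.0 means perfect doubling, which is an idealization).
    """
    return efficiency ** (ct_high - ct_low)


def relative_expression_percent(ct_sample: float, ct_reference: float) -> float:
    """Expression of a sample as a percentage of the lowest-CT (reference) sample."""
    return 100.0 / fold_difference(ct_reference, ct_sample)


# Fetal lung (CT=34.8) compared with adult lung (CT=40).
fetal_vs_adult = fold_difference(34.8, 40.0)

# Fetal lung compared with the highest-expressing sample (CT of about 28).
fetal_percent = relative_expression_percent(34.8, 28.0)

print(f"{fetal_vs_adult:.1f}-fold, {fetal_percent:.1f}%")
```

Under these assumptions the fetal lung signal is roughly 37-fold above adult lung and close to 0.9% of the maximum, which is in line with the Fetal Lung entry in the general screening panel table. Real assays rarely reach perfect doubling, so the figures should be read as approximate.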

Panel 4.1D Summary: Ag4138 Highest expression of the CG98021-01 gene is
detected in kidney (CT=29.6). In addition, low expression of this gene is also detected in
thymus, eosinophils and PMA/ionomycin treated basophils. Therefore, expression of this
gene can be used to distinguish these samples from other samples used in this panel. In
addition, therapeutic modulation of this gene may be useful in the treatment of
inflammatory and autoimmune diseases that affect the kidney, including lupus and
glomerulonephritis.

General oncology screening panel_v_2.4 Summary: Ag4138 Highest
expression of the CG98021-01 gene is detected in kidney (CT=34.5). In addition, low
expression of this gene is also seen in a lung cancer sample. Please see Panels 1.4 and 4.1D
for a discussion of the potential utility of this gene.

BA. CG98030-01: Tyrosine Phosphatase 

Expression of gene CG98030-01 was assessed using the primer-probe set Ag4139, 
described in Table BAA. Results of the RTQ-PCR runs are shown in Tables BAB and 
BAC. 



Table BAA . Probe Name Ag4139 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5'-ccaattccatacaaaccaagag-3'


22 


548 


395 


Probe 


TET-5'-actcagtttctaactcctctccgcaa-3'-TAMRA


26 


595 


396 


Reverse 


5'-tgggatcacagcaagagaatat-3'


22 


622 


397 
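
A primer-probe table of this kind can be sanity-checked programmatically. The sketch below is an illustration only: it checks that each listed length matches its sequence and estimates the amplicon span for Ag4139. It assumes that the Start Position column gives the first target coordinate covered by each oligonucleotide on a single shared coordinate system, which the text does not state; the class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Oligo:
    role: str
    sequence: str
    listed_length: int
    start: int


# Values transcribed from Table BAA (fluorophore labels omitted).
AG4139 = [
    Oligo("Forward", "ccaattccatacaaaccaagag", 22, 548),
    Oligo("Probe", "actcagtttctaactcctctccgcaa", 26, 595),
    Oligo("Reverse", "tgggatcacagcaagagaatat", 22, 622),
]


def length_mismatches(oligos):
    """Return the roles whose listed length disagrees with the sequence."""
    return [o.role for o in oligos if len(o.sequence) != o.listed_length]


def amplicon_length(forward, reverse):
    """Span from the forward start to the end of the reverse primer site.

    Assumption: both start positions are on the same strand coordinates.
    """
    return reverse.start + reverse.listed_length - forward.start


def probe_inside(forward, probe, reverse):
    """True if the probe lies between the two primer sites without overlap."""
    return (forward.start + forward.listed_length <= probe.start
            and probe.start + probe.listed_length <= reverse.start)


forward, probe, reverse = AG4139
print(length_mismatches(AG4139), amplicon_length(forward, reverse),
      probe_inside(forward, probe, reverse))
```

Under the stated assumption the three listed lengths agree with their sequences and the probe sits between the primers, which gives some support to the coordinate reading used here.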



Table BAB. CNS_neurodegeneration_v1.0









Tissue Name


Rel. Exp.(%)
Ag4139, Run
214962531


Tissue Name


Rel. Exp.(%)
Ag4139, Run
214962531


AD 1 Hippo


21.2 


Control (Path) 3 Temporal Ctx 


0.0


AD 2 Hippo 


12.4 


Control (Path) 4 Temporal Ctx 


12.8


AD 3 Hippo 


6.2 


AD 1 Occipital Ctx 


13.4


AD 4 Hippo 


18.6 


AD 2 Occipital Ctx (Missing) 


0.0


AD 5 Hippo 


62.4 


AD 3 Occipital Ctx 


10.8


AD 6 Hippo 


73.7 


AD 4 Occipital Ctx 




Control 2 Hippo 


21.0 


AD 5 Occipital Ctx 




Control 4 Hippo 


14.8 


AD 6 Occipital Ctx 


30.4 


Control (Path) 3 Hippo


1.7 


Control 1 Occipital Ctx 


6.3 


AD 1 Temporal Ctx 


21.9 


Control 2 Occipital Ctx 


28.1 


AD 2 Temporal Ctx 


26.6 


Control 3 Occipital Ctx 


19.5 


AD 3 Temporal Ctx 


5.3 


Control 4 Occipital Ctx 


1.5 


AD 4 Temporal Ctx 


6.6 


Control (Path) 1 Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx 


56.3 


Control (Path) 2 Occipital Ctx


6.6


AD 5 Sup Temporal Ctx 


84.7 


Control (Path) 3 Occipital Ctx 


2.5


AD 6 Inf Temporal Ctx 


91.4 


Control (Path) 4 Occipital Ctx 


4.8 


AD 6 Sup Temporal Ctx 




Control 1 Parietal Ctx 


3.7 


Control 1 Temporal Ctx 


1.1 


Control 2 Parietal Ctx 


37.4 


Control 2 Temporal Ctx 




Control 3 Parietal Ctx 


47.5


Control 3 Temporal Ctx 


27.5 


Control (Path) 1 Parietal Ctx 


62.9 


Control 4 Temporal Ctx


3.1 


Control (Path) 2 Parietal Ctx 


24.8 


Control (Path) 1 Temporal 
Ctx 


31.9 


Control (Path) 3 Parietal Ctx 


0.0 


Control (Path) 2 Temporal 
Ctx 


28.7 


Control (Path) 4 Parietal Ctx 


75.8 



Table BAC. General oncology screening panel_v_2.4


Tissue Name


Rel. Exp.(%) 
Ag4139, Run 
268392257 


Tissue Name 


Rel. 

Exp.(%) 
Ag4139, 
Run 

268392257 


Colon cancer 1 


24.7 


Bladder cancer NAT 2 


0.0 


Colon cancer NAT 1 


0.7 


Bladder cancer NAT 3 


1.1 


Colon cancer 2 


19.2 


Bladder cancer NAT 4 


0.0 


Colon cancer NAT 2 


0.6 


Adenocarcinoma of the prostate 1 


2.0 


Colon cancer 3 


43.8 


Adenocarcinoma of the prostate 2 


1.1 


Colon cancer NAT 3 


1.0 


Adenocarcinoma of the prostate 3 


2.8 


Colon malignant cancer 4 


100.0 


Adenocarcinoma of the prostate 4 


9.1 


Colon normal adjacent tissue 4


0.0 


Prostate cancer NAT 5 


0.4








Lung cancer 1 


~ - 


Adenocarcinoma of the prostate 6 


0.8 


Lung NAT 1 




Adenocarcinoma of the prostate 7 


0.8 






Adenocarcinoma of the prostate 8 


0.6 


Lung NAT 2 




Adenocarcinoma of the prostate 9 




Squamous cell carcinoma 3


18.2 


Prostate cancer NAT 10 


0.3 


Lung NAT 3 


0.2 


Kidney cancer 1 


3.4 


metastatic melanoma 1 


4.1 


Kidney NAT 1


1.0 


Melanoma 2 


2.0 


Kidney cancer 2 


9.5 


Melanoma 3 


0.8 


Kidney NAT 2 


2.3 


metastatic melanoma 4 


45 


Kidney cancer 3 


1.7 


metastatic melanoma 5 


3.1 


Kidney NAT 3 


6.4 


Bladder cancer 1 


0.1 


Kidney cancer 4 


6.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


1.4 


Bladder cancer 2 


0.8 


1 



CNS_neurodegeneration_v1.0 Summary: Ag4139 This panel confirms the
expression of the CG98030-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

Panel 4.1D Summary: Ag4139 Expression of the CG98030-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General oncology screening panel_v_2.4 Summary: Ag4139 Highest
expression of the CG98030-01 gene is detected in a malignant colon cancer sample
(CT=29). Interestingly, higher expression of this gene is associated with a number of the cancer
samples examined, including colon, lung, melanoma, metastatic melanoma, adenocarcinoma
of the prostate, and kidney cancers. Therefore, expression of this gene may be used as a
diagnostic marker for these cancers. In addition, therapeutic modulation of this gene may
be useful in the treatment of these cancers.



BB. CG98030-02: Tyrosine Phosphatase

Expression of full length physical clone CG98030-02 was assessed using the
primer-probe set Ag6401, described in Table BBA. Results of the RTQ-PCR runs are
shown in Table BBB.

Table BBA . Probe Name Ag6401 



Primers 


Sequences 


Length


Start 
Position 


SEQ ID 

No 


Forward 


5'-atgaatctgtagaaaaggaggaactaa-3'


27


1676 


398 


Probe 


TET-5'-tctttctgccacatttctaccttcct-3'-TAMRA


26


1705 


399 


Reverse 


5'-ctccatctcgggaattaagc-3'


20


1731 


400 



Table BBB. Panel 4.1D



Tissue Name 


Rel. Exp.(%) 
Ag6401,Run 
269239989 


Tissue Name 


Rel. Exp.(%) 
Ag6401,Run 
269239989 


Secondary Th1 act


17.2 


HUVEC IL-1beta


37.6 


Secondary TrT act" 




HUVEC IFN gamma


33.9




5.0 


HUVEC TNF alpha + IFN gamma


16.5 


Secondary Th1 rest


0.0 


HUVEC TNF alpha + IL4 


23.8 


Secondary Th2 rest 


4.9 


HUVEC IL-11


13.4 


Secondary Tr1 rest


3.7 


Lung Microvascular EC none 


100.0 


Primary Th1 act


58.2 


Lung Microvascular EC TNFalpha 
+ IL-1beta


32.8 


Primary Th2 act 


28.3 


Microvascular Dermal EC none 


14.1 


Primary Tr1 act


35.4 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta


9.3 


Primary Th1 rest


1.8 


Bronchial epithelium TNFalpha + 
IL1beta


21.0 


Primary Th2 rest 


2.5 


Small airway epithelium none 


12.2 


Primary Tr1 rest


3.2 


Small airway epithelium TNFalpha 
+ IL-1beta


11.9 


CD45RA CD4 lymphocyte 
act 


30.6 


Coronery artery SMC rest 


37.9 


CD45RO CD4 lymphocyte 
act 


54.3 


Coronery artery SMC TNFalpha + 
IL-1beta


49.0 


CD8 lymphocyte act 


37.1 


Astrocytes rest 


19.9 


Secondary CD8 lymphocyte 
rest 


5.8 


Astrocytes TNFalpha + IL-1beta


12.9 


Secondary CD8 lymphocyte
act 


5.4 


KU-812 (Basophil) rest


13.4 


CD4 lymphocyte none 


3.6 


KU-812 (Basophil)
PMA/ionomycin 


5.3


2ry Th1/Th2/Tr1 anti-CD95
CH11




CCD1106 (Keratinocytes) none




LAK cells rest 


21.8 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


41.5 


LAK cells IL-2


5.6 


Liver cirrhosis 


17.3 


LAK cells IL-2+IL-12





NCI-H292 none


13.3 


LAK cells IL-2+IFN gamma


3.4 


NCI-H292 IL-4 


33.4 


LAK cells IL-2+ IL-18 


6.0 


NCI-H292 IL-9 


68.3 

24.7 


LAK cells PMA/ionomycin 


6 '- 9 


NCI-H292 IL-13 


NK Cells IL-2 rest


16.3 


NCI-H292 IFN gamma


25.0 


Two Way MLR 3 day 


5.1 


HPAEC none 


20.2 


Two Way MLR 5 day


3.1


HPAEC TNF alpha + IL-1 beta 


64.6 


Two Way MLR 7 day


2.3 


Lung fibroblast none 


61.1 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1
beta


16.5 


PBMC PWM




Lung fibroblast IL-4 


17.0 


PBMC PHA-L 


17.7 


Lung fibroblast IL-9 


14.1 


Ramos (B cell) none 


6.8 


Lung fibroblast IL-13 


9.9 


Ramos (B cell) ionomycin 


9.5 


Lung fibroblast IFN gamma


15.2


B lymphocytes PWM 


8.0 


Dermal fibroblast CCD1070 rest


78.5 


B lymphocytes CD40L and 
IL-4 




Dermal fibroblast CCD1070 TNF
alpha 




EOL-1 dbcAMP 


6.2 


Dermal fibroblast CCD1070 IL-1
beta 


47.0 


EOL-1 dbcAMP
PMA/ionomycin


0.0 


Dermal fibroblast IFN gamma 


9.3 


Dendritic cells none 


10.2 


Dermal fibroblast IL-4 


47.6 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


39.8 


Dendritic cells anti-CD40 


10.2 


Neutrophils TNFa+LPS 


1.8 


Monocytes rest 


1.3 


Neutrophils rest 


4.1 


Monocytes LPS 


5.9 


Colon 


3.3 


Macrophages rest 


2.5 


Lung 


6.7 


Macrophages LPS 


0.0 


Thymus 


4.0


HUVEC none 


16.8 


Kidney 


38.7 


HUVEC starved 


54.3 







Panel 4.1D Summary: Ag6401 Highest expression of the CG98030-02 gene is
detected in lung microvascular endothelial cells and keratinocytes (CT=32.8). In addition,
low levels of expression of this gene are also seen in dermal fibroblasts, lung fibroblasts, TNF
alpha + IL-1 beta treated HPAEC, HUVEC, resting LAK cells, activated primary Th1,
Th2, and Tr1 cells, activated CD4 lymphocytes and kidney. Therefore, therapeutic
modulation of the tyrosine phosphatase encoded by this gene may lead to the alteration of
functions associated with these cell types and lead to improvement of the symptoms of
patients suffering from autoimmune and inflammatory diseases such as asthma, allergies,
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and
osteoarthritis.

BC. CG98061-01 and CG98061-02: Novel Protein containing Histidine acid 
phosphatase domain 

Expression of gene CG98061-01 and variant CG98061-02 was assessed using the 
primer-probe set Ag4141, described in Table BCA. Results of the RTQ-PCR runs are 
shown in Tables BCB, BCC and BCD. 

Table BCA . Probe Name Ag4141 



Primers



Sequences 



Length



-gtgc ttggcaactgatgttc-3 ' 



™ T ~ 5 ' l tg ff acact 99 a 9 tct aatctccaaca-3 ' -TAMRA 



Reverse


5'-gcaccagagtctgttaattcca-3'



Start 
Position 



2241 



2265 



Table BCB. CNS_neurodegeneration_v1.0



2303 



SEQ ID

No 



401 



402 



403 



Tissue Name 


Rel. Exp.(%) 
Ag4141,Run 
215281611 


Tissue Name 


Rel. 

Exp.(%) 
Ag4141, 
Run 

215281611 


AD 1 Hippo 


15.9 


Control (Path) 3 Temporal Ctx 


4.4 


AD 2 Hippo 


20.7 


Control (Path) 4 Temporal Ctx 


13.1 


AD 3 Hippo 


6.6 


AD 1 Occipital Ctx 


13.1 


AD 4 Hippo 


2.8 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


58.6 


AD 3 Occipital Ctx 


6.7 


AD 6 Hippo 


100.0 


AD 4 Occipital Ctx 


12.4 


Control 2 Hippo 


17.7 


AD 5 Occipital Ctx 


18.0 


Control 4 Hippo 


13.9 


AD 6 Occipital Ctx 


21.9 


Control (Path) 3 Hippo 


7.1 


Control 1 Occipital Ctx 


6.0 


AD 1 Temporal Ctx 


13.5 


Control 2 Occipital Ctx 


37.4 


AD 2 Temporal Ctx 


23.3 


Control 3 Occipital Ctx 


11.8 


AD 3 Temporal Ctx 


1.6 


Control 4 Occipital Ctx 


8.0 


AD 4 Temporal Ctx 


12.9 


Control (Path) 1 Occipital Ctx 


48.6


AD 5 Inf Temporal Ctx


62.4 


Control (Path) 2 Occipital Ctx 


6.3 


AD 5 Sup Temporal Ctx 


48.3 


Control (Path) 3 Occipital Ctx 


5.3 


AD 6 Inf Temporal Ctx 


83.5 


Control (Path) 4 Occipital Ctx 


7.5 


AD 6 Sup Temporal Ctx 


70.7 


Control 1 Parietal Ctx 


7.0 


Control 1 Temporal Ctx


4.5 


Control 2 Parietal Ctx 


40.9 


Control 2 Temporal Ctx 


18.7 


Control 3 Parietal Ctx 


11.5 


Control 3 Temporal Ctx 


10.5 


Control (Path) 1 Parietal Ctx 


37.9 


Control 4 Temporal Ctx


7.5 


Control (Path) 2 Parietal Ctx 


18.3 


Control (Path) 1 Temporal Ctx


30.8 


Control (Path) 3 Parietal Ctx 


5.4 


Control (Path) 2 Temporal Ctx 


22.1 


Control (Path) 4 Parietal Ctx 


29.3 


Table BCC. Panel 4.1D


Tissue Name 


Rel. Exp.(%) 
Ag4141, Run 
173118944 


Tissue Name 


Rel. 

Exp.(%) 
Ag4141, 
Run 

173118944 


Secondary Th1 act


82.4 


HUVEC IL-1beta


19.8 


Secondary Th2 act 


100.0 


HUVEC IFN gamma


17.6


Secondary Tr1 act


62.0 


HUVEC TNF alpha + IFN gamma 


15.4 


Secondary Th1 rest


6.9


HUVEC TNF alpha + IL4


25.3 


Secondary Th2 rest 


11.8 


HUVEC IL-11


8.3 


Secondary Tr1 rest


11.2 


Lung Microvascular EC none 


24.5 


Primary Th1 act


61.1 


Lung Microvascular EC TNFalpha 
+ IL-1beta


17.2 


Primary Th2 act 


88.3 


Microvascular Dermal EC none 


14.1 


Primary Tr1 act


88.3 


Microsvasular Dermal EC 
TNFalpha + IL-1beta


11.8 


Primary Th1 rest


11.2 


Bronchial epithelium TNFalpha + 
IL1beta


12.6 


Primary Th2 rest 


5.8 


Small airway epithelium none 


5.8 


Primary Tr1 rest


15.4 


Small airway epithelium TNFalpha 
+ IL-1beta


10.9 


CD45RA CD4 lymphocyte act 


55.9 


Coronery artery SMC rest 


9.4 


CD45RO CD4 lymphocyte act 


91.4 


Coronery artery SMC TNFalpha + 
IL-1beta


8.4 


CD8 lymphocyte act


60.7 


Astrocytes rest 


10.2 


Secondary CD8 lymphocyte
rest 


56.6 


Astrocytes TNFalpha + IL-1 beta


8.2 


Secondary CD8 lymphocyte
act 


21.8 


KU-812 (Basophil) rest 


35.4 


CD4 lymphocyte none 


4.3 


KU-812 (Basophil)
PMA/ionomycin


66.0




2ry Th1/Th2/Tr1 anti-CD95
CH11


14.6 


CCD1106 (Keratinocytes) none


33.4 


LAK cells rest 


19.2 


CCD1106 (Keratinocytes)
TNFalpha + IL-1beta


31.2 


LAK cells IL-2 


28.9 


Liver cirrhosis 


2.7


LAK cells IL-2+IL-12


33.7


NCI-H292 none 


26.6 


LAK cells IL-2+IFN gamma


18.3


NCI-H292 IL-4


63.7


LAK cells IL-2+IL-18


16.6


NCI-H292 IL-9


69.7


LAK cells PMA/ionomycin 


20.4 


NCI-H292 IL-13 


69 ^ 


NK Cells IL-2 rest 


40.1 


NCI-H292 IFN gamma 


67.8 


Two Way MLR 3 day 


24.3 


HPAEC none 


11.7


Two Way MLR 5 day 


29.7 


HPAEC TNF alpha + IL-1 beta


24.1 


Two Way MLR 7 day 


23.8 


Lung fibroblast none 


10.9 


PBMC rest


3.4 


Lung fibroblast TNF alpha + IL-1 ' 
beta ' I'" 5 


PBMC PWM


30.4


Lung fibroblast IL-4


24.1


PBMC PHA-L


31.6


Lung fibroblast IL-9


22.8


Ramos (B cell) none


26.1


Lung fibroblast IL-13


24.3


Ramos (B cell) ionomycin


30.1


Lung fibroblast IFN gamma


39.2


B lymphocytes PWM


43.2


Dermal fibroblast CCD1070 rest


42.3


B lymphocytes CD40L and IL-4


25.0 


Dermal fibroblast CCD1070 TNF
alpha 


54.3 


EOL-1 dbcAMP 


51.8 


Dermal fibroblast CCD1070 IL-1 
beta 


26.1 


EOL-1 dbcAMP 
PMA/ionomycin 


20.2 


Dermal fibroblast IFN gamma 


29.1 


Dendritic cells none 


14.4 


Dermal fibroblast IL-4 


36.9 


Dendritic cells LPS 


12.1 


Dermal Fibroblasts rest 


15.0 


Dendritic cells anti-CD40 


15.9 


Neutrophils TNFa+LPS 


2.0 


Monocytes rest 


7.1 


Neutrophils rest 


5.6 


Monocytes LPS 


28.9 


Colon 


5.8 


Macrophages rest 


10.9 


Lung 


11.8 


Macrophages LPS 


9.1 


Thymus 


24.5 


HUVEC none 


14.6 


Kidney 


14.5 


HUVEC starved 


21.9 






Table BCD. General oncology screening panel v 2.4


Tissue Name 


Rel. Exp.(%) 
Ag4141,Run 
268392261 


Tissue Name 


Rel. 

Exp.(%) 
Ag4141,
Run

268392261 


Colon cancer 1 


0.6 


Bladder cancer NAT 2 


0.0 


Colon cancer NAT 1 


0.0 


Bladder cancer NAT 3 


0.8 


Colon cancer 2 


0.0 


Bladder cancer NAT 4 


1.3 


Colon cancer NAT 2 


0.6 


Adenocarcinoma of the prostate 1 


7.5 


Colon cancer 3 


1.3 


Adenocarcinoma of the prostate 2 


0.0 


Colon cancer NAT 3 


1.3 


Adenocarcinoma of the prostate 3 


10.4 


Colon malignant cancer 4 


8.7 


Adenocarcinoma of the prostate 4 


3.4 


Colon normal adjacent tissue 4 


0.0 


Prostate cancer NAT 5 


5.4 


Lung cancer 1 


8.4 


Adenocarcinoma of the prostate 6 


5.2 


Lung NAT 1


3.4


Adenocarcinoma of the prostate 7 


1.1 


Lung cancer 2 


23.7 


Adenocarcinoma of the prostate 8 


1.4 


Lung NAT 2


1.8 


Adenocarcinoma of the prostate 9 


17.4 


Squamous cell carcinoma 3 


2.3 


Prostate cancer NAT 10 


0.4 


Lung NAT 3 


3.2 

Kidney cancer 1


20.6


metastatic melanoma 1


29.5


Kidney NAT 1


4.6


Melanoma 2 


0.5 


Kidney cancer 2 


100.0 

__ 


Melanoma 3 


0.0 


Kidney NAT 2 


metastatic melanoma 4


6.4


Kidney cancer 3 


31.6 


metastatic melanoma 5


6.4


Kidney NAT 3 


4.5 


Bladder cancer 1 


0.9 


Kidney cancer 4 


3.1 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


1.5 


Bladder cancer 2 


0.9 







CNS_neurodegeneration_v1.0 Summary: Ag4141 This panel confirms the
expression of the CG98061-01 gene at low levels in the brain in an independent group of 
individuals. This gene is found to be slightly upregulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or 
function of this gene may decrease neuronal death and be of use in the treatment of this 
disease. 

General_screening_panel_v1.4 Summary: Ag4141 Results from one
experiment with the CG98061-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Panel 4.1D Summary: Ag4141 Highest expression of the CG98061-01 gene is
detected in activated secondary Th2 cells (CT=29.6). This gene is expressed at high to
moderate levels in a wide range of cell types of significance in the immune response in
health and disease. These cells include members of the T-cell, B-cell, endothelial cell,
macrophage/monocyte, and peripheral blood mononuclear cell families, as well as epithelial
and fibroblast cell types from lung and skin, and normal tissues represented by colon,
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene
product may be involved in homeostatic processes for these and other cell types and
tissues. This pattern suggests a role for the gene product in cell survival and proliferation.
Therefore, modulation of the gene product with a functional therapeutic may lead to the
alteration of functions associated with these cell types and lead to improvement of the
symptoms of patients suffering from autoimmune and inflammatory diseases such as
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid
arthritis, and osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4141 Highest
expression of the CG98061-01 gene is detected in a kidney cancer sample (CT=31.8). In
addition, significant expression of this gene is also seen in lung cancer, metastatic
melanoma, kidney cancer and adenocarcinoma of the prostate. Therefore, therapeutic
modulation of this gene may be beneficial in the treatment of these cancers.



BD. CG98131-01: MDJ6 

Expression of gene CG98131-01 was assessed using the primer-probe set Ag4144, 
described in Table BDA. Results of the RTQ-PCR runs are shown in Tables BDB, BDC 
and BDD. 

Table BDA . Probe Name Ag4144 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID No 


Forward 


5'-cggaggacatcaagaaagc-3'


19 


137 


404 


Probe 


TET-5'-ctaccgcaagctggcccttcgtt-3'-TAMRA


23 


156 


405 


Reverse 


5'-tcctccttattgtcagggttct-3'


22 


191 


406 



Table BDB. CNS_neurodegeneration_v1.0



Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

215294700 


Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

215294700 


AD 1 Hippo 


0.0 


Control (Path) 3 Temporal Ctx 


0.0 


AD 2 Hippo 


0.0 


Control (Path) 4 Temporal Ctx 


0.0


AD 3 Hippo


0.0


AD 1 Occipital Ctx


0.0


AD 4 Hippo 


0.0 


AD 2 Occipital Ctx (Missing)


0.0


AD 5 Hippo


0.0 


AD 3 Occipital Ctx


0.0


AD 6 Hippo 


0.0 


AD 4 Occipital Ctx


0.0


Control 2 Hippo 


0.0 


AD 5 Occipital Ctx


0.0


Control 4 Hippo 


0.0 


AD 6 Occipital Ctx


0.0


Control (Path) 3 Hippo 


24.8 


Control 1 Occipital Ctx


0.0


AD 1 Temporal Ctx 




Control 2 Occipital Ctx


13.2


AD 2 Temporal Ctx 


0.0 


Control 3 Occipital Ctx


0.0


AD 3 Temporal Ctx 


0.0 


Control 4 Occipital Ctx 


0.0 


AD 4 Temporal Ctx 


21.6


Control (Path) 1 Occipital Ctx 


19.5 


AD 5 Inf Temporal Ctx 


0.0 


Control (Path) 2 Occipital Ctx 


0.0 


AD 5 Sup Temporal Ctx


0.0 


Control (Path) 3 Occipital Ctx 


0.0 


AD 6 Inf Temporal Ctx 


0.0 


Control (Path) 4 Occipital Ctx 


0.0


AD 6 Sup Temporal Ctx 




Control 1 Parietal Ctx 


0.0


Control 1 Temporal Ctx 


0.0 


Control 2 Parietal Ctx


0.0


Control 2 Temporal Ctx 


47.3 


Control 3 Parietal Ctx 


0.0


Control 3 Temporal Ctx


0.0


Control (Path) 1 Parietal Ctx


47.0


Control 4 Temporal Ctx 


0.0 


Control (Path) 2 Parietal Ctx


0.0


Control (Path) 1 Temporal Ctx 


0.0 


Control (Path) 3 Parietal Ctx 


0.0 


Control (Path) 2 Temporal Ctx 


0.0 


Control (Path) 4 Parietal Ctx 


0.0 



Table BDC. General_screening_panel_v1.4



Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

221000453 


Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

221000453 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI-N87 


1.1 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.6 


Melanoma* LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK-MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell carcinoma SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.0 


Testis Pool 


100.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone met) PC-3 


0.0 


Colon ca. HCT-116


0.9 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 






Ovarian ca. OVCAR-3 


2.7 


Colon ca. Colo-205 


0.5 


Ovarian ca. SK-OV-3 


0.6 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.0 


Colon Pool


0.6


Ovarian ca. OVCAR-5 


1.0 


Small Intestine Pool 


1.3 


Ovarian ca. IGROV-1 


0.0 


Stomach Pool


0.0


Ovarian ca. OVCAR-8


0.6 


Bone Marrow Pool 


0.5 




Ovary


0.0


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA-MB-231 


0.2 


Lymph Node Pool 


0.5 


Breast ca. BT 549 


0.8 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.2 


Thymus Pool 


2.0 


Trachea 


0.0 


CNS cancer (glio/astro) U87-MG 


_____ 


Lung 


0.0 


CNS cancer (glio/astro) U-118-MG


0.0 


Fetal Lung 




0.0 


CNS cancer (neuro;met) SK-N-AS


2.0 


Lung ca. NCI-N417


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) SNB-75 


1.1 


Lung ca. NCI-H146 


0.6 


CNS cancer (glio) SNB-19 


0.0 


Lung ca. SHP-77 


0.6 


CNS cancer (glio) SF-295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.5 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


1.5 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) Pool 


0.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


1.5 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool 


0.5 




Liver


0.0


Brain (Thalamus) Pool


0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.4 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


2.6 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


2.5 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.7 



Table BDD. Panel 4.1D


Tissue Name


Rel. 

Exp.(%) 
Ag4144, 
Run 

173120070 


Tissue Name 


Rel. 

Exp.(%) 
Ag4144, 
Run 

173120070 


Secondary Th1 act 


0.0 


HUVEC IL-1beta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Tr1 act 


13.4 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Th1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Tr1 rest 


2.2 


Lung Microvascular EC none 


0.0 


Primary Th1 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Tr1 act 


3.7 


Microvascular Dermal EC 
TNFalpha + IL-1beta 


0.0 


Primary Th1 rest 


0.0 


Bronchial epithelium TNFalpha + 
IL-1beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium none 


0.0 


Primary Tr1 rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 lymphocyte act 


0.0 


Coronary artery SMC rest 


0.0 


CD45RO CD4 lymphocyte act 


0.0 


Coronary artery SMC TNFalpha 
+ IL-1beta 


0.0 


CD8 lymphocyte act 


3.1 


Astrocytes rest 


1.6 


Secondary CD8 lymphocyte rest 


3.8 


Astrocytes TNFalpha + IL-1beta 


0.0 


Secondary CD8 lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


2.7 


KU-812 (Basophil) 
PMA/ionomycin 


10.1 


2ry Th1/Th2/Tr1 anti-CD95 
CH11 


0.0 


CCD1106 (Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1beta 


3.0 


LAK cells IL-2 


3.0 


Liver cirrhosis 


3.0 


LAK cells IL-2+IL-12 


3.0 


NCI-H292 none 


7.8 


LAK cells IL-2+IFN gamma 


3.0 


NCI-H292 IL-4 


3.0 


LAK cells IL-2+IL-18 


3.0 


NCI-H292 IL-9 


3.0 


LAK cells PMA/ionomycin 


5.9 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


2.2 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 3 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


3.6 


PBMC rest 


9.1 


Lung fibroblast TNF alpha + IL-1 
beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest 


0.0 


B lymphocytes CD40L and IL-4 


0.0 


Dermal fibroblast CCD1070 TNF 
alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


7.2 


Neutrophils rest 


0.0 


Monocytes LPS 


6.4 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


10.7 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_v1.0 Summary: Ag4144 Highest expression of the 
CG98131-01 gene is detected in superior temporal cortex of an Alzheimer's disease patient 
(CT=34.1). Therefore, therapeutic modulation of the expression or function of this gene 
may decrease neuronal death and be of use in the treatment of this disease. 



General_screening_panel_v1.4 Summary: Ag4144 Highest expression of the 
CG98131-01 gene is detected in testis (CT=29). Therefore, expression of this gene may be 
used to distinguish testis from other samples used in this panel, and therapeutic modulation 
of this gene product may be useful in the treatment of disorders associated with the testis, 
such as fertility disorders and hypogonadism. 

In addition, low expression of this gene is also seen in CNS cancer, pancreatic 
cancer and ovarian cancer cell lines. Therefore, therapeutic modulation of this gene 
product may be beneficial in the treatment of these cancers. 

Panel 4.1D Summary: Ag4144 Highest expression of the CG98131-01 gene is 
detected exclusively in kidney (CT=32). Therefore, expression of this gene may be used 
to distinguish kidney from other samples used in this panel. In addition, therapeutic 
modulation of this gene product may be beneficial in the treatment of autoimmune and 
inflammatory diseases that affect the kidney, including lupus and glomerulonephritis. 

BE. CG98164-01 and CG98164-02: LRR and Kinase Domain Protein 

Expression of gene CG98164-01 and full length physical clone CG98164-02 was 
assessed using the primer-probe sets Ag4145 and Ag4145, described in Tables BEA and 
BEB. Results of the RTQ-PCR runs are shown in Tables BEC, BED, BEE and BEF. 
Please note that CG98164-02 represents a full-length physical clone of the CG98164-01 
gene, validating the prediction of the gene sequence. 



Table BEA. Probe Name Ag4145 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-agtgacagtgagaccgaagaga-3'                  22       2340             407 
Probe     TET-5'-cccggaaagcactacctatacaatca-3'-TAMRA    26       2362             408 
Reverse   5'-agtgttgtctgtgttggtgaga-3'                  22       2406             409 


Table BEB. Probe Name Ag4145 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-agtgacagtgagaccgaagaga-3'                  22       2340             410 
Probe     TET-5'-cccggaaagcactacctatacaatca-3'-TAMRA    26       2362             411 
Reverse   5'-agtgttgtctgtgttggtgaga-3'                  22       2406             412 
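
As an illustrative consistency check only, and not part of the disclosed method, the Ag4145 primer and probe rows listed above can be verified programmatically. The sketch below assumes that every start position is numbered on the same strand of the target sequence and marks the lowest coordinate of each oligonucleotide; the names `PRIMERS`, `length_mismatches` and `amplicon_span` are hypothetical.

```python
# Primer/probe rows transcribed from the Ag4145 table: (sequence, listed length, start position).
PRIMERS = {
    "Forward": ("agtgacagtgagaccgaagaga", 22, 2340),
    "Probe": ("cccggaaagcactacctatacaatca", 26, 2362),
    "Reverse": ("agtgttgtctgtgttggtgaga", 22, 2406),
}


def length_mismatches(primers):
    """Return the names of oligos whose sequence length differs from the listed length."""
    return [
        name
        for name, (seq, listed_length, _start) in primers.items()
        if len(seq) != listed_length
    ]


def amplicon_span(primers):
    """Span from the forward primer start to the end of the reverse primer site.

    Assumption: all start positions share one coordinate system and mark the
    lowest coordinate of each oligo on the target sequence.
    """
    _seq, _length, forward_start = primers["Forward"]
    _seq, reverse_length, reverse_start = primers["Reverse"]
    return reverse_start + reverse_length - forward_start


if __name__ == "__main__":
    print("length mismatches:", length_mismatches(PRIMERS))
    print("amplicon span (bp):", amplicon_span(PRIMERS))
```

Under these assumptions the listed lengths agree with the sequences, and the probe site (starting at position 2362) lies between the forward and reverse primer sites.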



Table BEC. CNS_neurodegeneration_v1.0 



Tissue Name 


Rel. 

Exp.(%) 
Ag4145, 
Run 

215300524 


Tissue Name 


Rel. 

Exp.(%) 
Ag4145, 
Run 

215300524 


AD 1 Hippo 


25.2 


Control (Path) 3 Temporal Ctx 


19.6 


AD 2 Hippo 


33.9 


Control (Path) 4 Temporal Ctx 


72.7 


AD 3 Hippo 


26.4 


AD 1 Occipital Ctx 


23.7 


AD 4 Hippo 


62.4 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


28.1 


AD 3 Occipital Ctx 


6.8 


AD 6 Hippo 


65.1 


AD 4 Occipital Ctx 


55.1 


Control 2 Hippo 


52.5 


AD 5 Occipital Ctx 


17.3 






WO 03/010327 



PCT/US02/14199 



Control 4 Hippo 


93.3 


AD 6 Occipital Ctx 


8.9 


Control (Path) 3 Hippo 


28.5 


Control 1 Occipital Ctx 


7.7 


AD 1 Temporal Ctx 


8.2 


Control 2 Occipital Ctx 


29.5 


AD 2 Temporal Ctx 


16.2 


Control 3 Occipital Ctx 


19.1 


AD 3 Temporal Ctx 


24.8 


Control 4 Occipital Ctx 


54.7 


AD 4 Temporal Ctx 


35.4 


Control (Path) 1 Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx 


57.0 


Control (Path) 2 Occipital Ctx 


27.4 


AD 5 Sup Temporal Ctx 


66.9 


Control (Path) 3 Occipital Ctx 


0.0 


AD 6 Inf Temporal Ctx 


34.2 


Control (Path) 4 Occipital Ctx 


15.6 


AD 6 Sup Temporal Ctx 


65.1 


Control 1 Parietal Ctx 


16.6 


Control 1 Temporal Ctx 


16.4 


Control 2 Parietal Ctx 


69.3 


Control 2 Temporal Ctx 


30.6 


Control 3 Parietal Ctx 


10.0 


Control 3 Temporal Ctx 


17.0 


Control (Path) 1 Parietal Ctx 


83.5 


Control 4 Temporal Ctx 


19.3 


Control (Path) 2 Parietal Ctx 


28.9 


Control (Path) 1 Temporal Ctx 


90.8 


Control (Path) 3 Parietal Ctx 


29.5 


Control (Path) 2 Temporal Ctx 


60.3 


Control (Path) 4 Parietal Ctx 


31.9 



Table BED. General_screening_panel_v1.4 


Tissue Name 


Rel. Exp.(%) 
Ag4145, Run 
221000454 


Tissue Name 


Rel. 

Exp.(%) 
Ag4145, 
Run 

221000454 


Adipose 


0.5 


Renal ca. TK-10 


0.4 


Melanoma* Hs688(A).T 


0.1 


Bladder 


8.1 


Melanoma* Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI-N87 


3.4 


Melanoma* M14 


6.2 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


4.0 


Colon ca. SW-948 


0.2 


Melanoma* SK-MEL-5 


100.0 


Colon ca. SW480 


0.2 


Squamous cell carcinoma SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.3 


Testis Pool 


23.2 


Colon ca. HT29 


0.3 


Prostate ca.* (bone met) PC-3 


0.8 


Colon ca. HCT-116 


5.6 


Prostate Pool 


1.5 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.4 


Uterus Pool 


0.7 


Colon ca. SW1116 


2.7 


Ovarian ca. OVCAR-3 


1.6 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV-3 


1.4 


Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.0 


Colon Pool 


0.9 


Ovarian ca. OVCAR-5 


1.3 


Small Intestine Pool 


2.3 


Ovarian ca. IGROV-1 


1.6 


Stomach Pool 


1.2 


Ovarian ca. OVCAR-8 


4.7 


Bone Marrow Pool 


0.6 


Ovary 


0.5 


Fetal Heart 


1.6 


Breast ca. MCF-7 


2.6 


Heart Pool 


0.4 


Breast ca. MDA-MB-231 


2.8 


Lymph Node Pool 


2.1 


Breast ca. BT 549 


5.8 


Fetal Skeletal Muscle 


1.2 


Breast ca. T47D 


0.9 


Skeletal Muscle Pool 


0.7 


Breast ca. MDA-N 


5.5 


Spleen Pool 


2.5 


Breast Pool 


1.7 


Thymus Pool 


22 ~ 


Trachea 


11.3 


CNS cancer (glio/astro) U87-MG 


0.7 


Lung 


0.0 


CNS cancer (glio/astro) U-118-MG 


1.3 


Fetal Lung 


17.6 


CNS cancer (neuro;met) SK-N-AS 


1.2 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


0.4 


CNS cancer (astro) SNB-75 


6.0 


Lung ca. NCI-H146 


0.2 


CNS cancer (glio) SNB-19 


To" 


Lung ca. SHP-77 


0.8 


CNS cancer (glio) SF-295 


0.7 


Lung ca. A549 


1.5 


Brain (Amygdala) Pool 


1.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


1.3 


Lung ca. NCI-H23 


7.0 


Brain (fetal) 


3.7 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) Pool 


1.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


1.5 


Lung ca. NCI-H522 


0.5 


Brain (Substantia nigra) Pool 


0.5 


Liver 


0.0 


Brain (Thalamus) Pool 


- 1 


Fetal Liver 


1.0 


Brain (whole) 


1.4 


Liver ca. HepG2 


0.3 


Spinal Cord Pool 


1.7 


Kidney Pool 


2.3 


Adrenal Gland 


0.6 


Fetal Kidney 


7.0 


Pituitary gland Pool 


3.5 


Renal ca. 786-0 


0.3 


Salivary Gland 


0.8 


Renal ca. A498 


3.4 


Thyroid (female) 


1.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. CAPAN2 


2.3 


Renal ca. UO-31 


0.9 


Pancreas Pool 


1.7 


Table BEE. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4145, Run 
197470572 


Tissue Name 


Rel. 

Exp.(%) 
Ag4145, 
Run 

197470572 


Secondary Th1 act 


0.0 


HUVEC IL-1beta 


7.5 


Secondary Th2 act 


7.3 


HUVEC IFN gamma 


0.0 


Secondary Tr1 act 


23.0 


HUVEC TNF alpha + IFN gamma 


0.0 


Secondary Th1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 




HUVEC IL-11 


0.0 


Secondary Tr1 rest 


0.0 


Lung Microvascular EC none 


11.3 


Primary Th1 act 


0.0 


Lung Microvascular EC TNFalpha 
+ IL-1beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


8.2 


Primary Tr1 act 


7.9 


Microvascular Dermal EC 
TNFalpha + IL-1beta 


iis.o 


Primary Th1 rest 


11.7 


Bronchial epithelium TNFalpha + 
IL-1beta 


30.6 


Primary Th2 rest 




Small airway epithelium none 


6.3 


Primary Tr1 rest 


7.7 


Small airway epithelium TNFalpha 
+ IL-1beta 


15.5 


CD45RA CD4 lymphocyte act 


ill 


Coronary artery SMC rest 


14.6 


CD45RO CD4 lymphocyte act 


40.1 


Coronary artery SMC TNFalpha + 
IL-1beta 


7.0 






Astrocytes rest 


6.0 


Secondary CD8 lymphocyte 


16.5 


Astrocytes TNFalpha + IL-1beta 


0.0 


Secondary CD8 lymphocyte act 




KU-812 (Basophil) rest 


25.7 


CD4 lymphocyte none 


(To ~~ 


KU-812 (Basophil) PMA/ionomycin 


40.6 


2ry Th1/Th2/Tr1 anti-CD95 
CH11 


8.6 


CCD1106 (Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1beta 


0.0 


LAK cells IL-2 


10.4 


Liver cirrhosis 


41.2 


LAK cells IL-2+IL-12 


11.6 


NCI-H292 none 


3.3 


LAK cells IL-2+IFN gamma 


13.8 


NCI-H292 IL-4 


0.0 


LAK cells IL-2+ IL-18 


16.0 


NCI-H292 IL-9 


0.0 


LAK cells PMA/ionomycin 


62 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


)() 


NCI-H292 IFN gamma 


5.5 


Two Way MLR 3 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 beta 


6.7 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


3.0 


Lung fibroblast TNF alpha + IL-1 
beta 


7.1 


PBMC PWM 


3.0 


Lung fibroblast IL-4 


3.0 


PBMC PHA-L 


S.4 


Lung fibroblast IL-9 


3.0 


Ramos (B cell) none 


)4.6 


Lung fibroblast IL-13 


3.0 


Ramos (B cell) ionomycin 


100.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest 


13.4 


B lymphocytes CD40L and IL-4 


14.4 


Dermal fibroblast CCD1070 TNF 
alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


28.3 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


45.7 


Dermal fibroblast IL-4 


8.3 


Dendritic cells LPS 


16.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti-CD40 


23.5 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


20.2 


Neutrophils rest 


0.0 


Monocytes LPS 


15.7 


Colon 


0.0 


Macrophages rest 


19.1 


Lung 


15.7 


Macrophages LPS 


0.0 


Thymus 


8.2 


HUVEC none 


0.0 


Kidney 


15.6 


HUVEC starved 


0.0 







Table BEF. General oncology screening panel_v_2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4145, Run 
268861697 


Tissue Name 


Rel. Exp.(%) 
Ag4145, Run 
268861697 


Colon cancer 1 


18.8 


Bladder cancer NAT 2 


13.6 


Colon cancer NAT 1 


9.0 


Bladder cancer NAT 3 


0.9 


Colon cancer 2 


14.5 


Bladder cancer NAT 4 


8.4 


Colon cancer NAT 2 


1.7 


Adenocarcinoma of the prostate 1 


0.0 


Colon cancer 3 


11.3 


Adenocarcinoma of the prostate 2 


0.0 


Colon cancer NAT 3 


3.2 


Adenocarcinoma of the prostate 3 


17.6 


Colon malignant cancer 4 


17.6 


Adenocarcinoma of the prostate 4 


0.0 


Colon normal adjacent tissue 4 


4.6 


Prostate cancer NAT 5 


0.0 


Lung cancer 1 


22.1 


Adenocarcinoma of the prostate 6 


1.7 


Lung NAT 1 


1.5 


Adenocarcinoma of the prostate 7 


1.0 


Lung cancer 2 


100.0 


Adenocarcinoma of the prostate 8 


0.0 


Lung NAT 2 


3.7 


Adenocarcinoma of the prostate 9 


0.0 


Squamous cell carcinoma 3 


11.0 


Prostate cancer NAT 10 


0.2 


Lung NAT 3 


4.2 


Kidney cancer 1 


30.1 


metastatic melanoma 1 


2.9 


Kidney NAT 1 


48.0 


Melanoma 2 


0.2 


Kidney cancer 2 


47.3 


Melanoma 3 


3.0 


Kidney NAT 2 


19.1 


metastatic melanoma 4 


0.1 


Kidney cancer 3 


2.9 


metastatic melanoma 5 


0.0 


Kidney NAT 3 


1.7 


Bladder cancer 1 


26.4 


Kidney cancer 4 


0.0 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


_ 


Bladder cancer 2 


1.1 



CNS_neurodegeneration_v1.0 Summary: Ag4145 This panel confirms the 
expression of the CG98164-01 gene at low levels in the brain in an independent group of 
individuals. This gene appears to be slightly upregulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or 
function of this gene may decrease neuronal death and be of use in the treatment of this 
disease. 

General_screening_panel_v1.4 Summary: Ag4145 Highest expression of the 
CG98164-01 gene is seen in a melanoma cell line (CT=30). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of melanoma. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of melanoma. 

In addition, this gene is expressed at much higher levels in fetal lung (CT=32.5) 
when compared to expression in the adult counterpart (CT=40). Thus, expression of this 
gene may be used to differentiate between the fetal and adult source of this tissue. 
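
For orientation only, the CT values quoted above can be converted into approximate fold differences. The following sketch assumes an idealized amplification efficiency of 2 per cycle and assumes that relative expression is expressed as a percentage of the sample with the lowest CT; neither assumption is taken from this section, and the function names are hypothetical.

```python
def fold_difference(ct_lower_expression, ct_higher_expression, efficiency=2.0):
    """Approximate fold difference implied by two CT values.

    Assumption: template doubles every cycle (efficiency=2.0) and both
    samples were loaded with comparable amounts of input material.
    """
    return efficiency ** (ct_lower_expression - ct_higher_expression)


def relative_expression(ct, ct_min, efficiency=2.0):
    """Expression as a percentage of the sample with the lowest CT (assumed scaling)."""
    return 100.0 * efficiency ** (ct_min - ct)


if __name__ == "__main__":
    # Fetal lung (CT=32.5) versus adult lung (CT=40).
    print(round(fold_difference(40.0, 32.5), 1))
    # Fetal lung relative to the melanoma cell line (CT=30).
    print(round(relative_expression(32.5, 30.0), 1))
```

Under these assumptions, fetal lung (CT=32.5) is roughly 180-fold above the adult lung sample (CT=40), and its expression relative to the melanoma cell line (CT=30) comes to about 17.7%, close to the tabulated fetal lung value of 17.6. A CT of 40 commonly reflects no detectable amplification, so the fold difference is best read as a lower bound.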

General_screening_panel_v1.5 Summary: Ag4145 Results from one 
experiment with the CG98164-01 gene are not included. The amp plot indicates that there 
were experimental difficulties with this run. 

Panel 4.1D Summary: Ag4145 Expression of the CG98164-01 gene is restricted 
to ionomycin-treated and untreated samples from the B cell line Ramos (CTs=34.5). B cells 
represent a principal component of immunity and contribute to the immune response in a 
number of important functional roles, including antibody production. Production of 
antibodies against self-antigens is a major component in autoimmune disorders. Since B 
cells play an important role in autoimmunity, inflammatory processes and inflammatory 
cascades, therapeutic modulation of this gene product may reduce or eliminate the 
symptoms of patients suffering from asthma, allergies, chronic obstructive pulmonary 
disease, emphysema, Crohn's disease, ulcerative colitis, rheumatoid arthritis, psoriasis, 
osteoarthritis, systemic lupus erythematosus and other autoimmune disorders. Two 
additional experiments with the same probe and primer show low/undetectable levels of 
expression (CTs>35). (Data not shown.) 

General oncology screening panel_v_2.4 Summary: Ag4145 Highest 
expression of the CG98164-01 gene is detected in lung cancer (CT=29). Significant 
expression of this gene is seen in a number of cancer samples, including colon, lung, 
adenocarcinoma of the prostate, bladder and kidney cancers. Thus, the expression of this 
gene could be used to distinguish colon, lung and prostate cancers from the normal tissues. 
Therapeutic modulation of this gene product may be useful in the treatment of these 
cancers. 

BF. CG99588-01: Novel Transmembrane Protein 

Expression of gene CG99588-01 was assessed using the primer-probe set Ag4148, 
described in Table BFA. Results of the RTQ-PCR runs are shown in Tables BFB, BFC, 
BFD and BFE. 



Table BFA. Probe Name Ag4148 

Primers   Sequences                                    Length   Start Position   SEQ ID No 
Forward   5'-ggtcactgtggtgaagagtga-3'                  21       12               413 
Probe     TET-5'-acccaaactggtgccgttcttcaag-3'-TAMRA    25       36               414 
Reverse   5'-cagccagagcacaaaatacac-3'                  21       70               415 



Table BFB. CNS_neurodegeneration_v1.0 



Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
215309158 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
215309158 


AD 1 Hippo 


24.5 


Control (Path) 3 Temporal Ctx 


9.0 


AD 2 Hippo 


42.9 


Control (Path) 4 Temporal Ctx 


49.0 


AD 3 Hippo 


8.1 


AD 1 Occipital Ctx 


27.2 


AD 4 Hippo 


16.' 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


95.3 


AD 3 Occipital Ctx 


8.6 


AD 6 Hippo 


72.2 


AD 4 Occipital Ctx 


29.1 


Control 2 Hippo 


36.6 


AD 5 Occipital Ctx 


38.7 


Control 4 Hippo 


30.8 


AD 6 Occipital Ctx 


26.1 


Control (Path) 3 Hippo 


9.9 


Control 1 Occipital Ctx 


2.0 


AD 1 Temporal Ctx 


51.8 


Control 2 Occipital Ctx 


64.6 


AD 2 Temporal Ctx 


49.0 


Control 3 Occipital Ctx 


40.6 


AD 3 Temporal Ctx 


11.4 


Control 4 Occipital Ctx 


16.8 


AD 4 Temporal Ctx 


52.5 


Control (Path) 1 Occipital Ctx 


60.3 


AD 5 Inf Temporal Ctx 


100.0 


Control (Path) 2 Occipital Ctx 


25.5 


AD 5 Sup Temporal Ctx 


67.8 


Control (Path) 3 Occipital Ctx 


3.3 


AD 6 Inf Temporal Ctx 


65.1 


Control (Path) 4 Occipital Ctx 


27.0 


AD 6 Sup Temporal Ctx 




Control 1 Parietal Ctx 




Control 1 Temporal Ctx 


18.9 


Control 2 Parietal Ctx 


64.6 


Control 2 Temporal Ctx 


40.6 


Control 3 Parietal Ctx 


19.5 


Control 3 Temporal Ctx 


28.3 


Control (Path) 1 Parietal Ctx 


71.7 


Control 4 Temporal Ctx 


23.5 


Control (Path) 2 Parietal Ctx 


34.9 


Control (Path) 1 Temporal 
Ctx 


63.3 


Control (Path) 3 Parietal Ctx 


10.9 


Control (Path) 2 Temporal 
Ctx 


53.2 


Control (Path) 4 Parietal Ctx 


50.0 



Table BFC. General_screening_panel_v1.4 



Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 

222181956 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
222181956 


Adipose 




Renal ca. TK-10 




Melanoma* Hs688(A).T 


5.9 


Bladder 


57.4 


Melanoma* Hs688(B).T 


3.4 


Gastric ca. (liver met.) NCI-N87 


32.3 


Melanoma* M14 


6.4 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


4.8 


Colon ca. SW-948 


3.3 


Melanoma* SK-MEL-5 


17.2 


Colon ca. SW480 


8.3 


Squamous cell carcinoma 
SCC-4 


6.4 


Colon ca.* (SW480 met) SW620 


1.7 


Testis Pool 


12.1 


Colon ca. HT29 


3.4 


Prostate ca.* (bone met) 
PC-3 


13.3 


Colon ca. HCT-116 


3.6 


Prostate Pool 


2.6 


Colon ca. CaCo-2 


1.8 


Placenta 


37.4 


Colon cancer tissue 


32.5 


Uterus Pool 


8.8 


Colon ca. SW1116 


3.5 


Ovarian ca. OVCAR-3 


17.8 


Colon ca. Colo-205 


1.5 


Ovarian ca. SK-OV-3 


5.9 


Colon ca. SW-48 


5.8 


Ovarian ca. OVCAR-4 


8.2 


Colon Pool 


19.2 


Ovarian ca. OVCAR-5 


33.7 


Small Intestine Pool 


22.2 


Ovarian ca. IGROV-1 


7.8 


Stomach Pool 


16.5 


Ovarian ca. OVCAR-8 


22.4 


Bone Marrow Pool 


10.5 


Ovary 


58.2 


Fetal Heart 


6.9 


Breast ca. MCF-7 


4.5 


Heart Pool 


11.9 


Breast ca. MDA-MB-231 


3.6 


Lymph Node Pool 


14.6 


Breast ca. BT 549 


1.2 


Fetal Skeletal Muscle 


3.7 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


3.8 


Breast ca. MDA-N 


8.1 


Spleen Pool 


20.6 


Breast Pool 


16.4 


Thymus Pool 


15.6 


Trachea 


11.5 


CNS cancer (glio/astro) U87-MG 


7.1 


Lung 



5.4 


CNS cancer (glio/astro) U-118-MG 


0.6 


Fetal Lung 


14.8 


CNS cancer (neuro;met) SK-N-AS 


26.6 


Lung ca. NCI-N417 


1.1 


CNS cancer (astro) SF-539 


8.8 


Lung ca. LX-1 


6.7 


CNS cancer (astro) SNB-75 


27.0 


Lung ca. NCI-H146 


0.2 


CNS cancer (glio) SNB-19 


4.7 


Lung ca. SHP-77 


2.8 


CNS cancer (glio) SF-295 


13.9 


Lung ca. A549 


1.0 


Brain (Amygdala) Pool 


15.8 


Lung ca. NCI-H526 


6.3 


Brain (cerebellum) 


36.9 


Lung ca. NCI-H23 


2.5 


Brain (fetal) 


29.1 


Lung ca. NCI-H460 


0.5 


Brain (Hippocampus) Pool 


13.2 


Lung ca. HOP-62 


1.0 


Cerebral Cortex Pool 


16.0 


Lung ca. NCI-H522 


28.7 


Brain (Substantia nigra) Pool 


19.9 


Liver 


3.4 


Brain (Thalamus) Pool 


21.0 


Fetal Liver 


12.1 


Brain (whole) 


23.5 


Liver ca. HepG2 


0.6 


Spinal Cord Pool 


18.2 


Kidney Pool 


28.1 


Adrenal Gland 


32.5 


Fetal Kidney 


9.3 


Pituitary gland Pool 


5.2 


Renal ca. 786-0 


3.1 


Salivary Gland 


8.4 


Renal ca. A498 


11.3 


Thyroid (female) 


10.3 


Renal ca. ACHN 


10.4 


Pancreatic ca. CAPAN2 


1.8 


Renal ca. UO-31 


11.8 


Pancreas Pool 


24.0 


Table BFD. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
173123939 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
173123939 


Secondary Th1 act 


0.3 


HUVEC IL-1beta 


1.4 


Secondary Th2 act 


6.4 


HUVEC IFN gamma 


18.6 


Secondary Tr1 act 


1.1 


HUVEC TNF alpha +IFN 
gamma 


3.1 


Secondary Th1 rest 


6.7 


HUVEC TNF alpha +IL4 


1.9 


Secondary Th2 rest 


3.4 


HUVEC IL-11 


2.2 


Secondary Tr1 rest 


11.2 


Lung Microvascular EC none 


10.8 


Primary Th1 act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1beta 


16.6 


Primary Th2 act 


4.2 


Microvascular Dermal EC none 


7.3 


Primary Tr1 act 


1.4 


Microvascular Dermal EC 
TNFalpha + IL-1beta 


3.6 


Primary Th1 rest 


1.1 


Bronchial epithelium TNFalpha 
+ IL-1beta 


6.0 


Primary Th2 rest 


4.0 


Small airway epithelium none 


16.7 


Primary Tr1 rest 


4.2 


Small airway epithelium 
TNFalpha + IL-1 beta 


12.9 


CD45RA CD4 
lymphocyte act 


1.2 


Coronary artery SMC rest 


5.7 


CD45RO CD4 
lymphocyte act 


2.7 


Coronary artery SMC TNFalpha 
+ IL-1beta 


10.5 


CD8 lymphocyte act 


2.4 


Astrocytes rest 


9.3 


Secondary CD8 
lymphocyte rest 


2.1 


Astrocytes TNFalpha + IL-1 beta 


1.1 


Secondary CD8 1 
lymphocyte act 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


1.4 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Th1/Th2/Tr1 anti-CD95 
CH11 


8.1 


CCD1106 (Keratinocytes) none 


8.7 


LAK cells rest 


17.1 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1beta 


2.9 


LAK cells IL-2 


4.5 


Liver cirrhosis 


7.3 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


20.3 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


15.6 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-9 


2g3 — 


LAK cells 
PMA/ionomycin 


1.0 


NCI-H292 IL-13 


21.3 



NK Cells IL-2 rest 


3.7 


NCI-H292 IFN gamma 




Two Way MLR 3 day 


9.5 


HPAEC none 




Two Way MLR 5 day 


2.3 


HPAEC TNF alpha + IL-1 beta 


4.9 


Two Way MLR 7 day 


0.2 


Lung fibroblast none 


3.2 


PBMC rest 


3.6 


Lung fibroblast TNF alpha + 
IL-1 beta 


0.0 


PBMC PWM 


1.7 


Lung fibroblast IL-4 


1.7 


PBMC PHA-L 


3.0 


Lung fibroblast IL-9 


1.1 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


1.0 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest 


3.7 


B lymphocytes CD40L 
and IL-4 


2.8 


Dermal fibroblast CCD1070 
TNF alpha 


7.2 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1 


2.8 


EOL-1 dbcAMP 
PMA/ionomycin 


2.0 



Dermal fibroblast IFN gamma 


1.4 


Dendritic cells none 


71.7 


Dermal fibroblast IL-4 


0.9 


Dendritic cells LPS 


8.7 


Dermal Fibroblasts rest 


1.9 


Dendritic cells anti-CD40 


81.8 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


18.8 


Neutrophils rest 


0.0 


Monocytes LPS 


8.4 


Colon 


8.8 


Macrophages rest 


56.6 


Lung 


47.3 


Macrophages LPS 


1.0 


Thymus 


22.8 


HUVEC none 


2.2 


Kidney 


100.0 


HUVEC starved 


12.2 






Table BFE. General oncology screening panel_v_2.4 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
268623908 


Tissue Name 


Rel. Exp.(%) 
Ag4148, Run 
268623908 


Colon cancer 1 


12.6 


Bladder cancer NAT 2 


0.3 


Colon NAT 1 


7.6 


Bladder cancer NAT 3 


0.1 


Colon cancer 2 


19.1 


Bladder cancer NAT 4 


2.1 


Colon cancer NAT 2 


5.1 


Adenocarcinoma of the prostate 
1 


24.0 


Colon cancer 3 


23.0 


Adenocarcinoma of the prostate 
2 


1.3 


Colon cancer NAT 3 


11.9 


Adenocarcinoma of the prostate 
3 


10.6 


Colon malignant cancer 4 


30.1 


Adenocarcinoma of the prostate 
4 


21.8 


Colon normal adjacent 
tissue 4 


4.1 


Prostate cancer NAT 5 


1.1 


Lung cancer 1 


13.3 


Adenocarcinoma of the prostate 
6 


5.4 


Lung NAT 1 


5.0 


Adenocarcinoma of the prostate 
7 


7.9 


Lung cancer 2 


36.6 


Adenocarcinoma of the prostate 
8 


1.6 


Lung NAT 2 


3.0 


Adenocarcinoma of the prostate 
9 


20.0 




Squamous cell carcinoma 
3 


22.8 


Prostate cancer NAT 10 


0.4 


Lung NAT 3 


1.7 


Kidney cancer 1 


38.4 


metastatic melanoma 1 


33.9 


Kidney NAT 1 


24.0 


Melanoma 2 


8.2 


Kidney cancer 2 


66.9 


Melanoma 3 


2.6 


Kidney NAT 2 


96.6 


metastatic melanoma 4 


96.6 


Kidney cancer 3 


17.3 


metastatic melanoma 5 


100.0 


Kidney NAT 3 


19.2 


Bladder cancer 1 


0.8 


Kidney cancer 4 


8.8 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


38.4 


Bladder cancer 2 


4.6 





CNS_neurodegeneration_v1.0 Summary: Ag4148 This panel confirms the 
expression of the CG99588-01 gene at low levels in the brain in an independent group of 
individuals. This gene is found to be slightly upregulated in the temporal cortex of 
Alzheimer's disease patients. Therefore, therapeutic modulation of the expression or 
function of this gene may decrease neuronal death and be of use in the treatment of this 
disease. 

General_screening_panel_v1.4 Summary: Ag4148 Highest expression of the 
CG99588-01 gene is detected in a breast cancer cell line (CT=30.41). Significant expression 
of this gene is also seen in a cluster of cancer cell lines including CNS, colon, gastric, renal, 
lung, breast, ovarian, prostate, squamous cell carcinoma, and melanoma cell lines. 
Therefore, therapeutic modulation of this gene product may be useful in the treatment of 
these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at low 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, heart, liver 
and the gastrointestinal tract. Therefore, therapeutic modulation of the activity of this gene 
may prove useful in the treatment of endocrine/metabolically related diseases, such as 
obesity and diabetes. 

In addition, this gene is expressed at moderate levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Ag4148 Results from one experiment (run 220982871) with this gene are not 
included. The amp plot indicates that there were experimental difficulties with this run. 

Panel 4.1D Summary: Ag4148 Highest expression of the CG99588-01 gene is 
detected in kidney (CT=30.3). In addition, moderate to low levels of expression of this 
gene are also seen in primary and secondary Th1, Th2 and Tr1 cells, LAK cells, dendritic 
cells, monocytes, macrophages, endothelial cells, bronchial and small airway epithelial 
cells, coronary artery SMC, NCI-H292, astrocytes and normal tissues represented by colon, 
lung, and thymus. Therefore, therapeutic modulation of the gene product may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4148 Highest 
expression of the CG99588-01 gene is detected in metastatic melanoma (CT=29). In 
addition, significant expression of this gene is also seen in a number of cancer samples 
including kidney, colon, adenocarcinoma of the prostate, lung and bladder cancer. Therefore, 
therapeutic modulation of this gene product through the use of a small molecule drug may 
be beneficial in the treatment of these cancers. 

BG. CG99618-01: Protein-Tyrosine Phosphatase 2C 

Expression of gene CG99618-01 was assessed using the primer-probe set Ag4151, 
described in Table BGA. Results of the RTQ-PCR runs are shown in Tables BGB, BGC 
and BGD. 

Table BGA. Probe Name Ag4151 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-catcatgggcaattaaaagaga-3'                  22       258              416 
Probe     TET-5'-aaaatcctctgaactgtgcagatcct-3'-TAMRA    26       304              417 
Reverse   5'-catgaaaccacctttgagaagt-3'                  22       330              418 



Table BGB. CNS_neurodegeneration_v1.0 



Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
215318747 


Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
215318747 


AD 1 Hippo 


3.7 


Control (Path) 3 Temporal Ctx 


2.0 


AD 2 Hippo 


6.1 


Control (Path) 4 Temporal Ctx 


30.8 


AD 3 Hippo 


0.7 


AD 1 Occipital Ctx 


12.9 


AD 4 Hippo 


3.5 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


3.8 


AD 6 Hippo 


12.4 


AD 4 Occipital Ctx 


15.9 


Control 2 Hippo 


7.3 


AD 5 Occipital Ctx 


S.2 


Control 4 Hippo 


4.0 


AD 6 Occipital Ctx 


17.2 


Control (Path) 3 Hippo 


0.0 


Control 1 Occipital Ctx 


1.1 


AD 1 Temporal Ctx 


3.7 


Control 2 Occipital Ctx 


32.1 


AD 2 Temporal Ctx 


14.1 


Control 3 Occipital Ctx 


7.3 


AD 3 Temporal Ctx 


3.2 


Control 4 Occipital Ctx 


4.9 


AD 4 Temporal Ctx 


17.6 


Control (Path) 1 Occipital Ctx 


49.3 


AD 5 Inf Temporal Ctx 


90.8 


Control (Path) 2 Occipital Ctx 


11.3 


AD 5 Sup Temporal Ctx 


16.8 


Control (Path) 3 Occipital Ctx 


0.0 


AD 6 Inf Temporal Ctx 


0.0 


Control (Path) 4 Occipital Ctx 


19.1 


AD 6 Sup Temporal Ctx 


37.6 


Control 1 Parietal Ctx 




Control 1 Temporal Ctx 


4.8 


Control 2 Parietal Ctx 


26.2 


Control 2 Temporal Ctx 


8.5 


Control 3 Parietal Ctx 


12.4 


Control 3 Temporal Ctx 


7.9 


Control (Path) 1 Parietal Ctx 


35.8 


Control 4 Temporal Ctx 


11.2 


Control (Path) 2 Parietal Ctx 


13.0 


Control (Path) 1 Temporal 
Ctx 


28.1 


Control (Path) 3 Parietal Ctx 


0.0 


Control (Path) 2 Temporal 
Ctx 


31.4 


Control (Path) 4 Parietal Ctx 


31.0 


Table BGC. General_screening_panel_v1.4 


Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
221034281 


Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
221034281 


Adipose 


0.5 


Renal ca.TK-10 


0.0 


Melanoma* Hs688(A).T 


0.9 


Bladder 


1.5 


Melanoma* Hs688(B).T 


0.0 


Gastric ca. (liver met.) NCI-N87 


O.S 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* LOXIMVI 


0.0 


Colon ca. SW-948 


0.2 


Melanoma* SK-MEL-5 


100.0 


Colon ca. SW480 


0.0 


Squamous cell carcinoma 
SCC-4 


0.0 


Colon ca.* (SW480 met) SW620 


0.0 


Testis Pool 


4.2 


Colon ca. HT29 


0.0 


Prostate ca.* (bone met) 
PC-3 


0.4 


Colon ca. HCT-116 


0.3 
Prostate Pool 


4.2 


Colon ca. CaCo-2 


0.1 




Placenta 


1.5 




Colon cancer tissue 


0.4 




Uterus Pool 


0.7 




Colon ca. SW1116 


0.1 




Ovarian ca. OVCAR-3 


1.1 




Colon ca. Colo-205 


0.0 




Ovarian ca. SK-OV-3 


0.5 




Colon ca. SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.5 




Colon Pool 


4.2 




Ovarian ca. OVCAR-5 


0.4 




Small Intestine Pool 


2.2 


Ovarian ca. IGROV-1 


0.0 




Stomach Pool 


0.9 


Ovarian ca. OVCAR-8 


M 




Bone Marrow Pool 


1.0 


Ovary 


3.0 


Fetal Heart 


4.5 


Breast ca. MCF-7 


0.1 


Heart Pool 


1.0 


Breast ca. MDA-MB-231 


0.5 


Lymph Node Pool 


2.1 


Breast ca. BT 549 


3.5 


Fetal Skeletal Muscle 


2.3 


Breast ca. T47D 


0.2 


Skeletal Muscle Pool 


1.6 


Breast ca. MDA-N 


0.2 


Spleen Pool 




Breast Pool 


1.6 


Thymus Pool 


4.0 


Trachea 


1.9 


CNS cancer (glio/astro) U87-MG 


0.8 


Lung 


5.6 


CNS cancer (glio/astro) U-118-MG 


1.4 


Fetal Lung 


4.1 


CNS cancer (neuro;met) SK-N-AS 


0.2 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) SNB-75 


5.4 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) SNB-19 


0.5 


Lung ca. SHP-77 


0.2 


CNS cancer (glio) SF-295 


2.3 


Lung ca. A549 


0.2 


Brain (Amygdala) Pool 


4.3 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.9 


Lung ca. NCI-H23 


1.2 


Brain (fetal) 


2.6 


Lung ca. NCI-H460 


0.5 


Brain (Hippocampus) Pool 


3.8 


Lung ca. HOP-62 


1.5 


Cerebral Cortex Pool 


8.5 


Lung ca. NCI-H522 


0.0 


Brain (Substantia nigra) Pool 


3.9 


Liver 


0.0 


Brain (Thalamus) Pool 


13.6 


Fetal Liver 


0.2 


Brain (whole) 


2.4 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


2.5 


Kidney Pool 


4.7 


Adrenal Gland 


.5 


Fetal Kidney 


2.3 


Pituitary gland Pool 


.9 


Renal ca. 786-0 


2.2 


Salivary Gland 


0.2 


Renal ca. A498 


3.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


• 1 


Pancreatic ca. CAPAN2 


0.5 


Renal ca. UO-31 


.0 


Pancreas Pool 


2.1 
Table BGD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4151,Run 
173124788 


Tissue Name 


Rel. Exp.(%) 
Ag4151, Run 
173124788 


Secondary Th1 act 


0.0 


HUVEC IL-1beta 


0.0 


Secondary Th2 act 


1.6 


HUVEC IFN gamma 


2.6 


Secondary Tr1 act 


0.9 


HUVEC TNF alpha + IFN gamma 


0.0 


Secondary Th1 rest 


0.0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Tr1 rest 


3.2 


Lung Microvascular EC none 


3.3 


Primary Th1 act 


0.0 


Lung Microvascular EC TNFalpha 
+ IL-1beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal EC none 


0.0 


Primary Tr1 act 


0.0 


Microvascular Dermal EC 
TNFalpha + IL-1 beta 


0.0 


Primary Th1 rest 


0.0 


Bronchial epithelium TNFalpha + 
IL1beta 


16.5 


Primary Th2 rest 


0.0 


Small airway epithelium none 


1.0 


Primary Tr1 rest 


0.0 


Small airway epithelium TNFalpha 
+ IL-1 beta 


0.0 


CD45RA CD4 lymphocyte 
act 


0.0 


Coronary artery SMC rest 


0.0 


CD45RO CD4 lymphocyte 
act 


0.0 


Coronary artery SMC TNFalpha + 
IL-1 beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


6.1 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL-1 beta 


2.9 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


2.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) PMA/ionomycin 


3.4 


2ry Th1/Th2/Tr1_anti- 
CD95 CH11 


2.3 


CCD1106 (Keratinocytes) none 


1.8 


LAK cells rest 


0.0 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1 beta 


32.1 


LAK cells IL-2 


4.0 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


1.8 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


3.2 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-9 


1.8 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


( 0 


NCI-H292 IFN gamma 


9.3 


Two Way MLR 3 day 


0.5 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL-1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


5.8 


PBMC rest 


2.0 


Lung fibroblast TNF alpha + IL-1 
beta 


4.1 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


4.6 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IFN gamma 


1.7 


B lymphocytes PWM 


5.5 


Dermal fibroblast CCD1070 rest 


12.4 


B lymphocytes CD40L and 
IL-4 


0.0 


Dermal fibroblast CCD1070 TNF 
alpha 


2.3 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD1070 IL-1 
beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


11.1 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


2.9 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


2.6 


Neutrophils rest 


0.0 


Monocytes LPS 


0.6 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


3.1 


Macrophages LPS 


0.0 


Thymus 


14.6 


HUVEC none 


0.0 


Kidney 


100.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_v1.0 Summary: Ag4151 This panel confirms the 
expression of the CG99618-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_v1.4 Summary: Ag4151 Highest expression of the 
CG99618-01 gene is detected in melanoma SK-MEL-5 cell line (CT=29.6). Therefore, 
expression of this gene can be used to distinguish this sample from other samples in the 
panel. Low expression of this gene is also detected in breast cancer and CNS cancer 
cell lines. Therefore, therapeutic modulation of this gene product may be useful in the 
treatment of melanoma, breast and CNS cancers. 

In addition, this gene is expressed at low levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, cerebral 
cortex, and spinal cord. Therefore, this gene may play a role in central nervous system 
disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, 
schizophrenia and depression. 

Panel 4.1D Summary: Ag4151 Highest expression of the CG99618-01 gene is 
detected in kidney (CT=32.4). Therefore, expression of this gene can be used to 
distinguish kidney from other samples in this panel. In addition, therapeutic modulation of 
this gene product may be useful in the treatment of inflammatory and autoimmune 
diseases that affect kidney such as lupus and glomerulonephritis. 

Low levels of expression of this gene are also seen in TNFalpha + IL-1beta treated 
keratinocytes. Interestingly, expression in the treated cells is higher (CT=34) than in 
the untreated keratinocytes (CT=38). Therefore, expression of this gene can be used to 
distinguish the treated from untreated keratinocytes. In addition, therapeutic modulation of 
this gene product may be useful in the treatment of psoriasis and wound healing. 
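
To illustrate how a CT difference such as the one quoted above for keratinocytes translates into a fold difference in transcript level, the following minimal sketch assumes ideal amplification, in which each PCR cycle doubles the product. Under that assumption a difference of n cycles corresponds to a 2^n-fold difference in starting template. The function name `fold_change` and the `efficiency` parameter are hypothetical and are not taken from the procedure described here.

```python
# Minimal sketch: fold difference implied by two threshold-cycle (CT) values.
# Assumption (not stated in this section): ideal amplification, i.e. the
# product doubles every cycle (efficiency = 2.0).


def fold_change(ct_sample: float, ct_reference: float, efficiency: float = 2.0) -> float:
    """Return how many times more template `ct_sample` implies than `ct_reference`.

    A lower CT means the signal crossed the threshold earlier, i.e. more
    starting template, so the exponent is (ct_reference - ct_sample).
    """
    return efficiency ** (ct_reference - ct_sample)


if __name__ == "__main__":
    treated_ct = 34.0    # TNFalpha + IL-1beta treated keratinocytes
    untreated_ct = 38.0  # untreated keratinocytes
    print(fold_change(treated_ct, untreated_ct))  # 16.0
```

Real amplification efficiencies are usually somewhat below 2.0 per cycle, so the computed value should be read as an upper-end approximation rather than a measured ratio.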

BH. CG99832-01: Novel Gene Containing NUDIX Hydrolase Domain 

Expression of gene CG99832-01 was assessed using the primer-probe set Ag4157, 
described in Table BHA. Results of the RTQ-PCR runs are shown in Tables BHB, BHC, 
BHD and BHE. 



Table BHA. Probe Name Ag4157 

Primers   Sequences                                    Length   Start Position   SEQ ID No 
Forward   5'-agcttgctcgtttgtacatcat-3'                 22       543              419 
Probe     TET-5'-tccaggaattccaaaagacacaaaat-3'-TAMRA   26       565              420 
Reverse   5'-cactcaatgttccgaatttctc-3'                 22       609              421 



Table BHB. CNS_neurodegeneration_v1.0 



Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
215331739 


Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
215331739 


AD 1 Hippo 


11.7 


Control (Path) 3 Temporal Ctx 


5.7 


AD 2 Hippo 


24.8 


Control (Path) 4 Temporal Ctx 


37.1 
AD 3 Hippo 




AD 1 Occipital Ctx 


15.1 


AD 4 Hippo 


5.7 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 hippo 


78.5 


AD 3 Occipital Ctx 


4.0 


AD 6 Hippo 


52.9 


AD 4 Occipital Ctx 


15.6 


Control 2 Hippo 


28.3 


AD 5 Occipital Ctx 


31.6 


Control 4 Hippo 


8.2 


AD 6 Occipital Ctx 


41.2 


Control (Path) 3 Hippo 


10.6 


Control 1 Occipital Ctx 


3.8 


AD 1 Temporal Ctx 


14.7 


Control 2 Occipital Ctx 


52.1 


AD 2 Temporal Ctx 


31.2 


Control 3 Occipital Ctx 


12.4 


AD 3 Temporal Ctx 


4.7 


Control 4 Occipital Ctx 


6.7 


AD 4 Temporal Ctx 


20.4 


Control (Path) 1 Occipital Ctx 


79.6 


AD 5 Inf Temporal Ctx 


100.0 


Control (Path) 2 Occipital Ctx 


9.7 


AD 5 Sup Temporal Ctx 


40.6 


Control (Path) 3 Occipital Ctx 


4.5 


AD 6 Inf Temporal Ctx 


63.3 


Control (Path) 4 Occipital Ctx H 3.2 


AD 6 Sup Temporal Ctx 




Control 1 Parietal Ctx 


6.4 


Control 1 Temporal Ctx 


5.6 


Control 2 Parietal Ctx 


38.7 


Control 2 Temporal Ctx 


40.6 


Control 3 Parietal Ctx 


17.2 


Control 3 Temporal Ctx 


15.5 


Control (Path) 1 Parietal Ctx 


79.6 


Control 4 Temporal Ctx 


5.0 


Control (Path) 2 Parietal Ctx 


28.9 


Control (Path) 1 
Temporal Ctx 


70.7 


Control (Path) 3 Parietal Ctx 


3.6 


Control (Path) 2 
Temporal Ctx 


34.6 


Control (Path) 4 Parietal Ctx 


40.9 


Table BHC. General_screening_panel_v1.4 


Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
221117670 


Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
221117670 


Adipose 


18.2 


Renal ca. TK-10 


28.1 


Melanoma* Hs688(A).T 


I 1 1 


Bladder 


23.8 


Melanoma* Hs688(B).T 


12.1 


Gastric ca. (liver met.) NCI-N87 


23.5 


Melanoma* M14 


32.3 


Gastric ca. KATO III 


2-1 o 


Melanoma* LOXIMVI 


21.5 


Colon ca. SW-948 


(> i 


Melanoma* SK-MEL-5 


41.8 


Colon ca. SW480 


21.8 


Squamous cell 
carcinoma SCC-4 


11.7 


Colon ca.* (SW480 met) SW620 


14.4 


Testis Pool 


8.3 


Colon ca. HT29 


18.4 


Prostate ca.* (bone met) 
PC-3 


13.5 


Colon ca. HCT-116 


51.8 


Prostate Pool 


9.4 


Colon ca. CaCo-2 


80.1 
Placenta 


32.8 


Colon cancer tissue 


14.7 


Uterus Pool 


8.3 


Colon ca. SW1116 


2.7 


Ovarian ca. OVCAR-3 


31.2 


Colon ca. Colo-205 


3.7 


Ovarian ca. SK-OV-3 


25.3 


Colon ca. SW-48 


4.5 


Ovarian ca. OVCAR-4 


14.3 


Colon Pool 


17.3 


Ovarian ca. OVCAR-5 


32.5 


Small Intestine Pool 


17.3 


Ovarian ca. IGROV-1 


15.8 


Stomach Pool 


8.9 


Ovarian ca. OVCAR-8 


11.6 


Bone Marrow Pool 


10.2 


Ovary 


11.4 


Fetal Heart 


IL 5 _ 


Breast ca. MCF-7 


53.6 


Heart Pool 




Breast ca. MDA-MB- 
231 


34.6 


Lymph Node Pool 


17.7 


Breast ca. BT 549 


34.6 


Fetal Skeletal Muscle 


— 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


— 


Breast ca. MDA-N 


16.7 


Spleen Pool 


— - 


Breast Pool 


18.6 


Thymus Pool 


■•~4- - — 


Trachea 


11.0 


CNS cancer (glio/astro) U87-MG 





Lung 


7.4 


CNS cancer (glio/astro) U-118-MG 


20.6 


Fetal Lung 


45.7 


CNS cancer (neuro;met) SK-N-AS 


27.2 


Lung ca. NCI-N417 


9.6 


CNS cancer (astro) SF-539 


15.6 


Lung ca. LX-1 


24.1 


CNS cancer (astro) SNB-75 


31.4 


Lung ca. NCI-H146 


17.1 


CNS cancer (glio)SNB-19 


17.2 


Lung ca. SHP-77 


48.3 


CNS cancer (glio) SF-295 


21.5 


Lung ca. A549 


14.1 


Brain (Amygdala) Pool 


10.5 


Lung ca. NCI-H526 


13.' 


Brain (cerebellum) 


18.3 


Lung ca. NCI-H23 


32.1 


Brain (fetal) 


48.3 


Lung ca. NCI-H460 


10.8 


Brain (Hippocampus) Pool 


13.7 


Lung ca. HOP-62 


18.3 


Cerebral Cortex Pool 


17.9 


Lung ca. NCI-H522 


36.3 


Brain (Substantia nigra) Pool 


12.6 


Liver 


1.7 


Brain (Thalamus) Pool 


20.7 


Fetal Liver 


43.2 


Brain (whole) 


19.2 


Liver ca. HepG2 


17.8 


Spinal Cord Pool 


6.2 


Kidney Pool 


27.0 


Adrenal Gland 


16.2 


Fetal Kidney 


55.5 


Pituitary gland Pool 


4.5 


Renal ca. 786-0 


27.9 


Salivary Gland 


5.4 


Renal ca. A498 


4.9 


Thyroid (female) 


2.8 


Renal ca. ACHN 


8.5 


Pancreatic ca. CAPAN2 


30.4 
Renal ca. UO-31 


12.2 


Pancreas Pool 


16.8 

Table BHD. Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
173123943 


Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
173123943 


Secondary Th1 act 


55.1 


HUVEC IL-1beta 


20.4 


Secondary Th2 act 


54.7 


HUVEC IFN gamma 


28.7 


Secondary Tr1 act 


42.9 


HUVEC TNF alpha + IFN gamma 


15.6 


Secondary Th1 rest 


23.7 


HUVEC TNF alpha + IL4 


12.9 


Secondary Th2 rest 


33.9 


HUVEC IL-11 


24.8 


Secondary Tr1 rest 


23.7 


Lung Microvascular EC none 


24.7 


Primary Th1 act 


29.7 


Lung Microvascular EC TNFalpha 
+ IL-1beta 


19.5 


Primary Th2 act 


44.8 


Microvascular Dermal EC none 


11.1 


Primary Tr1 act 


36.1 


Microvascular Dermal EC 
TNFalpha + IL-1beta 


11.7 


Primary Th1 rest 


38.4 


Bronchial epithelium TNFalpha + 
IL1beta 


19.9 


Primary Th2 rest 


44.4 


Small airway epithelium none 


4.5 


Primary Tr1 rest 


48.0 


Small airway epithelium TNFalpha 
+ IL-1beta 


12.9 


CD45RA CD4 
lymphocyte act 


21.2 


Coronary artery SMC rest 


9.3 


CD45RO CD4 
lymphocyte act 


46.3 


Coronary artery SMC TNFalpha + 
IL-1beta 


11.2 


CD8 lymphocyte act 


28.7 


Astrocytes rest 


11.1 


Secondary CD8 
lymphocyte rest 


27.7 


Astrocytes TNFalpha + IL-1 beta 


10.9 


Secondary CD8 
lymphocyte act 


27.7 


KU-812 (Basophil) rest 


39.0 


CD4 lymphocyte none 


16.8 


KU-812 (Basophil) 
PMA/ionomycin 


52.9 


2ry Th1/Th2/Tr1_anti- 
CD95 CH11 


52.1 


CCD1106 (Keratinocytes) none 


13.0 


LAK cells rest 


34.2 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1 beta 


14.3 


LAK cells IL-2 


37.4 


Liver cirrhosis 


7.2 


LAK cells IL-2+IL-12 


22.4 


NCI-H292 none 


11.2 


LAK cells IL-2+IFN 
gamma 


36.1 


NCI-H292 IL-4 


23.8 


LAK cells IL-2+ IL-18 


41.8 


NCI-H292 IL-9 


21.3 
LAK cells 
PMA/ionomycin 


39.8 


NCI-H292 IL-13 


21.9 


NK Cells IL-2 rest 


54.7 


NCI-H292 IFN gamma 


]67 - 

' 


Two Way MLR 3 day 


43.2 


HPAEC none 




Two Way MLR 5 day 


44.8 


HPAEC TNF alpha + IL-1 beta 


-~ 


Two Way MLR 7 day 


35.4 


Lung fibroblast none 


7.6 


PBMC rest 


20.2 


Lung fibroblast TNF alpha + IL-1 
beta 


7.7 


PBMC PWM 


22.5 


Lung fibroblast IL-4 


7.3 


PBMC PHA-L 


33.7 


Lung fibroblast IL-9 


9.5 


Ramos (B cell) none 


75.8 


Lung fibroblast IL-13 


8.5 


Ramos (B cell) 
ionomycin 


64.2 


Lung fibroblast IFN gamma 


7.6 


B lymphocytes PWM 


35.4 


Dermal fibroblast CCD1070 rest 


17.6 


B lymphocytes CD40L 
and IL-4 


48.6 


Dermal fibroblast CCD1070 TNF 
alpha 


52.5 


EOL-1 dbcAMP 


54.3 


Dermal fibroblast CCD1070 IL-1 
beta 


13.5 


EOL-1 dbcAMP 
PMA/ionomycin 


86.5 


Dermal fibroblast IFN gamma 


15.8 


Dendritic cells none 


58.6 


Dermal fibroblast IL-4 


------ - 


Dendritic cells LPS 


40.6 


Dermal Fibroblasts rest 


— 


Dendritic cells anti- 
CD40 


50.3 


Neutrophils TNFa+LPS 


28.7 


Monocytes rest 


52.1 


Neutrophils rest 


87.1 


Monocytes LPS 


100.0 


Colon 


6.9 


Macrophages rest 


38.4 


Lung 


11.6 


Macrophages LPS 


25.2 


Thymus 


66.4 


HUVEC none 


15.8 


Kidney 


19.9 


HUVEC starved 


21.0 







Table BHE. General oncology screening panel_v_2.4 



Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
268624005 


Tissue Name 


Rel. Exp.(%) 
Ag4157, Run 
268624005 


Colon cancer 1 


12.8 


Bladder cancer NAT 2 


0.4 


Colon NAT 1 


4.8 


Bladder cancer NAT 3 


1.8 


Colon cancer 2 


26.1 


Bladder cancer NAT 4 


6.6 


Colon cancer NAT 2 


5.6 


Adenocarcinoma of the prostate 1 


34.9 


Colon cancer 3 


24.8 


Adenocarcinoma of the prostate 2 


3.5 


Colon cancer NAT 3 


13.2 


Adenocarcinoma of the prostate 3 


9.0 
Colon malignant cancer 
4 


34.4 


Adenocarcinoma of the prostate 4 


34.4 


Colon normal adjacent 
tissue 4 


5.0 


Prostate cancer NAT 5 


3.6 


Lung cancer 1 


11.7 


Adenocarcinoma of the prostate 6 


3.8 


Lung NAT 1 


2.5 


Adenocarcinoma of the prostate 7 


5.4 


Lung cancer 2 


66.9 


Adenocarcinoma of the prostate 8 


1.9 


Lung NAT 2 


4.0 


Adenocarcinoma of the prostate 9 


17.3 


Squamous cell 
carcinoma 3 


18.7 


Prostate cancer NAT 10 


3.3 


Lung NAT 3 


1.1 


Kidney cancer 1 


15.2 


metastatic melanoma 1 


23.0 


Kidney NAT 1 


8.5 


Melanoma 2 


2.1 


Kidney cancer 2 


67.4 


Melanoma 3 


5.3 


Kidney NAT 2 


11.2 


metastatic melanoma 4 


92.0 


Kidney cancer 3 


12.5 


metastatic melanoma 5 


100.0 


Kidney NAT 3 


5.0 


Bladder cancer 1 


2.1 


Kidney cancer 4 


16.8 


Bladder cancer NAT 1 


0.0 


Kidney NAT 4 


4.2 


Bladder cancer 2 


7.1 







CNS_neurodegeneration_v1.0 Summary: Ag4157 This panel confirms the 
expression of the CG99832-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_v1.4 Summary: Ag4157 Highest expression of the 
CG99832-01 gene is detected in breast cancer T47D cell line (CT=29). Moderate levels of 
expression of this gene are seen in a cluster of cancer cell lines including pancreatic, CNS, colon, 
gastric, renal, lung, breast, ovarian, prostate, squamous cell carcinoma, and melanoma 
cell lines. Therefore, therapeutic modulation of this gene may be useful in the treatment of 
these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at 
moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 
Interestingly, this gene is expressed at much higher levels in fetal (CT=30) when 
compared to adult liver (CT=34.8). This observation suggests that expression of this gene 
can be used to distinguish fetal from adult liver. In addition, the relative overexpression of 
this gene in fetal liver suggests that the protein product may enhance liver growth or development 
in the fetus and thus may also act in a regenerative capacity in the adult. Therefore, 
therapeutic modulation of the protein encoded by this gene could be useful in treatment of 
liver related diseases. 
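
The relation between the CT values quoted in this summary and the percentages listed in Table BHC can be sketched as follows. This is an illustrative approximation only: it assumes that the sample with the lowest CT in a panel (here the T47D cell line, CT=29) is set to 100% and that the product doubles with every cycle. The function name `relative_expression_percent` is hypothetical. Because the CT values in the text are rounded, the results only roughly track the tabulated values (43.2 for fetal liver and 1.7 for adult liver).

```python
# Minimal sketch: relative expression (%) from CT values, normalized to the
# sample with the lowest CT in the panel.
# Assumptions (labelled, not taken from the source): doubling per cycle and
# normalization to the highest-expressing sample.


def relative_expression_percent(ct: float, ct_highest_sample: float,
                                efficiency: float = 2.0) -> float:
    """Return expression as a percentage of the highest-expressing sample."""
    return 100.0 * efficiency ** (ct_highest_sample - ct)


if __name__ == "__main__":
    ct_t47d = 29.0         # highest-expressing sample in the panel
    ct_fetal_liver = 30.0  # CT quoted for fetal liver
    ct_adult_liver = 34.8  # CT quoted for adult liver

    fetal = relative_expression_percent(ct_fetal_liver, ct_t47d)
    adult = relative_expression_percent(ct_adult_liver, ct_t47d)
    print(f"fetal liver: {fetal:.1f}%")  # 50.0%
    print(f"adult liver: {adult:.1f}%")  # about 1.8%
    print(f"fetal/adult ratio: {fetal / adult:.1f}")
```

The fetal-to-adult ratio computed this way depends only on the CT difference between the two liver samples, not on the choice of the reference sample.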

In addition, this gene is expressed at moderate levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag4157 Highest expression of the CG99832-01 gene is 
detected in LPS treated monocytes (CT=29). This gene is expressed at high to moderate 
levels in a wide range of cell types of significance in the immune response in health and 
disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, 
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene 
product may be involved in homeostatic processes for these and other cell types and 
tissues. This pattern is in agreement with the expression profile in 
General_screening_panel_v1.4 and also suggests a role for the gene product in cell 
survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

General oncology screening panel_v_2.4 Summary: Ag4157 Highest 
expression of the CG99832-01 gene is detected in metastatic melanoma (CT=29). Higher 
expression of this gene is seen in cancer samples including colon cancer, kidney cancer, 
lung cancer, prostate adenocarcinoma and melanoma. Therefore, therapeutic modulation 
of this gene may be useful in the treatment of these cancers. 

BI. CG99842-01: Tensin-Like 

Expression of gene CG99842-01 was assessed using the primer-probe set Ag4158, 
described in Table BIA. Results of the RTQ-PCR runs are shown in Tables BIB, BIC and 
BID. 



Table BIA. Probe Name Ag4158 

Primers   Sequences                                     Length   Start Position   SEQ ID No 
Forward   5'-aagggataccaagtgagaaagc-3'                  22       221              422 
Probe     TET-5'-ccttcagttaaacaaaggggtacatca-3'-TAMRA   27       245              423 
Reverse   5'-tgttccaattgtcacctgattt-3'                  22       293              424 



Table BIB. CNS_neurodegeneration_v1.0 



Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
215337521 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
215337521 


AD 1 Hippo 


10.7 


Control (Path) 3 Temporal Ctx 


11.7 


AD 2 Hippo 


28.3 


Control (Path) 4 Temporal Ctx 


36.3 


AD 3 Hippo 


13.4 


AD 1 Occipital Ctx 


23.0 


AD 4 Hippo 


15.3 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


84.7 


AD 3 Occipital Ctx 


9.2 


AD 6 Hippo 


92.7 


AD 4 Occipital Ctx 


13.4 


Control 2 Hippo 


26.2 


AD 5 Occipital Ctx 


20.4 


Control 4 Hippo 


25.5 


AD 6 Occipital Ctx 


27.9 


Control (Path) 3 Hippo 


11.8 


Control 1 Occipital Ctx 


11.2 


AD 1 Temporal Ctx 


14.8 


Control 2 Occipital Ctx 


39.2 


AD 2 Temporal Ctx 


35.1 


Control 3 Occipital Ctx 


31.2 


AD 3 Temporal Ctx 


11.1 


Control 4 Occipital Ctx 


7.6 


AD 4 Temporal Ctx 


16.0 


Control (Path) 1 Occipital Ctx 


100.0 


AD 5 Inf Temporal Ctx 


87.1 


Control (Path) 2 Occipital Ctx 


15.2 


AD 5 Sup Temporal Ctx 


44.1 


Control (Path) 3 Occipital Ctx 


6.9 


AD 6 Inf Temporal Ctx 


91.4 


Control (Path) 4 Occipital Ctx 


30.8 


AD 6 Sup Temporal Ctx 


89.5 


Control 1 Parietal Ctx 


8.2 


Control 1 Temporal Ctx 


10.8 


Control 2 Parietal Ctx 


48.3 


Control 2 Temporal Ctx 


13.8 


Control 3 Parietal Ctx 


4.9 


Control 3 Temporal Ctx 


n.-j 


Control (Path) 1 Parietal Ctx 


37.1 


Control 4 Temporal Ctx 


7.6 


Control (Path) 2 Parietal Ctx 


30.8 


Control (Path) 1 
Temporal Ctx 


44.8 


Control (Path) 3 Parietal Ctx 


13.1 


Control (Path) 2 
Temporal Ctx 


45.1 


Control (Path) 4 Parietal Ctx 


54.0 



Table BIC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
221117871 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
221117871 


Adipose 


8.5 


Renal ca. TK-10 


32.1 


Melanoma* Hs688(A).T 


16.7 


Bladder 


26.4 


Melanoma* Hs688(B).T 


16.3 


Gastric ca. (liver met.) NCI-N87 


56.6 


Melanoma* M14 


11.9 


Gastric ca. KATO III 


54.7 


Melanoma* LOXIMVI 


30.1 


Colon ca. SW-948 


11.4 


Melanoma* SK-MEL-5 


29.9 


Colon ca. SW480 


36.3 


Squamous cell 
carcinoma SCC-4 


12.8 


Colon ca.* (SW480 met) SW620 


38.7 


Testis Pool 


5.0 


Colon ca. HT29 


18.7 


Prostate ca.* (bone met) L 
PC-3 f lA 


Colon ca. HCT-116 


100.0 


Prostate Pool 


7.5 


Colon ca. CaCo-2 


31.0 


Placenta 


3.5 


Colon cancer tissue 


18.3 


Uterus Pool 


6.7 


Colon ca. SW1116 


tl 


Ovarian ca. OVCAR-3 


26.8 


Colon ca. Colo-205 


il 


Ovarian ca. SK-OV-3 


36.6 


Colon ca. SW-48 


i? 


Ovarian ca. OVCAR-4 


6.4 


Colon Pool 


1H — 


Ovarian ca. OVCAR-5 


36.6 


Small Intestine Pool 




Ovarian ca. IGROV-1 


9.5 


Stomach Pool 




Ovarian ca. OVCAR-8 


5.0 


Bone Marrow Pool 


9.2 


Ovary 


9.7 


Fetal Heart 


5.2 


Breast ca. MCF-7 


48.6 


Heart Pool 


6.7 


Breast ca. MDA-MB- 
231 


34.6 


Lymph Node Pool 


17.7 


Breast ca. BT 549 


36.6 


Fetal Skeletal Muscle 


5.1 


Breast ca. T47D 


52.9 


Skeletal Muscle Pool 


11.2 


Breast ca. MDA-N 


io.; 


Spleen Pool 


15.4 


Breast Pool 


13.5 


Thymus Pool 


14.3 


Trachea 


13.8 


CNS cancer (glio/astro) U87-MG 


34.4 


Lung 


10.5 


CNS cancer (glio/astro) U-118-MG 


62.4 


Fetal Lung 


24.5 


CNS cancer (neuro;met) SK-N-AS 


28.7 
Lung ca. NCI-N417 


1.9 


CNS cancer (astro) SF-539 


10.2 


Lung ca. LX-1 


50.0 


CNS cancer (astro) SNB-75 


28.1 


Lung ca. NCI-H146 


11.1 


CNS cancer (glio) SNB-19 


6.6 


Lung ca. SHP-77 


38.4 


CNS cancer (glio) SF-295 


41.2 


Lung ca. A549 


39.0 


Brain (Amygdala) Pool 


7.8 


Lung ca. NCI-H526 


1.8 


Brain (cerebellum) 


8.2 


Lung ca. NCI-H23 


40.3 


Brain (fetal) 


18.6 


Lung ca. NCI-H460 


40.9 


Brain (Hippocampus) Pool 


8.7 


Lung ca. HOP-62 


10.5 


Cerebral Cortex Pool 


7.2 


Lung ca. NCI-H522 


17.6 


Brain (Substantia nigra) Pool 


4.8 


Liver 


0.0 


Brain (Thalamus) Pool 


14.6 


Fetal Liver 


12.5 


Brain (whole) 


6.4 


Liver ca. HepG2 


18.3 


Spinal Cord Pool 


9.5 


Kidney Pool 


33.7 


Adrenal Gland 


12.5 


Fetal Kidney 


26.1 


Pituitary gland Pool 


3.9 


Renal ca. 786-0 


20.4 


Salivary Gland 


4.1 


Renal ca. A498 


3.5 


Thyroid (female) 


2.2 


Renal ca. ACHN 


28.5 


Pancreatic ca. CAPAN2 


50.0 


Renal ca. UO-31 


19.8 


Pancreas Pool 


22.4 


Table BID. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
173123945 


Tissue Name 


Rel. Exp.(%) 
Ag4158, Run 
173123945 


Secondary Th1 act 


46.0 


HUVEC IL-1beta 


17.3 


Secondary Th2 act 


63.3 


HUVEC IFN gamma 


14.9 


Secondary Tr1 act 


35.4 


HUVEC TNF alpha + IFN gamma 


14.3 


Secondary Th1 rest 


7.0 


HUVEC TNF alpha + IL4 


15.8 


Secondary Th2 rest 


13.2 


HUVEC IL-11 


12.2 


Secondary Tr1 rest 


11 " 


Lung Microvascular EC none 


22.5 


Primary Th1 act 


55.9 


Lung Microvascular EC TNFalpha 
+ IL-1beta 


27.4 


Primary Th2 act 


81.8 


Microvascular Dermal EC none 


15.7 


Primary Tr1 act 


51.1 


Microvascular Dermal EC 
TNFalpha + IL-1beta 


10.0 


Primary Th1 rest 


9.5 


Bronchial epithelium TNFalpha + 
IL1beta 


12.5 


Primary Th2 rest 


5.7 


Small airway epithelium none 


5.2 


Primary Tr1 rest 


20.2 


Small airway epithelium 
TNFalpha + IL-1beta 


18.0 
CD45RA CD4 
lymphocyte act 


31.2 


Coronary artery SMC rest 


3.3 


CD45RO CD4 
lymphocyte act 


52.5 


Coronary artery SMC TNFalpha + 
IL-1beta 


6.9 


CD8 lymphocyte act 


43.8 


Astrocytes rest 


9.7 


Secondary CD8 
lymphocyte rest 


40.3 


Astrocytes TNFalpha + IL-1 beta 


7.1 


Secondary CD8 
lymphocyte act 


13.7 


KU-812 (Basophil) rest 


16.8 


CD4 lymphocyte none 


13.0 


KU-812 (Basophil) 
PMA/ionomycin 


14.8 


2ry Th1/Th2/Tr1_anti- 
CD95 CH11 


20.6 


CCD1106 (Keratinocytes) none 




LAK cells rest 


12.6 


CCD1106 (Keratinocytes) 
TNFalpha + IL-1beta 


12.0 


LAK cells IL-2 


21.8 


Liver cirrhosis 


1.4 


LAK cells IL-2+IL-12 


20.3 


NCI-H292 none 


21.5 


LAK cells IL-2+IFN 
gamma 


26.2 


NCI-H292 IL-4 


33.4 


LAK cells IL-2+ IL-18 




NCI-H292 IL-9 


43.2 


LAK cells 
PMA/ionomycin 


42.0 


NCI-H292 IL-13 


31.2 


NK Cells IL-2 rest 


33.7 


NCI-H292 IFN gamma 


~-°- 8 ~- 


Two Way MLR 3 day 


32.1 


HPAEC none 


ill 


Two Way MLR 5 day 


24.8 


HPAEC TNF alpha + IL-1 beta 




Two Way MLR 7 day 


18.8 


Lung fibroblast none 


12.3 


PBMC rest 


8 j 


Lung fibroblast TNF alpha + IL-1 
beta 


4.6 


PBMC PWM 


30.4 


Lung fibroblast IL-4 


8.2 


PBMC PHA-L 


27.5 


Lung fibroblast IL-9 


13.4 


Ramos (B cell) none 


59.5 


Lung fibroblast IL-13 


11.8 


Ramos (B cell) 
ionomycin 


45.4 


Lung fibroblast IFN gamma 


15.5 


B lymphocytes PWM 


25.3 


Dermal fibroblast CCD1070 rest 


25.0 


B lymphocytes CD40L 
and IL-4 


24.0 


Dermal fibroblast CCD1070 TNF 
alpha 


34.9 


EOL-1 dbcAMP 


29.5 


Dermal fibroblast CCD1070 IL-1 
beta 


9.7 


EOL-1 dbcAMP 
PMA/ionomycin 


18.6 


Dermal fibroblast IFN gamma 


9.3 


Dendritic cells none 


12.1 


Dermal fibroblast IL-4 


17.8 


Dendritic cells LPS 


9.9 


Dermal Fibroblasts rest 


3.9 


Dendritic cells anti-CD40 


14.0 


Neutrophils TNFa+LPS 


6.7 


Monocytes rest 


15.0 


Neutrophils rest 


8.4 


Monocytes LPS 


27.5 


Colon 


6.4 


Macrophages rest 


16.7 


Lung 


6.7 


Macrophages LPS 


4.4 


Thymus 


34.6 


HUVEC none 


9.2 


Kidney 


100.0 


HUVEC starved 


13.9 







CNS_neurodegeneration_v1.0 Summary: Ag4158 This panel confirms the 
expression of the CG99842-01 gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_v1.4 Summary: Ag4158 Highest expression of the
CG99842-01 gene is detected in the colon cancer HCT-116 cell line (CT=30.7). Moderate
expression of this gene is associated with a cluster of cancer cell lines including pancreatic,
CNS, colon, gastric, renal, lung, breast, ovarian, prostate, squamous cell carcinoma and
melanoma cancer cell lines. Therefore, therapeutic modulation of this gene product may
be useful in the treatment of these cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at low
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, fetal liver and the gastrointestinal tract. Therefore, therapeutic modulation
of the activity of this gene may prove useful in the treatment of endocrine/metabolically
related diseases, such as obesity and diabetes.

Interestingly, this gene is expressed at much higher levels in fetal liver (CT=33.7) than
in adult liver (CT=40). This observation suggests that expression of this gene
can be used to distinguish fetal from adult liver. In addition, the relative overexpression of
this gene in fetal liver suggests that the protein product may enhance liver growth or
development in the fetus and thus may also act in a regenerative capacity in the adult.
Therefore, therapeutic modulation of the protein encoded by this gene could be useful in
treatment of liver related diseases.

In addition, this gene is expressed at low to moderate levels in all regions of the
central nervous system examined, including amygdala, hippocampus, substantia nigra,
thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a
role in central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.
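
By way of illustration only, a difference in CT values such as the fetal liver (CT=33.7)
versus adult liver (CT=40) comparison may be converted into an approximate fold
difference. The following is a minimal sketch, assuming that each PCR cycle doubles the
amplicon (100% efficiency) and that relative expression is scaled so that the sample with
the lowest CT equals 100%. The function names are hypothetical and are not taken from
this disclosure. If a CT of 40 reflects the last cycle run rather than a true threshold
crossing, the resulting figure is only a lower bound.

```python
def fold_difference(ct_low: float, ct_high: float, efficiency: float = 1.0) -> float:
    """Approximate fold excess of template in the low-CT sample.

    efficiency=1.0 is an assumption (perfect doubling per cycle).
    """
    return (1.0 + efficiency) ** (ct_high - ct_low)


def relative_expression_percent(cts: dict) -> dict:
    """Scale each sample so the lowest-CT sample is 100% (assumed 2-fold per cycle)."""
    best = min(cts.values())
    return {name: 100.0 * 2.0 ** (best - ct) for name, ct in cts.items()}


# CT values quoted in the summary above.
liver = {"Fetal liver": 33.7, "Adult liver": 40.0}

fold = fold_difference(liver["Fetal liver"], liver["Adult liver"])
scaled = relative_expression_percent(liver)

print(f"Fetal vs adult liver: about {fold:.0f}-fold")
for name, pct in scaled.items():
    print(f"{name}: {pct:.1f}%")
```

Under these assumptions a difference of 6.3 cycles corresponds to a fold difference of
roughly 2^6.3; a lower amplification efficiency would give a smaller figure.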

Panel 4.1D Summary: Ag4158 Highest expression of the CG99842-01 gene is
detected in kidney (CT=31). This gene is expressed at high to moderate levels in a wide
range of cell types of significance in the immune response in health and disease. These
cells include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney.
This ubiquitous pattern of expression suggests that this gene product may be involved in
homeostatic processes for these and other cell types and tissues. This pattern is in
agreement with the expression profile in General_screening_panel_v1.4 and also suggests
a role for the gene product in cell survival and proliferation. Therefore, modulation of the
gene product with a functional therapeutic may lead to the alteration of functions
associated with these cell types and lead to improvement of the symptoms of patients
suffering from autoimmune and inflammatory diseases such as asthma, allergies,
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and
osteoarthritis.

BJ. CG99944-01: ABC Transporter

Expression of gene CG99944-01 was assessed using the primer-probe set Ag4184, 
described in Table BJA. Results of the RTQ-PCR runs are shown in Table BJB. 

Table BJA. Probe Name Ag4184

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-agctaaaaggggaagacatcac-3' | 22 | 965 | 425
Probe | TET-5'-aaaacctcgaaagatcactgaacatg-3'-TAMRA | 26 | 993 | 426
Reverse | 5'-cttctggcacatgtcctacac-3' | 21 | 1019 | 427
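
The entries of Table BJA can be cross-checked against one another. The following is a
minimal sketch, assuming that each "Start Position" is the leftmost coordinate of the
oligonucleotide's footprint on the sense strand of the target; that reading is an
assumption and is not stated in the table. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Oligo:
    name: str
    sequence: str  # written 5'->3', as listed in Table BJA
    length: int    # "Length" column
    start: int     # "Start Position" column


AG4184 = [
    Oligo("Forward", "agctaaaaggggaagacatcac", 22, 965),
    Oligo("Probe", "aaaacctcgaaagatcactgaacatg", 26, 993),
    Oligo("Reverse", "cttctggcacatgtcctacac", 21, 1019),
]


def lengths_match(oligos) -> bool:
    """True when every listed length equals the length of the listed sequence."""
    return all(len(o.sequence) == o.length for o in oligos)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a lowercase DNA sequence."""
    comp = {"a": "t", "c": "g", "g": "c", "t": "a"}
    return "".join(comp[base] for base in reversed(seq))


def amplicon_span(oligos):
    """(first, last, length) of the region covered, under the stated assumption."""
    first = min(o.start for o in oligos)
    last = max(o.start + o.length - 1 for o in oligos)
    return first, last, last - first + 1


print(lengths_match(AG4184))
print(amplicon_span(AG4184))
# Sense-strand sequence of the region bound by the reverse primer.
print(reverse_complement(AG4184[2].sequence))
```

Under this reading the probe footprint ends immediately before the reverse primer
footprint begins, which is consistent with, but does not prove, the assumed coordinate
convention.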



Table BJB. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4184, Run 268695207 | Tissue Name | Rel. Exp.(%) Ag4184, Run 268695207
Colon cancer 1 | 0.0 | Bladder cancer NAT 2 | 0.0
Colon NAT 1 | 0.0 | Bladder cancer NAT 3 | 0.0
Colon cancer 2 | 0.0 | Bladder cancer NAT 4 | 0.0
Colon cancer NAT 2 | 0.0 | Adenocarcinoma of the prostate 1 | 0.0
Colon cancer 3 | 0.0 | Adenocarcinoma of the prostate 2 | 0.0
Colon cancer NAT 3 | 0.0 | Adenocarcinoma of the prostate 3 | 0.0
Colon malignant cancer 4 | 0.0 | Adenocarcinoma of the prostate 4 | 0.0
Colon normal adjacent tissue 4 | 0.0 | Prostate cancer NAT 5 | 0.0
Lung cancer 1 | 0.0 | Adenocarcinoma of the prostate 6 | 0.0
Lung NAT 1 | 0.0 | Adenocarcinoma of the prostate 7 |
Lung cancer 2 | 0.0 | Adenocarcinoma of the prostate 8 | 0.0
Lung NAT 2 | 0.0 | Adenocarcinoma of the prostate 9 | 0.0
Squamous cell carcinoma 3 | 0.0 | Prostate cancer NAT 10 | 0.0
Lung NAT 3 | 0.0 | Kidney cancer 1 | 0.0
metastatic melanoma 1 | 0.0 | Kidney NAT 1 | 0.0
Melanoma 2 | 100.0 | Kidney cancer 2 | 0.0
Melanoma 3 | 0.0 | Kidney NAT 2 | 0.0
metastatic melanoma 4 | 0.0 | Kidney cancer 3 | 0.0
metastatic melanoma 5 | 0.0 | Kidney NAT 3 | 0.0
Bladder cancer 1 | 0.0 | Kidney cancer 4 | 0.0
Bladder cancer NAT 1 | 0.0 | Kidney NAT 4 | 0.0
Bladder cancer 2 | 0.0 | |







CNS_neurodegeneration_v1.0 Summary: Ag4184 Results from one experiment
with the CG99944-01 gene are not included. The amp plot indicates that there were
experimental difficulties with this run.

General_screening_panel_v1.4 Summary: Ag4184 Expression of the CG99944-01
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not
shown).

Panel 4.1D Summary: Ag4184 Expression of the CG99944-01 gene is
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown).

General oncology screening panel_v_2.4 Summary: Ag4184 Expression of the
CG99944-01 gene is detected exclusively in a melanoma sample (CT=34). Therefore,
expression of this gene can be used to distinguish this sample from the other samples
used in this panel, and therapeutic modulation of this gene product may be
beneficial in the treatment of melanoma.

BK. CG99963-01: Cyclophilin 18

Expression of gene CG99963-01 was assessed using the primer-probe set Ag4160, 
described in Table BKA. Results of the RTQ-PCR runs are shown in Tables BKB, BKC, 
BKD and BKE. 



Table BKA. Probe Name Ag4160

Primers | Sequences | Length | Start Position | SEQ ID No
Forward | 5'-ggcaagaccagcaagaagat-3' | 20 | 545 | 428
Probe | TET-5'-caccattgctgactgtggacaactct-3'-TAMRA | 26 | 565 | 429
Reverse | 5'-aaaggaatggtctggtggtt-3' | 20 | 617 | 430



Table BKB. CNS_neurodegeneration_v1.0

Tissue Name | Rel. Exp.(%) Ag4160, Run 215342806 | Tissue Name | Rel. Exp.(%) Ag4160, Run 215342806
AD 1 Hippo | 13.4 | Control (Path) 3 Temporal Ctx | 6.5
AD 2 Hippo | 30.4 | Control (Path) 4 Temporal Ctx | 35.6
AD 3 Hippo | 8.0 | AD 1 Occipital Ctx | 20.0
AD 4 Hippo | 8.5 | AD 2 Occipital Ctx (Missing) | 0.0
AD 5 Hippo | 84.7 | AD 3 Occipital Ctx | 6.0
AD 6 Hippo | 37.9 | AD 4 Occipital Ctx | 25.5
Control 2 Hippo | 64.2 | AD 5 Occipital Ctx | 84.7
Control 4 Hippo | 7.9 | AD 6 Occipital Ctx | 29.3
Control (Path) 3 Hippo | 8.0 | Control 1 Occipital Ctx | 4.4
AD 1 Temporal Ctx | 13.5 | Control 2 Occipital Ctx | 98.6
AD 2 Temporal Ctx | 40.1 | Control 3 Occipital Ctx | 14.9
AD 3 Temporal Ctx | 5.0 | Control 4 Occipital Ctx | 8.1
AD 4 Temporal Ctx | 23.5 | Control (Path) 1 Occipital Ctx | 100.0
AD 5 Inf Temporal Ctx | 71.2 | Control (Path) 2 Occipital Ctx | 11.5
AD 5 Sup Temporal Ctx | | Control (Path) 3 Occipital Ctx | 3.7
AD 6 Inf Temporal Ctx | 41.5 | Control (Path) 4 Occipital Ctx | 11.3
AD 6 Sup Temporal Ctx | 39.2 | Control 1 Parietal Ctx | 7.0
Control 1 Temporal Ctx | 5.6 | Control 2 Parietal Ctx | 25.0
Control 2 Temporal Ctx | 65.1 | Control 3 Parietal Ctx | 23.3
Control 3 Temporal Ctx | 18.8 | Control (Path) 1 Parietal Ctx | 94.0
Control 3 Temporal Ctx | 6.6 | Control (Path) 2 Parietal Ctx | 26.2
Control (Path) 1 Temporal Ctx | 67.4 | Control (Path) 3 Parietal Ctx | 4.7
Control (Path) 2 Temporal Ctx | 45.4 | Control (Path) 4 Parietal Ctx | 49.3



Table BKC. General_screening_panel_v1.4

Tissue Name | Rel. Exp.(%) Ag4160, Run 221297229 | Tissue Name | Rel. Exp.(%) Ag4160, Run 221297229
Adipose | 5.0 | Renal ca. TK-10 | 25.3
Melanoma* Hs688(A).T | 20.9 | Bladder | 13.5
Melanoma* Hs688(B).T | 16.6 | Gastric ca. (liver met.) NCI-N87 | 20.6
Melanoma* M14 | 31.6 | Gastric ca. KATO III | 75.8
Melanoma* LOXIMVI | 25.5 | Colon ca. SW-948 | 14.7
Melanoma* SK-MEL-5 | 52.9 | Colon ca. SW480 | 77.4
Squamous cell carcinoma SCC-4 | 27.0 | Colon ca.* (SW480 met) SW620 | 33.9
Testis Pool | 2.7 | Colon ca. HT29 | 30.8
Prostate ca.* (bone met) PC-3 | 25.0 | Colon ca. HCT-116 | 66.4
Prostate Pool | 3.7 | Colon ca. CaCo-2 | 27.4
Placenta | 1.7 | Colon cancer tissue | 14.4
Uterus Pool | 2.6 | Colon ca. SW1116 | 8.0
Ovarian ca. OVCAR-3 | 16.7 | Colon ca. Colo-205 | 7.8
Ovarian ca. SK-OV-3 | 18.0 | Colon ca. SW-48 | 13.2
Ovarian ca. OVCAR-4 | 14.1 | Colon Pool | 8.1
Ovarian ca. OVCAR-5 | 52.5 | Small Intestine Pool | 3.0
Ovarian ca. IGROV-1 | 27.4 | Stomach Pool | 1.6
Ovarian ca. OVCAR-8 | 25.5 | Bone Marrow Pool | 3.7
Ovary | 5.8 | Fetal Heart | 5.5
Breast ca. MCF-7 | | Heart Pool | 4.2
Breast ca. MDA-MB-231 | | Lymph Node Pool | 5.1
Breast ca. BT 549 | 45.4 | Fetal Skeletal Muscle | 4.2
Breast ca. T47D | 100.0 | Skeletal Muscle Pool | 1.9
Breast ca. MDA-N | 14.5 | Spleen Pool | 2.6
Breast Pool | 3.9 | Thymus Pool | 6.7
Trachea | 4.8 | CNS cancer (glio/astro) U87-MG | 40.9
Lung | 1.9 | CNS cancer (glio/astro) U-118-MG | 54.7
Fetal Lung | 11.2 | CNS cancer (neuro;met) SK-N-AS | 30.4
Lung ca. NCI-N417 | 10.4 | CNS cancer (astro) SF-539 | 28.7
Lung ca. LX-1 | 31.0 | CNS cancer (astro) SNB-75 | 51.4
Lung ca. NCI-H146 | 9.0 | CNS cancer (glio) SNB-19 | 16
Lung ca. SHP-77 | 42.3 | CNS cancer (glio) SF-295 | 28.9
Lung ca. A549 | 26.8 | Brain (Amygdala) Pool | 7.2
Lung ca. NCI-H526 | 10.7 | Brain (cerebellum) | 10.3
Lung ca. NCI-H23 | 20.4 | Brain (fetal) | 9.2
Lung ca. NCI-H460 | 9.3 | Brain (Hippocampus) Pool | 6.7
Lung ca. HOP-62 | 13.8 | Cerebral Cortex Pool | 9.4
Lung ca. NCI-H522 | 22.5 | Brain (Substantia nigra) Pool | 7.9
Liver | 1.6 | Brain (Thalamus) Pool | 6.8
Fetal Liver | 11.8 | Brain (whole) | 6.3
Liver ca. HepG2 | 9.6 | Spinal Cord Pool | 3.3
Kidney Pool | 7.8 | Adrenal Gland | 9.1
Fetal Kidney | | Pituitary gland Pool | 1.9
Renal ca. 786-0 | 20.0 | Salivary Gland | 3.7
Renal ca. A498 | 8.7 | Thyroid (female) | 3.8
Renal ca. ACHN | 13.6 | Pancreatic ca. CAPAN2 | 30.8
Renal ca. UO-31 | 21.6 | Pancreas Pool | 10.4


Table BKD. Panel 4.1D

Tissue Name | Rel. Exp.(%) Ag4160, Run 173118878 | Tissue Name | Rel. Exp.(%) Ag4160, Run 173118878
Secondary Th1 act | 57.8 | HUVEC IL-1beta | 70.7
Secondary Th2 act | 48.3 | HUVEC IFN gamma | 51.4
Secondary Tr1 act | 53.6 | HUVEC TNF alpha + IFN gamma | 31.4
Secondary Th1 rest | 12.3 | HUVEC TNF alpha + IL4 | 36.6
Secondary Th2 rest | 13.0 | HUVEC IL-11 | 30.4
Secondary Tr1 rest | 14.9 | Lung Microvascular EC none | 58.2
Primary Th1 act | 58.2 | Lung Microvascular EC TNFalpha + IL-1beta | 48.3
Primary Th2 act | 64.2 | Microvascular Dermal EC none | 43.2
Primary Tr1 act | 68.3 | Microvascular Dermal EC TNFalpha + IL-1beta | 39.8
Primary Th1 rest | 25.7 | Bronchial epithelium TNFalpha + IL1beta | 37.4
Primary Th2 rest | 14.5 | Small airway epithelium none | 22.8
Primary Tr1 rest | 29.5 | Small airway epithelium TNFalpha + IL-1beta | 42.3
CD45RA CD4 lymphocyte act | 47.6 | Coronary artery SMC rest | 33.0
CD45RO CD4 lymphocyte act | 77.4 | Coronary artery SMC TNFalpha + IL-1beta | 27.4
CD8 lymphocyte act | 69.7 | Astrocytes rest | 18.4
Secondary CD8 lymphocyte rest | 53.6 | Astrocytes TNFalpha + IL-1beta | 17.8
Secondary CD8 lymphocyte act | 24.0 | KU-812 (Basophil) rest | 43.8
CD4 lymphocyte none | | KU-812 (Basophil) PMA/ionomycin | 49.3
2ry Th1/Th2/Tr1_anti-CD95 CH11 | 24.8 | CCD1106 (Keratinocytes) none | 51.1
LAK cells rest | 28.7 | CCD1106 (Keratinocytes) TNFalpha + IL-1beta | 49.0
LAK cells IL-2 | 31.2 | Liver cirrhosis | 7.6
LAK cells IL-2+IL-12 | 26.1 | NCI-H292 none | 71.2
LAK cells IL-2+IFN gamma | 28.7 | NCI-H292 IL-4 | 90.8
LAK cells IL-2+IL-18 | 26.8 | NCI-H292 IL-9 | 100.0
LAK cells PMA/ionomycin | 38.4 | NCI-H292 IL-13 | 85.9
NK Cells IL-2 rest | 28.7 | NCI-H292 IFN gamma |
Two Way MLR 3 day | 26.1 | HPAEC none |
Two Way MLR 5 day | 44.4 | HPAEC TNF alpha + IL-1 beta |
Two Way MLR 7 day | 30.4 | Lung fibroblast none | 29.3
PBMC rest | 9.6 | Lung fibroblast TNF alpha + IL-1 beta | 20.4
PBMC PWM | 53.2 | Lung fibroblast IL-4 |
PBMC PHA-L | 47.0 | Lung fibroblast IL-9 | 52.5
Ramos (B cell) none | 82.9 | Lung fibroblast IL-13 | 28.7
Ramos (B cell) ionomycin | 81.2 | Lung fibroblast IFN gamma | 43.5
B lymphocytes PWM | 43.2 | Dermal fibroblast CCD1070 rest | 47.0
B lymphocytes CD40L and IL-4 | 37.6 | Dermal fibroblast CCD1070 TNF alpha | 62.4
EOL-1 dbcAMP | 36.9 | Dermal fibroblast CCD1070 IL-1 beta | 33.2
EOL-1 dbcAMP PMA/ionomycin | 28.5 | Dermal fibroblast IFN gamma | 30.6
Dendritic cells none | 39.5 | Dermal fibroblast IL-4 | 56.6
Dendritic cells LPS | 27.7 | Dermal Fibroblasts rest | 42.3
Dendritic cells anti-CD40 | 26.1 | Neutrophils TNFa+LPS | 3.7
Monocytes rest | 11.7 | Neutrophils rest | 5.4
Monocytes LPS | 14.9 | Colon | 7.1
Macrophages rest | 37.4 | Lung | 13.1
Macrophages LPS | 18.3 | Thymus | 20.7
HUVEC none | 37.9 | Kidney | 40.1
HUVEC starved | 51.1 | |






Table BKE. General oncology screening panel_v_2.4

Tissue Name | Rel. Exp.(%) Ag4160, Run 268624163 | Tissue Name | Rel. Exp.(%) Ag4160, Run 268624163
Colon cancer 1 | 32.5 | Bladder cancer NAT 2 | 0.5
Colon cancer NAT 1 | 11.6 | Bladder cancer NAT 3 | 0.3
Colon cancer 2 | 52.9 | Bladder cancer NAT 4 | 6.2
Colon cancer NAT 2 | 16.6 | Adenocarcinoma of the prostate 1 | 22.7
Colon cancer 3 | 85.9 | Adenocarcinoma of the prostate 2 | 1.6
Colon cancer NAT 3 | 19.1 | Adenocarcinoma of the prostate 3 | 9.0
Colon malignant cancer 4 | 100.0 | Adenocarcinoma of the prostate 4 |
Colon normal adjacent tissue 4 | 10.0 | Prostate cancer NAT 5 | 6.0
Lung cancer 1 | 39.2 | Adenocarcinoma of the prostate 6 |
Lung NAT 1 | 0.8 | Adenocarcinoma of the prostate 7 | 6.4
Lung cancer 2 | 82.4 | Adenocarcinoma of the prostate 8 | 2.2
Lung NAT 2 | 1.9 | Adenocarcinoma of the prostate 9 | 27.5
Squamous cell carcinoma 3 | 18.3 | Prostate cancer NAT 10 | 0.9
Lung NAT 3 | 1.7 | Kidney cancer 1 | 15.0
metastatic melanoma 1 | 36.6 | Kidney NAT 1 | 5.1
Melanoma 2 | 5.3 | Kidney cancer 2 | 59.0
Melanoma 3 | 6.7 | Kidney NAT 2 | 15.2
metastatic melanoma 4 | 22.5 | Kidney cancer 3 | 20.6
metastatic melanoma 5 | 72.7 | Kidney NAT 3 | 4.9
Bladder cancer 1 | 3.6 | Kidney cancer 4 | 20.0
Bladder cancer NAT 1 | 3.0 | Kidney NAT 4 | 11.0
Bladder cancer 2 | 0.2 | |





CNS_neurodegeneration_v1.0 Summary: Ag4160 This panel confirms the
expression of the CG99963-01 gene at low levels in the brain in an independent group of
individuals. This gene is found to be slightly down-regulated in the temporal cortex of
Alzheimer's disease patients. Therefore, up-regulation of this gene or its protein product,
or treatment with specific agonists for this protein may be useful in reversing the neuronal
death and dementia/memory loss associated with this disease.

The CG99963-01 gene codes for a cyclophilin 18 (Cyclophilin A; CyP-A) homolog.
CyP-A, a soluble cytoplasmic immunophilin, is known for its involvement in T cell
differentiation and proliferation. Although CyP-A has a pivotal role in the immune
response, it is most highly concentrated in brain. It is known to play a role in neuronal
differentiation and proliferation of human embryonic brain cells (Nahreini et al., 2001,
Cell Mol Neurobiol 21(1):65-79, PMID: 11440199). Therefore, therapeutic modulation of
the CyP-A like protein encoded by this gene may be useful in treatment of neurological
disorders.

General_screening_panel_v1.4 Summary: Ag4160 Highest expression of the
CG99963-01 gene is detected in a breast cancer T47D cell line (CT=21). High expression
of this gene is also seen in a cluster of cancer cell lines including melanoma, squamous cell
carcinoma, pancreatic, CNS, colon, gastric, renal, breast, ovarian, and prostate cancer cell
lines. Therefore, therapeutic modulation of this gene through the use of a small molecule
drug may be beneficial in the treatment of these cancers.

Among tissues with metabolic or endocrine function, this gene is expressed at high
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the
activity of this gene may prove useful in the treatment of endocrine/metabolically related
diseases, such as obesity and diabetes.

In addition, this gene is expressed at high levels in all regions of the central
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus,
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in
central nervous system disorders such as Alzheimer's disease, Parkinson's disease,
epilepsy, multiple sclerosis, schizophrenia and depression.

Panel 4.1D Summary: Ag4160 Highest expression of the CG99963-01 gene is
detected in IL-9 treated NCI-H292 cells (CT=23). This gene is expressed at high to
moderate levels in a wide range of cell types of significance in the immune response in
health and disease. These cells include members of the T-cell, B-cell, endothelial cell,
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial
and fibroblast cell types from lung and skin, and normal tissues represented by colon,
lung, thymus and kidney. This ubiquitous pattern of expression suggests that this gene
product may be involved in homeostatic processes for these and other cell types and
tissues. This pattern is in agreement with the expression profile in
General_screening_panel_v1.4 and also suggests a role for the gene product in cell
survival and proliferation. Therefore, modulation of the gene product with a functional
therapeutic may lead to the alteration of functions associated with these cell types and lead
to improvement of the symptoms of patients suffering from autoimmune and inflammatory
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus,
psoriasis, rheumatoid arthritis, and osteoarthritis.

General oncology screening panel_v_2.4 Summary: Ag4160 Highest
expression of the CG99963-01 gene is detected in malignant colon cancer (CT=23). In
addition, high expression of this gene is also detected in a number of cancer samples
including squamous cell care

OTHER EMBODIMENTS

Although particular embodiments have been disclosed herein in detail, this has been done
by way of example for purposes of illustration only, and is not intended to be limiting with
respect to the scope of the appended claims, which follow. In particular, it is
contemplated by the inventors that various substitutions, alterations, and modifications
may be made to the invention without departing from the spirit and scope of the invention
as defined by the claims. The choice of nucleic acid starting material, clone of interest, or
library type is believed to be a matter of routine for a person of ordinary skill in the art
with knowledge of the embodiments described herein. Other aspects, advantages, and
modifications are considered to be within the scope of the following claims. The claims
presented are representative of the inventions disclosed herein. Other, unclaimed
inventions are also contemplated. Applicants reserve the right to pursue such inventions
in later claims.

What is claimed is:

1. An isolated polypeptide comprising the mature form of an amino acid sequence
selected from the group consisting of SEQ ID NO:2n, wherein n is an integer
between 1 and 101.

2. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO:2n, wherein n is an integer between 1 and 101.

3. An isolated polypeptide comprising an amino acid sequence which is at least 95% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:2n, wherein n is an integer between 1 and 101.

4. An isolated polypeptide, wherein the polypeptide comprises an amino acid 
sequence comprising one or more conservative substitutions in the amino acid 
sequence selected from the group consisting of SEQ ID NO:2n, wherein n is an 
integer between 1 and 101. 

5. The polypeptide of claim 1 wherein said polypeptide is naturally occurring. 

6. A composition comprising the polypeptide of claim 1 and a carrier. 

7. A kit comprising, in one or more containers, the composition of claim 6. 

8. The use of a therapeutic in the manufacture of a medicament for treating a 
syndrome associated with a human disease, the disease selected from a pathology 
associated with the polypeptide of claim 1, wherein the therapeutic comprises the 
polypeptide of claim 1.

9. A method for determining the presence or amount of the polypeptide of claim 1 in
a sample, the method comprising: 

(a) providing said sample; 

(b) introducing said sample to an antibody that binds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 

10. A method for determining the presence of or predisposition to a disease associated
with altered levels of expression of the polypeptide of claim 1 in a first mammalian 
subject, the method comprising: 

a) measuring the level of expression of the polypeptide in a sample from the 
first mammalian subject; and 

b) comparing the expression of said polypeptide in the sample of step (a) to 
the expression of the polypeptide present in a control sample from a second 
mammalian subject known not to have, or not to be predisposed to, said 
disease, 

wherein an alteration in the level of expression of the polypeptide in the first subject as 
compared to the control sample indicates the presence of or predisposition to said disease. 

11. A method of identifying an agent that binds to the polypeptide of claim 1, the
method comprising: 

(a) introducing said polypeptide to said agent; and 

(b) determining whether said agent binds to said polypeptide. 

12. The method of claim 11 wherein the agent is a cellular receptor or a downstream
effector. 

13. A method for identifying a potential therapeutic agent for use in treatment of a 
pathology, wherein the pathology is related to aberrant expression or aberrant 
physiological interactions of the polypeptide of claim 1, the method comprising:

(a) providing a cell expressing the polypeptide of claim 1 and having a
property or function ascribable to the polypeptide; 

(b) contacting the cell with a composition comprising a candidate substance; 
and 

(c) determining whether the substance alters the property or function ascribable 
to the polypeptide; 

whereby, if an alteration observed in the presence of the substance is not observed when 
the cell is contacted with a composition in the absence of the substance, the substance is 
identified as a potential therapeutic agent. 

14. A method for screening for a modulator of activity of or of latency or
predisposition to a pathology associated with the polypeptide of claim 1, said 
method comprising: 

(a) administering a test compound to a test animal at increased risk for a 
pathology associated with the polypeptide of claim 1, wherein said test 
animal recombinantly expresses the polypeptide of claim 1;

(b) measuring the activity of said polypeptide in said test animal after 
administering the compound of step (a); and 

(c) comparing the activity of said polypeptide in said test animal with the 
activity of said polypeptide in a control animal not administered said 
polypeptide, wherein a change in the activity of said polypeptide in said test 
animal relative to said control animal indicates the test compound is a 
modulator of activity of, or of latency or predisposition to, a pathology associated
with the polypeptide of claim 1. 

15. The method of claim 14, wherein said test animal is a recombinant test animal that
expresses a test protein transgene or expresses said transgene under the control of a 
promoter at an increased level relative to a wild-type test animal, and wherein said 
promoter is not the native gene promoter of said transgene. 

16. A method for modulating the activity of the polypeptide of claim 1, the method
comprising contacting a cell sample expressing the polypeptide of claim 1 with a
compound that binds to said polypeptide in an amount sufficient to modulate the
activity of the polypeptide. 

17. A method of treating or preventing a pathology associated with the polypeptide of
claim 1, the method comprising administering the polypeptide of claim 1 to a 
subject in which such treatment or prevention is desired in an amount sufficient to 
treat or prevent the pathology in the subject. 

18. The method of claim 17, wherein the subject is a human.

19. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal a polypeptide in an amount that is sufficient to 
alleviate the pathological state, wherein the polypeptide is a polypeptide having an 
amino acid sequence at least 95% identical to a polypeptide comprising the amino 
acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n is 
an integer between 1 and 101 or a biologically active fragment thereof. 

20. An isolated nucleic acid molecule comprising a nucleic acid sequence selected 
from the group consisting of SEQ ID NO:2n-1, wherein n is an integer between 1
and 101. 

21. The nucleic acid molecule of claim 20, wherein the nucleic acid molecule is
naturally occurring. 

22. A nucleic acid molecule, wherein the nucleic acid molecule differs by a single 
nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
ID NO:2n-1, wherein n is an integer between 1 and 101.

23. An isolated nucleic acid molecule encoding the mature form of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID 
NO:2n, wherein n is an integer between 1 and 101.

24. An isolated nucleic acid molecule comprising a nucleic acid selected from the
group consisting of 2n-1, wherein n is an integer between 1 and 101.

25. The nucleic acid molecule of claim 20, wherein said nucleic acid molecule 
hybridizes under stringent conditions to the nucleotide sequence selected from the 
group consisting of SEQ ID NO:2n-1, wherein n is an integer between 1 and 101,
or a complement of said nucleotide sequence. 

26. A vector comprising the nucleic acid molecule of claim 20. 

27. The vector of claim 26, further comprising a promoter operably linked to said 
nucleic acid molecule. 

28. A cell comprising the vector of claim 26. 

29. An antibody that immunospecifically binds to the polypeptide of claim 1.

30. The antibody of claim 29, wherein the antibody is a monoclonal antibody. 

31. The antibody of claim 29, wherein the antibody is a humanized antibody.

32. A method for determining the presence or amount of the nucleic acid molecule of 
claim 20 in a sample, the method comprising: 

(a) providing said sample; 

(b) introducing said sample to a probe that binds to said nucleic acid molecule;
and

(c) determining the presence or amount of said probe bound to said nucleic 
acid molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said sample. 

33. The method of claim 32 wherein presence or amount of the nucleic acid molecule 
is used as a marker for cell or tissue type.

34. The method of claim 33 wherein the cell or tissue type is cancerous.

35. A method for determining the presence of or predisposition to a disease associated 
with altered levels of expression of the nucleic acid molecule of claim 20 in a first 
mammalian subject, the method comprising: 

a) measuring the level of expression of the nucleic acid in a sample from the 
first mammalian subject; and 

b) comparing the level of expression of said nucleic acid in the sample of step 
(a) to the level of expression of the nucleic acid present in a control sample 
from a second mammalian subject known not to have or not be predisposed 
to, the disease; 

wherein an alteration in the level of expression of the nucleic acid in the first subject as 
compared to the control sample indicates the presence of or predisposition to the disease. 

36. A method of producing the polypeptide of claim 1, the method comprising
culturing a cell under conditions that lead to expression of the polypeptide, wherein 
said cell comprises a vector comprising an isolated nucleic acid molecule 
comprising a nucleic acid sequence selected from the group consisting of SEQ ID 
NO:2n-1, wherein n is an integer between 1 and 101.

37. The method of claim 36 wherein the cell is a bacterial cell. 

38. The method of claim 36 wherein the cell is an insect cell. 

39. The method of claim 36 wherein the cell is a yeast cell. 

40. The method of claim 36 wherein the cell is a mammalian cell. 

41. A method of producing the polypeptide of claim 2, the method comprising 
culturing a cell under conditions that lead to expression of the polypeptide, wherein 
said cell comprises a vector comprising an isolated nucleic acid molecule
comprising a nucleic acid sequence selected from the group consisting of SEQ ID
NO:2n-1, wherein n is an integer between 1 and 101.

42. The method of claim 41 wherein the cell is a bacterial cell. 

43. The method of claim 41 wherein the cell is an insect cell. 

44. The method of claim 41 wherein the cell is a yeast cell. 

45. The method of claim 41 wherein the cell is a mammalian cell. 